Fig 1 - uploaded by Jonas Sebastian Krauss
Content may be subject to copyright.
Screenshot of the temporal distribtion chart. 

Screenshot of the temporal distribtion chart. 

Source publication
Chapter
Full-text available
This paper introduces a novel method to analyze the content of communication in social networks. Content clustering methods are used to extract a taxonomy of concepts from each analyzed communication archive. Those taxonomies are hierarchical categorizations of the concepts discussed in the analyzed communication archives. Concepts are based on ter...

Context in source publication

Context 1
... this implementation the semantic social network analysis [13] module of Condor, the successor of TecFlow [28], is extended. Condor analyzes and visualizes communication and interaction networks. The data representing the network structure needs to be available in electronic format. Condor natively supports different data sources. Email archives can be imported from Eudora, Microsoft Outlook, or directly from an IMAP server. Weblinks and blogs can be accessed via Google’s blogsearch and Microsoft’s live search API. Data gathered with Social Badges can also be loaded into Condor [29]. Not natively supported data sources can be loaded into Condor via flat files or by parsing the data directly into a MySQL database. In extension to a static analysis of interaction networks, Condor and its predecessors can be used to study dynamic networks and their evolution over time. Monitoring social networks over time helps understanding the evolution of relationships in the networks [15]. Condor visualizes the social networks with a spring-embedder model developed by Fruchterman and Reingold [30]. This algorithm enhances Eades method [31] to places the nodes and the edges of a network on a two-dimensional plane. The screenshot in figure 3 shows the resulting structure of the force-directed algorithm. An important feature of Condor is its ability to process and visualize temporal information on social networks. Especially when analyzing the content of the communication the temporal distribution of terms used by actors in the network is of interest. An increased use of terms pooled in certain clusters at one moment in time could point to the topics discussed in the social network during that time. To support users in assessing the prominence of single nodes in the taxonomies and the concepts these nodes represent this implementation allows to view the temporal distribution of each node in a chart. Figure 1 shows an example of such a distribution chart. In order to contrast the cluster scrutinized by the user to all the other clusters in the taxonomy the distribution of all terms is shown in each chart too. Values used in the temporal distribution chart are weighted with the terms’ importance scores as described in section 4.3. By using those importance scores instead of the bare numbers of term occurrences, the betweenness centrality of the actors using the terms is factored in. Utilizing the betweenness based importance of each term punishes those terms and concepts used by less important actors. An additional view on the data in each cluster is the list of the most important documents shown in figure 2. For each document a summary as well as the sender’s name and the submission date is given. The documents are ordered by their importance for the terms in the selected cluster. On the left side of figure 2 an additional window with the content of the selected document can be seen. The evaluation of clustering algorithms can be conducted with statistical measures as introduced in [32]. Those methods assess the quality of the clustering algorithms. However, the quality of the resulting clustering needs to be judged by human users. A reduced Enron dataset 4 is used to show the functionality of the developed module. This dataset consist of emails collected from Enron employees during the Enron scandal. Figure 3 shows the top-level clusters of this dataset. Important facts on the Enron scandal can be grasped at a glance without further knowledge on the background of this dataset. The system identifies Enron’s Executive Vice President Steven J. Kean as one of the key players in the scandal in which attorneys and New York played a crucial role. A combination of information retrieval means and social network analysis techniques is introduced in this paper. The aim is to reveal discussed topics in social networks like email archives and their relationships among each other. Instead of relying only on information retrieval techniques the structure of the underlying social network is taken into account. Foundations of information retrieval and social network analysis are described, techniques of both fields are combined to obtain taxonomies of the topics discussed in the communication of social networks. Instead of solely relying on methods of IR when determining concepts discussed in the communication archives, the structure of the underlying network is respected by factoring in SNA key measures. Unsupervised clustering algorithms are used to scrutinize the content of communication in social networks. Classification of terms with those algorithms is a new approach to gain insights on communication networks. The resulting taxonomies of terms can be used to obtain an overview on the whole communication network at a glance. A temporal analysis module allows assessing the development of discussed topics over time. Finally, the design and details of the implementation are presented. The aim of this paper was to provide users of social network analysis packages like Condor with an automated method to reveal topics discussed in analyzed networks and their hidden relations among each other. First steps were made in this paper to allow users to gain an impression on the nature of discussions in analyzed networks. This work can only serve as a step in the right direction of automatically revealing discussed topics in social networks. Supporting users with enhanced automated or semi-automated methods to analyze the content of social networks is crucial with more and more data available on social ...

Citations

... Park et al. (2015) use a Social Network approach to observe the emergent editing behavior of Wikipedia in different languages. Fuehres et al. (2010) assess the importance of an actor in Wikipedia by computing degree centrality in the net of conversational exchanges. Himelboim et al. (2013) applies SNA to identify the clusters that emerge through the political talk on Twitter and, by integrating this measure with content analysis, the authors observe that Twitter users tend to cluster in politically homogenous groups and thus they are unlikely to be exposed to cross-ideological content. ...
... Mais cette mesure -comme beaucoup d'autres -nécessite une connaissance complète du graphe et souffre d'une complexité de O(n 3 ). Fuehres et al. [27] utilisent la « betweenness centrality » des utilisateurs d'un graphe social pour infléchir la similarité des contenus manipulés par ces derniers. Tang et al. [65] définissent des mesures temporelles sémantiques comme « Temporal Betweenness Centrality » et « Temporal Closeness Centrality » pour situer un utilisateur dans un réseau de communication social. ...
... Il est composé de différents modules qui réalisent des extractions et calculs sur différentes sources de données. Fuehres et al. [27] étendent notamment le module sémantique de Condor pour réaliser des communautés par partitionnement selon les éléments sémantiques détectés. Mais cette extension n'est pas disponible sur le Web. ...
Thesis
Full-text available
With the rise of Web 2.0 and collaborative technologies that are attached to, the Web has now become a broad platform of exchanges between users. The majority of websites is now dedicated to social interactions of their users, or offers tools to develop these interactions. Our work focuses on the understanding of these exchanges, as well as emerging community structures arising, through a semantic approach. To meet the needs of web analysts, we analyze these community structures to identify their essential characteristics as their thematic centers and central contributors. Our semantic analysis is mainly based on reference light ontologies to define several new metrics such as the temporal semantic centrality and the semantic propagation probability. We employ an online approach to monitor user activity in real time in our community analysis tool WebTribe. We have implemented and tested our methods on real data from social communication systems on the Web.
... Initial methods relied on a basic relationship between users like friendship in social networks or answers or citations in communication networks (like forums or emails). For communication networks, the semantics of the message itself is progressively taken into account [7]. In parallel, recent works [25] incorporate the temporal dynamic of messages, but without their semantics. ...
... A recent work [7] obtains a ranking by computing the betweenness centrality on the communication graph. In this approach, there is an unoriented edge between two users if they exchanged a message once. ...
Conference Paper
Full-text available
Understanding communication structures in huge and versatile online communities becomes a major issue. In this paper we propose a new metric, the Semantic Propagation Probability, that characterizes the user's ability to propagate a concept to other users, in a rapid and focused way. The message semantics is analyzed according to a given ontology. We use this metric to obtain the Temporal Semantic Centrality of a user in the community. We propose and evaluate an efficient implementation of this metric, using real-life ontologies and data sets.
... Gloor's group has also been applying LSA (Gloor and Zhao, 2006), semantic SNA (Gloor et al., 2009), and clustering technologies to the network sciences (Fuehres et al., 2011). A successful model has not yet developed, however. ...
... As they concluded, "This work can only serve as a step in the right direction of automatically revealing discussed topics in social networks. Supporting users with enhanced automated or semi-automated methods to analyse the content of social networks is crucial" (Fuehres et al., 2011). ...
Article
Full-text available
In this paper, we examine a methodology for assessing '21st century skills' in the classroom. We have developed a tool named the knowledge building discourse explorer (KBDeX) to facilitate social network analysis-based analysis of collaborative learning discourses. A purpose of this study is to assess the capability of KBDeX as a self-assessment tool for learners. An experimental study was conducted with seven university students, and the results were qualitatively evaluated. This paper presents four cases of students using the tool, and analyses the following: 1) the semantic relationship of words; 2) trend analysis using degree centrality charts; 3) analysis of pivotal points, phases, and characteristics of the discussion; 4) analysis of a social network by using semantics. We confirmed satisfactory results that student findings fit within the theoretical framework of the network analysis and were grounded by discourse data.
Article
Conversations on Twitter create networks with identifiable contours as people reply to and mention one another in their tweets. These conversational structures differ, depending on the subject and the people driving the conversation. Six structures are regularly observed: divided, unified, fragmented, clustered, and inward and outward hub and spoke structures. These are created as individuals choose whom to reply to or mention in their Twitter messages and the structures tell a story about the nature of the conversation.
Article
Full-text available
This paper presents the course design titled "Learning Management" of which the goal is to "learn collaborative learning" for a first-year undergraduate student. The objective of the class design is to help transform the student's belief of learning from a passive, individual model to an active, collaborative model which is supported by the concept of "Knowledge Building" or "Constructive Interaction". We conducted an empirical study where the students analyzed their own discourse with KBDeX which is the software they used to assist their analysis in the experimental group whereas the students in the control group reflected their project activities in their own way. We examined the transformation of their beliefs through the qualitative analysis of their reports after the course. The results showed that the design led to the transforming of their learning beliefs from "just experiences of the participation of the collaborative learning" to "active contribution for collaborative knowledge creation". The course succeeded in changing the students' preferences about collaborative learning from negative to positive as well.