Conference Paper

Topic-Based Coordination for Visual Analysis of Evolving Document Collections.

Inst. de Cienc. Mat. e de Comput., Univ. de Sao Paulo, Sao Carlos, Brazil
DOI: 10.1109/IV.2009.31 Conference: 13th International Conference on Information Visualisation, IV 2009, 15-17 July 2009, Barcelona, Spain
Source: DBLP


Document interpretation is a crucial task in many visual analytics applications, made harder by the widespread availability of freely available textual files. In this paper we propose an approach based on topic detection coupled with multiple coordinated views to assist analysis of time varying document collections. Given multiple document maps built from a set of text files, we define a strategy to support users locating the evolution of topics addressed by the documents, along various time steps. The approach is supported by a new algorithm for topic extraction from texts, also introduced. Finally, we show several examples illustrating how the proposed strategy may be applied in the analysis of document collections.

8 Reads
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Scientific articles are the major mechanism for researchers to re-port their results, and a collection of papers on a discipline can reveal a lot about its evolution, such as the emergence of new top-ics. Nonetheless, given a broad collection of papers it is typically very difficult to grasp important information that could help readers to globally interpret, navigate and then focus on the relevant items for their task. Content-based document maps are visual representa-tions created from evaluating the (dis)similarity amongst the doc-uments, and have been shown to support exploratory tasks in this scenario. Documents are represented by visual markers placed in the 2D space so that documents close share similar content. Al-beit the maps allow visually identifying groups of related docu-ments and frontiers between groups, they do not explicitly convey the temporal evolution of a collection. We propose a technique for creating content-based similarity maps of document collections that highlight temporal changes along time. Our solution constructs a sequence of maps from time-stamped sub-sets of the data. It adopts a cumulative backwards strategy to preserve user context across successive time-stamps, i.e., maps do not change drastically from one time stamp to the next, favouring user perception of changes.
    Full-text · Article · Apr 0002
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Context: Systematic mapping provides an overview of a research area to assess the quantity of evidence existing on a topic of interest. In spite of its relevance, the establishment of consistent categories and classification of primary studies in these categories are manually conducted. Objective: We propose an approach, named SM-VTM (Systematic Mapping based on Visual Text Mining), to support categorization and classification stages in the systematic mapping using Visual Text Mining (VTM), aiming at reducing time and effort required in this process. Method: We established SM-VTM, selected a VTM tool and conducted a case study comparing results of two systematic mappings: one performed manually and another using our approach. Results: The results of both systematic mappings were very similar, showing the viability of SM-VTM. Furthermore, since our approach was applied using a tool, reduction of time and effort can be achieved. Conclusions: The application of VTM seems to be very relevant in the context of systematic mapping.
    Full-text · Conference Paper · Apr 2010
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: We review recent visualization techniques aimed at supporting tasks that require the analysis of text documents, from approaches targeted at visually summarizing the relevant content of a single document to those aimed at assisting exploratory investigation of whole collections of documents.Techniques are organized considering their target input material—either single texts or collections of texts—and their focus, which may be at displaying content, emphasizing relevant relationships, highlighting the temporal evolution of a document or collection, or helping users to handle results from a query posed to a search engine.We describe the approaches adopted by distinct techniques and briefly review the strategies they employ to obtain meaningful text models, discuss how they extract the information required to produce representative visualizations, the tasks they intend to support and the interaction issues involved, and strengths and limitations. Finally, we show a summary of techniques, highlighting their goals and distinguishing characteristics. We also briefly discuss some open problems and research directions in the fields of visual text mining and text analytics. © 2012 Wiley Periodicals, Inc. © 2012 Wiley Periodicals, Inc.
    Full-text · Article · Nov 2012 · Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Show more