Thesis

Visualization and exploration of texts. A theoretical framework and two practical approaches for improvement of digital libraries and information retrieval systems

Abstract and Figures

This research project aims to improve the way people work with textual documents in tasks such as exploring, discovering, searching, filtering, collecting, indexing, comparing, or simply reading. It studies the theoretical foundations and practical uses of text visualization techniques. As a contribution to theory, it presents a classification schema for text visualization approaches based on visual features rather than task-solving capabilities (Paper I). As a contribution to practice, it studies how to improve the interfaces of digital libraries (DL) and introduces two practical proposals for approaching text: Texty (Paper II), for representing a single text, and Area (Paper III), for exploring and overviewing text collections. Finally, the project discusses the contrast between the growing popularity of text visualization, presented as a subfield of data visualization, and the current lack of, and urgent need for, interactive visual interfaces to DL.
Article
Introduction. The presentation of the results page in a search system plays an important role in satisfying a user's information needs. The usual performance management criteria and tools for organising results have limitations that may hinder the satisfaction of those needs. We present Texty as a new approach that can help improve the search experience of users. Method. The corpus of texts to which we applied Texty consisted of papers from Information Research. To filter the texts, we built five groups of words, or vocabularies, on specific fields of knowledge: conceptual approach, experimental approach, qualitative methodology, quantitative methodology and computers/IT. Results. We show how Texty can intrinsically encode and offer its users information about a text that classic alternative representations (mainly bar or line charts) cannot. Conclusions. Texty is a complementary tool that improves intellectual interaction with lists of texts, allowing users to choose texts more effectively by knowing their structure before reading them.
Article
This paper presents a review of approaches to text visualization and exploration. Text visualization and exploration, we argue, constitute a subfield of data visualization, and are fuelled by the advances being made in text analysis research and by the growing amount of accessible data in text format. We propose an original classification for a total of 49 cases based on the visual features of the approaches adopted, identified using an inductive process of analysis. We group the cases (published between 1994 and 2013) into two categories: single-text visualizations and text-collection visualizations, both of which can be explored and compared online.
Article
Although various visual interfaces for digital libraries have been developed in prototypical systems, very few of these visual approaches have been integrated into today's digital libraries. In this position paper we argue that this is most likely because the evaluation results of most visual systems lack comparability. There is no fixed standard for evaluating visual interactive user interfaces. Therefore, it is not possible to identify which approach is more suitable for a given context. We feel that the comparability of evaluation results could be improved by building a common evaluation setup consisting of a reference system, based on a standardized corpus with fixed tasks, and a panel of possible participants.
Article
Information architecture (IA), which is based on the classical principles of traditional Information Science, was born in the late 1990s. This discipline deals with structuring, organizing and tagging the elements of informational environments to facilitate searching for and retrieving the information they contain, thereby improving the usefulness and applications of IA. The main systems or structures that make up a website's architectonic anatomy are organization systems, labeling systems, navigation systems, search systems and controlled vocabularies. Nowadays, the praxis and design of information architecture are centered on user needs.
Article
Research into online catalog use and users has found some pervasive problems with subject searching in these systems. Subject searches too often fail to retrieve anything, and those that do succeed often retrieve "too much" material. This article examines these problems and how they might be remedied. The theoretical principles for the design of effective information retrieval systems are discussed, and an experimental online catalog system based on these principles is described. The system, CHESHIRE, uses a method called "classification clustering," combined with probabilistic retrieval techniques, to provide natural language searching (which helps to reduce search failure) and to provide effective control of "information overload" in subject searching.