Andreas Stoffel’s research while affiliated with University of Konstanz and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (27)


Visual Analytics
  • Chapter

January 2016

·

48 Reads

·

3 Citations

Daniel A. Keim

·

Florian Mansmann

·

Andreas Stoffel

·


VisExpress: Visual exploration of differential gene expression data

December 2015

·

76 Reads

·

5 Citations

Information Visualization

·

Sebastian Mittelstädt

·

·

[...]

·

Daniel A. Keim

Biologists are keen to understand how processes in cells react to environmental changes. Differential gene expression analysis allows biologists to explore functions of genes with data generated from different environments. However, these data and analysis lead to unique challenges since tasks are ill-defined, require implicit domain knowledge, comprise large volumes of data, and are, therefore, of explanatory nature. To investigate a scalable visualization-based solution, we conducted a design study with three biologists specialized in differential gene expression analysis. We stress our contributions in three aspects: first, we characterize the problem domain for exploring differential gene expression data and derive task abstractions and design requirements. Second, we investigate the design space and present an interactive visualization system, called VisExpress. Third, we evaluate the usefulness of VisExpress via a Pair Analytics study with real users and real data and report on insights that were gained by our experts with VisExpress.


Integrated visual analysis of patterns in time series and text data - Workflow and application to financial data analysis

April 2015

·

90 Reads

·

17 Citations

Information Visualization

In this article, we describe a workflow and tool that allows a flexible formation of hypotheses about text features and their combinations, which are significantly connected in time to quantitative phenomena observed in stock data. To support such an analysis, we combine the analysis steps of frequent quantitative and text-oriented data using an existing a priori method. First, based on heuristics, we extract interesting intervals and patterns in large time series data. The visual analysis supports the analyst in exploring parameter combinations and their results. The identified time series patterns are then input for the second analysis step, in which all identified intervals of interest are analyzed for frequent patterns co-occurring with financial news. An a priori method supports the discovery of such sequential temporal patterns. Then, various text features such as the degree of sentence nesting, noun phrase complexity, and the vocabulary richness, are extracted from the news items to obtain meta-patterns. Meta-patterns are defined by a specific combination of text features which significantly differ from the text features of the remaining news data. Our approach combines a portfolio of visualization and analysis techniques, including time, cluster, and sequence visualization and analysis functionality. We provide a case study and an evaluation on financial data where we identify important future work. The workflow could be generalized to other application domains such as data analysis of smart grids, cyber physical systems, or the security of critical infrastructure, where the data consist of a combination of quantitative and textual time series data.


Figure 4: Black encircled off-screen clusters depict temporal changes of the epidemic spread.  
Ambient Grids: Maintain Context-Awareness via Aggregated Off-Screen Visualization
  • Conference Paper
  • Full-text available

January 2015

·

59 Reads

·

4 Citations

When exploring large spatial datasets, zooming and panning interactions often lead to the loss of contextual overview. Existing overview-plus-detail approaches allow users to view context while inspecting details, but they often suffer from distortion or overplotting. In this paper, we present an off-screen visualization method called Ambient Grids that strikes the balance between overview and details by preserving the contextual information as color grids within a designated space around the focal area. In addition, we describe methods to generate Ambient Grids for point data using data aggregation and projection. In a use case, we show the usefulness of our technique in exploring the VAST Challenge 2011 microblog dataset.

Download

Fig. 2: Detailed part of the process model including action and cognition paths. Actions can either lead directly to visual analytic components (blue arrows) or to their mappings (blue, dashed arrows). Humans can observe reactions of the system (red arrows) in order to generate findings. 
Fig. 4: Van Wijks model including Green et al.'s changes [13, 14] and labels. 
Fig. 5: Human Cognition Model by Green et al. [13, 14]. 
Knowledge Generation Model for Visual Analytics

December 2014

·

1,796 Reads

·

450 Citations

IEEE Transactions on Visualization and Computer Graphics

Visual analytics enables us to analyze huge information spaces in order to support complex decision making and data exploration. Humans play a central role in generating knowledge from the snippets of evidence emerging from visual data analysis. Although prior research provides frameworks that generalize this process, their scope is often narrowly focused so they do not encompass different perspectives at different levels. This paper proposes a knowledge generation model for visual analytics that ties together these diverse frameworks, yet retains previously developed models (e.g., KDD process) to describe individual segments of the overall visual analytic processes. To test its utility, a real world visual analytics system is compared against the model, demonstrating that the knowledge generation process model provides a useful guideline when developing and evaluating such systems. The model is used to effectively compare different data analysis systems. Furthermore, the model provides a common language and description of visual analytic processes, which can be used for communication between researchers. At the end, our model reflects areas of research that future researchers can embark on.


Methods for Compensating Contrast Effects in Information Visualization

June 2014

·

110 Reads

·

36 Citations

Color, as one of the most effective visual variables, is used in many techniques to encode and group data points according to different features. Relations between features and groups appear as visual patterns in the visualization. However, optical illusions may bias the perception at the first level of the analysis process. For instance, in pixel-based visualizations contrast effects make pixels appear brighter if surrounded by a darker area, which distorts the encoded metric quantity of the data points. Even if we are aware of these perceptual issues, our visual cognition system is not able to compensate these effects accurately. To overcome this limitation, we present a color optimization algorithm based on perceptual metrics and color perception models to reduce physiological contrast or color effects. We evaluate our technique with a user study and find that the technique doubles the accuracy of users comparing and estimating color encoded data values. Since the presented technique can be used in any application without adaption to the visualization itself, we are able to demonstrate its effectiveness on data visualizations in different domains.


Figure 1: The event detection and exploration pipeline used to structure this report.
Figure 2: Distribution of surveyed papers over publication year. The majority of papers were published between 2007 and 2013. We consider only one paper from 2014, because this report was written in early 2014.
State-of-the-Art Report of Visual Analysis for Event Detection in Text Data Streams

June 2014

·

552 Reads

·

53 Citations

Event detection from text data streams has been a popular research area in the past decade. Recently, the evolution of microblogging and social network services opens up great opportunities for various kinds of knowledge-based intelligence activities which require tracking of real-time events. In a sense, visualizations in combination with analytical processes could be a viable method for such tasks because it can be used to analyze the sheer amounts of text streams. However, data analysts and visualization experts often face grand challenges stemming out of the ill-defined concept of event and various kinds of textual data. As a result, we have few guidelines on how to build successful visual analysis tools that can handle specific event types and diverse textual data sources. Our goal is to take the first step towards answering the question by organizing insights from prior research studies on event detection and visual analysis. In the scope of this report, we summarize the evolution of event detection in combination with visual analysis over the past 14 years and provide an overview of the state-of-the-art methods. Our investigation sheds light on various kinds of research areas that can be the most beneficial to the field of visual text event analytics.


Collaborative Data Analysis with Smart Tangible Devices

February 2014

·

789 Reads

·

7 Citations

Proceedings of SPIE - The International Society for Optical Engineering

We present a tangible approach for exploring and comparing multi-dimensional data points collaboratively by combining Sifteo Cubes with glyph visualizations. Various interaction techniques like touching, shaking, moving or rotating the displays support the user in the analysis. Context dependent glyph-like visualization techniques make best use of the available screen space and cube arrangements. As a first proof of concept we apply our approach to real multi-dimensional datasets and show with a coherent use case how our techniques can facilitate the exploration and comparison of data points. Finally, further research directions are shown when combining Sifteo Cubes with glyphs and additional context information provided by multi-touch tables.


Visual analytics for the big data era — A comparative review of state-of-the-art commercial systems

October 2012

·

1,628 Reads

·

186 Citations

Visual analytics (VA) system development started in academic research institutions where novel visualization techniques and open source toolkits were developed. Simultaneously, small software companies, sometimes spin-offs from academic research institutions, built solutions for specific application domains. In recent years we observed the following trend: some small VA companies grew exponentially; at the same time some big software vendors such as IBM and SAP started to acquire successful VA companies and integrated the acquired VA components into their existing frameworks. Generally the application domains of VA systems have broadened substantially. This phenomenon is driven by the generation of more and more data of high volume and complexity, which leads to an increasing demand for VA solutions from many application domains. In this paper we survey a selection of state-of-the-art commercial VA frameworks, complementary to an existing survey on open source VA tools. From the survey results we identify several improvement opportunities as future research directions.


Rolled‐out Wordles: A Heuristic Method for Overlap Removal of 2D Data Representatives

June 2012

·

233 Reads

·

62 Citations

When representing 2D data points with spacious objects such as labels, overlap can occur. We present a simple algorithm which modifies the (Mani-) Wordle idea with scan-line based techniques to allow a better placement. We give an introduction to common placement techniques from different fields and compare our method to these techniques w.r.t. euclidean displacement, changes in orthogonal ordering as well as shape and size preservation. Especially in dense scenarios our method preserves the overall shape better than known techniques and allows a good trade-off between the other measures. Applications on real world data are given and discussed. © 2012 Wiley Periodicals, Inc.


Citations (21)


... Step 4. Coding data using the developed framework and assuring reliability ML is often utilized in tasks such as prediction, classification and clustering, which involve various forms of data processing (Athey, 2018a(Athey, , 2018b. It offers powerful tools for analyzing a collection of texts, such as academic journal papers (Wanner et al., 2014). Moreover, the ML approach offers several advantages, including wider generalizability, increased objectivity, improved replicability, enhanced statistical power and the ability to identify hidden linguistic features (El-Haj et al., 2019). ...

Reference:

Corporate governance characteristics and involvement in ESG activities: current trends and research directions
State-of-the-Art Report of Visual Analysis for Event Detection in Text Data Streams

... Visual analytics is defined as "the science of analytical reasoning facilitated by interactive visual interfaces" [367, p.4]; visual analytics tools and techniques are aimed at helping users (i) synthesize and derive insight from complex and dynamic data; (ii) detect the expected and discover the unexpected; (iii) provide timely, motivated, and understandable assessments; and (iv) support decision-making [367]. Visual analytics goes beyond information visualization, defined as "the use of computer-supported, interactive, visual representations of abstract data to amplify cognition" [86, p.7]. Visual analytics has also been defined as a combination of "automated analysis techniques with interactive visualisations for an effective understanding, reasoning and decision making" [227, p.7]. Visual analytics can be described by the interplay of three main components [229]: data analysis, visualization, and interaction (see Figure 1.1). As such, visual analytics combines the strengths of humans and computers, in particular the enormous computational power of machines to manage and process data with the cognitive and perceptual capabilities of humans to direct the analysis and make sense of the data based on priori domain knowledge. ...

Visual Analytics
  • Citing Chapter
  • January 2016

... 'Overviewing' large, complex, potentially dynamic data spaces presents a challenge in various domains (Hornbaek & Hertzum, 2011). In the context of different monitoring tasks, for example, experts need to gain and maintain an overview of changing (information) situations (Jäckle et al., 2015;Lee et al., 2020;Lienert et al., 2009;Mittelstaedt et al., 2013). Hornbaek & Hertzum describe 'overviewing' as the process of (actively) acquiring (situation) awareness, in the meaning of a "coherent mental picture of what is happening" (Hornbaek & Hertzum, 2011, p. 519). ...

Ambient Grids: Maintain Context-Awareness via Aggregated Off-Screen Visualization

... In general, the AI research into pattern visualisation only offers a static perspective on the discoveries; specifically, the user does not have the opportunity to interactively produce the visualisations that are relevant to them in various levels of abstractions. Recent application of data science has noted the importance of interactive and exploratory tools for knowledge discovery and decision support in genetic data and temporal medical data [55,72]. ...

VisExpress: Visual exploration of differential gene expression data
  • Citing Article
  • December 2015

Information Visualization

... When different measurement items measure the same variable, higher reliability indicates that the observations do not change due to changes in form or time and are fairly stable [28]. In this paper, SPSS 17.0 was used to test the internal reliability of the variables, which yielded the results of Cronbach's  values of the measured items, corrected item-total correlations and the item removed Cronbach's  values for each variable, as shown in Table 3. ...

Integrated visual analysis of patterns in time series and text data - Workflow and application to financial data analysis
  • Citing Article
  • April 2015

Information Visualization

... In other words, it is how we create and validate new knowledge in research. The field of visualization orbits questions of knowledge: how to leverage existing knowledge [29]; how to gain new knowledge through visualizations [132]; and how to communicate knowledge [102]. And yet, for a field so interested in knowledge, there is hardly anything written about epistemology. ...

Knowledge Generation Model for Visual Analytics

IEEE Transactions on Visualization and Computer Graphics

... To achieve this, an appropriate combination of visual data mining algorithms, dynamic queries' mechanisms and the visualization techniques must be accomplished. Numerous studies (Quigley, 2002;Robinson et al., 2005;Lee et al., 2005;Ziegler et al., 2008;Faisal el., 2008;Tory & Staub-French, 2008;Oelke et al., 2008;Strobelt et al., 2009;Mansmann et al., 2009;Wanner et al., 2009;Hao et al., 2010;Keim et al., 2010;Johansson et al., 2010;Wu & Ren, 2010;Roth et al., 2010;Kohlhammer et al., 2010;Yu et al., 2010;Stoffel et al., 2010;Zheng et al., 2011;Koh et al., 2011;Schaefer et al., 2011;Lirong et al., 2011;Simon et al. 2011;Meyer, 2012;Wang et al., 2012;Krstajic, 2012;Stoffel et al., 2012;Zhong et al., 2012;Ning et al., 2012;Pinto et al., 2012Bowen, 2013 in different domains have been undertaken, concerning design of InfoVis frameworks, but none addresses the data on HEIs students' explicit knowledge. Table 1 summarizes these previous studies into domains and references with their respective contributions. ...

AMPLIO VQA – A Web Based Visual Query Analysis System for Micro Grid Energy Mix Planning
  • Citing Article

... Moreover, perceived color distances should correspond to distances in the data, which requires color scales that take the capabilities of human perception into account. For example, Mittelstädt et al. (2014) optimizes color scales to reduce physiological color contrasts, which can considerably improve the identification of data values. To support location tasks, on the other hand, color scales should be designed so that data of interest can be located quickly and easily, ideally pre-attentively (see Healey and Enns, 2012). ...

Methods for Compensating Contrast Effects in Information Visualization
  • Citing Conference Paper
  • June 2014

... While document structures have more degrees of freedom when represented as three-dimensional objects, lower-dimensional representation spaces have their own advantages, notably more spatially compact representation, an absence of occlusion, and they allow to visualize the model from all angles without manipulation. The Ribbons technique ( Figure 6) belongs to this rich class of tiny diagrams, some of which are ubiquitous-the visualization of location and keyword frequency in document search results, located next to the sliding bar of application windows such as a web pages (Hearst 1995)-while others are experimental: document thumbnails with semantic highlighting, variable text size, and selection of significant graphics (Stoffel et al. 2010;Stoffel et al. 2012;Stoffel et al. 2009), a dashboard of topic distribution in documents (Humphreys et al. 2018) depicted as plots of acoustic signals (Favata et al. n.d.), chat logs timelines (Donath & Viégas 2002), line modification frequency in software code (Eick, Steffen, & Summer 1992), or dominant colors in movie frames (Brodbeck 2011). ...

Document Thumbnails with Variable Text Scaling
  • Citing Article
  • June 2012

... LLMs are deep learning driven AI models trained on extensive datasets, allowing them to recognize linguistic patterns, predict contextual meanings, and generate consistent outputs [7] [8]. These models leverage large-scale pretraining followed by fine-tuning on domain-specific datasets to enhance their performance, particularly in the fields of healthcare, finance, and law [9] [10]. Their proficiency in understanding complex queries, reasoning over large knowledge bases, and adapting to various linguistic contexts makes them invaluable in a wide range of automated systems [11] [12]. ...

Visual analytics for the big data era — A comparative review of state-of-the-art commercial systems