Vedran Sabol

  • Dr.techn.
  • Research Area Manager at Know-Center

About

64
Publications
13,560
Reads
717
Citations
Current institution
Know-Center
Current position
  • Research Area Manager
Additional affiliations
October 2012 - present
Graz University of Technology
Position
  • Lecturer
November 2011 - present
Know-Center
Position
  • Research Area Manager
August 2001 - November 2011
Know-Center
Position
  • Senior Researcher


Publications (64)
Conference Paper
Exploring large datasets and identifying meaningful information is still an active topic in many application fields. Dealing with large datasets is currently not only a matter of simply collecting and structuring data for retrieval, but sometimes it also requires the provision of adequate means for guiding the user through the exploration process....
Article
Full-text available
Whenever users engage in gathering and organizing new information, searching and browsing activities emerge at the core of the exploration process. As the process unfolds and new knowledge is acquired, interest drifts occur inevitably and need to be accounted for. Despite the advances in retrieval and recommender algorithms, real-world interfaces h...
Conference Paper
Full-text available
More and more learning activities take place online in a self-directed manner. Therefore, just as the idea of self-tracking activities for fitness purposes has gained momentum in the past few years, tools and methods for awareness and self-reflection on one's own online learning behavior appear as an emerging need for both formal and informal learn...
Conference Paper
Content based recommender systems are commonly applied to provide automatic support to users searching for relevant information. However, as the retrieved number of resources may grow large, and because the user does not have direct control over the search process, re-finding and analyzing the retrieved information can become a difficult task. We i...
Conference Paper
When using classical search engines, researchers are often confronted with a number of results far beyond what they can realistically manage to read; when this happens, recommender systems can help, by pointing users to the most valuable sources of information. In the course of a long-term research project, research into one area can extend over se...
Article
Full-text available
Graphical interfaces and interactive visualisations are typical mediators between human users and data analytics systems. HCI researchers and developers have to be able to understand both human needs and back-end data analytics. Participants of our tutorial will learn how visualisation and interface design can be combined with data analytics to pro...
Conference Paper
Full-text available
An information landscape is commonly used to represent relatedness in large, high-dimensional datasets, such as text document collections. In this paper we present interactive metaphors, inspired by map reading and visual transitions, that enhance the landscape representation for the analysis of topical changes in dynamic text repositories. The goa...
Conference Paper
Full-text available
Visualizations have a distinctive advantage when dealing with the information overload problem: since they are grounded in basic visual cognition, many people understand them. However, creating them requires specific expertise of the domain and underlying data to determine the right representation. Although there are rules that help generate them,...
Article
Full-text available
Supporting individuals who lack experience or competence to evaluate an overwhelming amount of information such as from cultural, scientific and educational content makes recommender systems invaluable to cope with the information overload problem. However, even recommended information scales up and users still need to consider a large number of items....
Conference Paper
Full-text available
The analysis of temporal relationships in large amounts of graph data has gained significance in recent years. Information providers such as journalists seek to bring order into their daily work when dealing with temporally distributed events and the network of entities, such as persons, organisations or locations, which are related to these even...
Conference Paper
Full-text available
Linked Data has grown to become one of the largest available knowledge bases. Unfortunately, this wealth of data remains inaccessible to those without in-depth knowledge of semantic technologies. We describe a toolchain enabling users without semantic technology background to explore and visually analyse Linked Data. We demonstrate its applicabil...
Conference Paper
Full-text available
Providing easy to use methods for visual analysis of Linked Data is often hindered by the complexity of semantic technologies. On the other hand, semantic information inherent to Linked Data provides opportunities to support the user in interactively analysing the data. This paper provides a demonstration of an interactive, Web-based visualisation...
Conference Paper
Full-text available
Linked Data has grown to become one of the largest available knowledge bases. Unfortunately, this wealth of data remains inaccessible to those without in-depth knowledge of semantic technologies. We describe a toolchain enabling users without semantic technology background to explore and visually analyse Linked Data. We demonstrate its applicabilit...
Conference Paper
Full-text available
The proposed graph visualization method employs hierarchical aggregation of graph nodes and edges, and applies edge bundling on each hierarchy level to reduce clutter and improve the clarity of the representation. Our visualization system enables users to explore large, hierarchically aggregated graphs in a node-link based graph visualization. Hier...
Conference Paper
Linked Open Data has grown into a large and recognised source of data; however, its uptake and commercial exploitation do not yet reflect its potential value. Two factors with potential to contribute to the value of data are correlating previously uncorrelated data and providing answers based on the data. We present a data-centric question answeri...
Conference Paper
Full-text available
Research papers are published in various digital libraries, which deploy their own meta-models and technologies to manage, query, and analyze scientific facts therein. Commonly they only consider the meta-data provided with each article, but not the contents. Hence, reaching into the contents of publications is inherently a tedious task. On top of...
Chapter
Full-text available
Research depends to a large degree on the availability and quality of primary research data, i.e., data generated through experiments and evaluations. While the Web in general and Linked Data in particular provide a platform and the necessary technologies for sharing, managing and utilizing research data, an ecosystem supporting those tasks is stil...
Chapter
Full-text available
Providing means for effectively accessing and exploring large textual data sets is a problem attracting attention of text mining and information visualization experts alike. Rapid growth of the data volume, heterogeneity and richness of metadata, and the dynamic nature of text repositories add to the complexity of the task. This chapter provides an...
Conference Paper
Project managers increasingly rely on software support for monitoring status and development of projects gathered in large portfolios. The status of a project is characterized by a large number of mostly numerical parameters. Various chart visualisations are commonly used to express parameters, provide aggregation and support comparison. We propose...
Conference Paper
Linked Data has become an essential part of the Semantic Web. A lot of Linked Data is already available in the Linked Open Data cloud, which keeps growing due to an influx of new data from research and open government activities. However, it is still quite difficult to access this wealth of semantically enriched data directly without having in-dept...
Conference Paper
Full-text available
Scientific publications constitute an extremely valuable body of knowledge and can be seen as the roots of our civilisation. However, with the exponential growth of written publications, comparing facts and findings between different research groups and communities becomes nearly impossible. In this paper, we present a conceptual approach and a fir...
Conference Paper
Full-text available
Incrementally computed information landscapes are an effective means to visualize longitudinal changes in large document repositories. Resembling tectonic processes in the natural world, dynamic rendering reflects both long-term trends and short-term fluctuations in such repositories. To visualize the rise and decay of topics, the mapping algorithm...
Conference Paper
Ontology alignment is the process of mapping related concepts from different ontologies. A lot of research effort has been invested in development of algorithmic methods supporting automatic discovery of mappings between ontological concepts. However, automatic alignment remains potentially prone to errors especially with large real-world ontologie...
Conference Paper
Full-text available
This paper collates eight expert opinions about Knowledge Visualization, what it is and what it should be. An average of 581 words long, topics span from representation, storytelling and criticizing the lack of theory, to communication, analytics for the masses and reasoning, to trendy Visual Thinking and creativity beyond PowerPoint. These individ...
Article
Full-text available
Semantic technologies are of paramount importance to the future Internet. The reuse and integration of semantically described resources, such as data or services, necessitates the bringing of ontologies into mutual agreement. Ontology alignment deals with the discovery of correspondences between concepts and relations from different ontologies. Ali...
Conference Paper
Automatic generation of taxonomies can be useful for a wide area of applications. In our application scenario a topical hierarchy should be constructed reasonably fast from a large document collection to aid browsing of the data set. The hierarchy should also be used by the InfoSky projection algorithm to create an information landscape visualizati...
Conference Paper
Classifiers can be used to automatically dispatch the abundance of newly created documents to recipients interested in particular topics. Identification of adequate training examples is essential for classification performance, but it may prove to be a challenging task in large document repositories. We propose a classifier hypothesis generation me...
Conference Paper
Full-text available
This paper presents a technique for the visual analysis of topical shifts in dynamically changing textual archives. Our approach is based on the well-known information landscape metaphor, whereby topical changes are represented by changes in landscape topography. Incremental clustering and multi-dimensional scaling algorithms are periodically appli...
Conference Paper
Knowledge discovery involves data driven processes where data is transformed and processed by various algorithms to identify new knowledge. KnowMiner is a service oriented framework providing a rich set of knowledge discovery functionalities with focus on text data sets. Complementing results of automatic machine analysis with the immense processin...
Article
Full-text available
This paper presents a procedure to construct robust taxonomies from natural language German encyclopedic text. Taxonomic relations are extracted through Hearst patterns, validated via search engines and incorporated into a base ontology. By verifying the relations using an external, web-based source of evidence, the procedure extracted an accura...
Article
Full-text available
Services for increasing information quality in unstructured information are central to almost all applications of information management. Service-oriented computing is a promising paradigm for embedding such intelligent services in different application scenarios and heterogeneous IT landscapes. In this paper we present our knowledge discovery fram...
Chapter
Information technology has developed past its traditional focus on text-based data. The much-cited rapid growth of available information has been accompanied by a diversification of information types. Multimedia data is rapidly becoming the predominant form of information created, processed and distributed in many application domains. Multimedia dat...
Article
Large collections of text documents are increasingly common, both in business and personal information environments. Tools from the field of information visualisation are being used to help users make sense of and extract useful knowledge from such collections. Flat text collections are often visualised using distance calculations between documents...
Conference Paper
The domain of astronomy contributes a wealth of knowledge to the corpus of any general encyclopedia. Modern multimedia encyclopedias are capable of displaying complex, three-dimensional visualizations in real-time, enabling the integration of a planetarium, a virtual theater presenting astronomical facts in an educational and entertaining way. The...
Conference Paper
In many application areas which deal with heterogeneous data sets, temporal developments and topical relationships both play an important role. A number of visual representations have been developed which separately address each of these two aspects of the data. However, a simultaneous analysis of both aspects is also often required. In this paper w...
Conference Paper
Full-text available
The MISTRAL system, a service oriented architecture for semantic extraction of multimedia data from meeting recordings, is briefly described. It improves on other similar systems by extracting a variety of semantic metadata from one media type and integrating it with concepts derived from other media types, as well as by adding inference capabilitie...
Conference Paper
Full-text available
The advent of electronic media enabling rapid publishing and instant access to news articles has vastly increased the working pace of the media industry. Press agencies can no longer afford to neglect plagiarisms formerly perceived as irrelevant because such acts instantly void the commercial value of original and potentially exclusive news article...
Conference Paper
The archives of large national and international news agencies typically contain millions of articles featuring significant textual content and annotated metadata. Boolean queries and relevance ranked result lists have been traditional means of inquiry in such a context. This article presents an interface for query formulation and visual query anal...
Conference Paper
Full-text available
Multimedia data has a rich and complex structure in terms of inter- and intra-document references. Its potential is severely limited unless effective methods for semantic extraction and semantic-based cross-media exploration and retrieval can be devised. Today's leading-edge techniques in this area are working well for low-level feature extraction...
Conference Paper
Full-text available
Recent trends show that more and more digital cameras, video cameras and DVD recorders are sold and the number of emails and other messages sent increases each year. For example, it is estimated that there will be nearly 300 million digital image capture devices in use worldwide through 2004, capturing about 29 billion digital pictures (12). Use...
Article
Full-text available
It is no longer unusual for large document collections to contain many millions of documents. In order to manage this size of repository, it is often essential to structure the repository according to a thematic classification hierarchy. InfoSky is a system enabling users to explore such large, hierarchically structured document collections. Simil...
Conference Paper
The WebRat is a light-weight, web-based retrieval, clustering and visualisation framework which can be used to quickly design and implement search solutions for a wide area of application domains. We have employed this framework to create a web meta search engine combined with an interactive visualisation and navigation toolkit. Based on the SVG gr...
Article
This publication presents InfoSky, a system enabling exploration of large, hierarchically structured knowledge spaces. InfoSky employs a two-dimensional graphical representation with variable magnification, much like a real-world telescope, to visualise individual documents as stars, hierarchical structures as constellations, and the whole knowledg...
Conference Paper
WebRat is an interactive system for visualising and refining search result sets. Documents matching a query are dynamically clustered on the fly and visualised as a contour map of islands. Thematic clusters are built, analysed, and visualised in real time. Users can interactively explore the visualisation and refine queries by selectin...
Article
Full-text available
This publication presents InfoSky, a system enabling exploration of large, hierarchically structured knowledge spaces. InfoSky employs a two-dimensional graphical representation with variable magnification, much like a real-world telescope, to visualise individual documents as stars, hierarchical structures as constellations, and the whole knowledg...
Conference Paper
Today’s web search engines return very large result sets for query formulations consisting of few specific keywords. Results are presented as ranked lists containing textual descriptions of found items. Such representations do not allow identification of topical clusters, and consequently make it difficult for users to refine queries efficiently....
Article
InfoSky is a system enabling users to explore large, hierarchically structured document collections. Similar to a real-world telescope, InfoSky employs a planar graphical representation with variable magnification. Documents of similar content are placed close to each other and are visualised as stars, forming clusters with distinct shapes. For gre...
Conference Paper
The xFIND gatherer-broker architecture provides a wealth of metadata, which can be used to provide sophisticated search functionality. Local or remote documents are indexed and summaries and metadata are stored on an xFIND broker (server). An xFIND client can search a particular broker and access rich metadata for search result presentation, withou...
Article
Full-text available
High-quality software documentation is essential for understanding software systems. Shorter time-to-market software cycles increase the importance of automation in keeping the documentation up to date. In this paper, we describe the automatic support of the software documentation process using a social semantic software approach. Therefore, we...
Article
Full-text available
Challenges in Visual Analytics frequently involve massive repositories, which do not only contain a large number of information artefacts, but also a high number of relevant dimensions per artefact. Dimensionality reduction algorithms are commonly used to transform high-dimensional data into low-dimensional representations which are suitable for vi...
Article
Full-text available
Recent trends in structure and content of global knowledge spaces present new challenges to the field of Knowledge Discovery. Very large, highly structured repositories are increasingly replacing smaller, flat information spaces. Such repositories are often filled with multimedia documents, including image, audio and video data. This publication...
Conference Paper
Full-text available
The InfoSky visual explorer is a system enabling users to interactively explore large, hierarchically structured document collections. Similar to a real-world telescope, InfoSky employs a planar graphical representation with variable magnification. Documents of similar content are placed close to each other and displayed as stars, while collections...
