Claudia Niederee

Claudia Niederee
Leibniz Universität Hannover · L3S Research Center

About

103
Publications
12,124
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,033
Citations

Publications

Publications (103)
Preprint
Full-text available
For their attractiveness, comprehensiveness and dynamic coverage of relevant topics, community-based question answering sites such as Stack Overflow heavily rely on the engagement of their communities: Questions on new technologies, technology features as well as technology versions come up and have to be answered as technology evolves (and as comm...
Preprint
Full-text available
Inhibition is one of the core concepts in Cognitive Psychology. The idea of inhibitory mechanisms actively weakening representations in the human mind has inspired a great number of studies in various research domains. In contrast, Computer Science only recently has begun to consider inhibition as a second basic processing quality beside activation...
Article
Inhibition is one of the core concepts in Cognitive Psychology. The idea of inhibitory mechanisms actively weakening representations in the human mind has inspired a great number of studies in various research domains. In contrast, Computer Science only recently has begun to consider concepts such as digital forgetting or suppression of irrelevant...
Article
Full-text available
Social media posts are a great source for life summaries aggregating activities, events, interactions and thoughts of the last months or years. They can be used for personal reminiscence as well as for keeping track with developments in the lives of not-so-close friends. One of the core challenges of automatically creating such summaries is to deci...
Article
Trends like digital transformation even intensify the already overwhelming mass of information knowledge workers face in their daily life. To counter this, we have been investigating knowledge work and information management support measures inspired by human forgetting. In this paper, we give an overview of solutions we have found during the last...
Preprint
Trends like digital transformation even intensify the already overwhelming mass of information knowledge workers face in their daily life. To counter this, we have been investigating knowledge work and information management support measures inspired by human forgetting. In this paper, we give an overview of solutions we have found during the last...
Chapter
Full-text available
Current trends, like digital transformation and ubiquitous computing, yield in massive increase in available data and information. In artificial intelligence (AI) systems, capacity of knowledge bases is limited due to computational complexity of many inference algorithms. Consequently, continuously sampling information and unfiltered storing in kno...
Conference Paper
Full-text available
The observation of social media provides an important complementing source of information about an unfolding event such as a crisis situation. For this purpose we have developed and demonstrate Sover!, a system to monitor real-time dynamic events via Twitter targeting the needs of aid organizations. At its core it builds upon an effective adaptive...
Conference Paper
In this paper, we present a neural network based framework for answering non-factoid questions. The framework consists of two main components: Answer Retriever and Answer Ranker. In the first component, we leverage off-the-shelf retrieval models (e.g. bm25) to retrieve a pool of candidate answers regarding to the input question. Answer Ranker is th...
Chapter
Multimedia content and especially personal multimedia content is created in abundance today. Short- to mid-term storage of this content is typically no problem due to decreased storage prices and the availability of storage services. However, for the long-term perspective, i.e., preservation, adequate technologies and best practices for keeping the...
Chapter
In the cognitive processes of humans, forgetting is a very effective way for focusing on the important things, while unstressing things, which are (currently) less important. The translation of forgetting into the digital world is, thus, a promising approach for better dealing with the increasing problem of information overload. Information overloa...
Chapter
The idea of the Preserve-or-Forget (PoF) approach introduced in this book is to follow a forgetful, focused approach to digital preservation, which is inspired by human forgetting and remembering. Its goal is to ease the adoption of preservation technology especially in the personal and organizational context and to ensure that important content is...
Article
Full-text available
Recently a lot of progress has been made in rumor modeling and rumor detection for micro-blogging streams. However, existing automated methods do not perform very well for early rumor detection, which is crucial in many settings, e.g., in crisis situations. One reason for this is that aggregated rumor features such as propagation features, which wo...
Conference Paper
Full-text available
Recently a lot of progress has been made in rumor modeling and rumor detection for micro-blogging streams. However, existing automated methods do not perform very well for early rumor detection, which is crucial in many settings, e.g., in crisis situations. One reason for this is that aggregated rumor features such as propagation features, which wo...
Conference Paper
The diffusion of digital photography lets people take hundreds of photos during personal events, such as trips and ceremonies. Many methods have been developed for summarizing such large personal photo collections. However, they usually emphasize the coverage of the original collection, without considering which photos users would select, i.e. thei...
Conference Paper
Full-text available
Entities and their relatedness are useful information in various tasks such as entity disambiguation, entity recommendation or search. In many cases, entity relatedness is highly affected by dynamic contexts, which can be reflected in the outcome of different applications. However, the role of context is largely unexplored in existing entity relate...
Article
Full-text available
For many European countries, in 2015 the refugee situation developed from a remote tragedy reported upon in the news to a situation they have to deal with in their own neighborhood. Driven by this observation, we investigated the development of the perception of the refugee situation during 2015 in Twitter. Starting from a dataset of 1.7 Million tw...
Chapter
The development of efficient and robust algorithms for Big Data processing is a demanding task, which has to cope with the characteristics of this type of data (3Vs). Putting such algorithms as processing elements into larger pipelines adds an extra level of complexity, which can be alleviated by relying on a model-based approach including code gen...
Conference Paper
With the growing amount of content stored in personal and organizational information spaces, finding and re-finding documents becomes both more crucial and challenging. In this work, we propose an approach to reduce information overload in navigation by automatically focusing on important documents, adaptively to the tasks at hand. Based on the ide...
Conference Paper
Full-text available
Long-running, high-impact events such as the Boston Marathon bombing often develop through many stages and involve a large number of entities in their unfolding. Timeline summarization of an event by key sentences eases story digestion, but does not distinguish between what a user remembers and what she might want to re-check. In this work, we pres...
Article
With the growing volumes of and reliance on digital content, there is a clear need for better information managementapproaches that keep relevant information accessible and usable in long-term encapsulated together with the evolving context information that is needed for its interpretation. Inspired by the role of forgetting in the human brain, in...
Conference Paper
Full-text available
Due to their first-hand, diverse and evolution-aware reflection of nearly all areas of life, web archives are emerging as gold-mines for content analytics of many sorts. However, supporting search, which goes beyond navigational search via URLs, is a very challenging task in these unique structures with huge, redundant and noisy temporal content. I...
Conference Paper
Full-text available
When selecting important photos from a personal photo collection - e.g. for creating an enjoyable sub-collection for revisiting or preservation - photos are not considered in isolation. Therefore, collection-level criteria are also taken into account by automated photo selection methods. However, the typical two-step process of first clustering and...
Article
With the growing volumes of and reliance on digital content, there is a clear need for better information management approaches that keep relevant information accessible and usable in long-term encapsulated together with the evolving context information that is needed for its interpretation. Inspired by the role of forgetting in the human brain, in...
Conference Paper
Full-text available
Social networks provide a platform for people to connect and share information and moments of their lives. With the increasing engagement of users in such platforms, the volume of personal information that is exposed online grows accordingly. Due to carelessness, unawareness or difficulties in defining adequate privacy settings, private or sensitiv...
Conference Paper
Full-text available
With the reflection of nearly all types of social cultural, societal and everyday processes of our lives in the web, web archives from organizations such as the Internet Archive have the potential of becoming huge gold-mines for temporal content analytics of many kinds (e.g., on politics, social issues, economics or media). First hand evidences for...
Conference Paper
Full-text available
Fully understanding an older news article requires context knowledge from the time of article creation. Finding information about such context is a tedious and time-consuming task, which distracts the reader. Simple contextualization via Wikification is not sufficient here. The retrieved context information has to be time-aware, concise (not full W...
Conference Paper
Full-text available
Fully understanding an older news article requires context knowledge from the time of article creation. Finding information about such context is a tedious and time-consuming task, which distracts the reader. Simple contextualization via Wikification is not sufficient here. The retrieved context information has to be time-aware, concise (not full W...
Article
Each year makes it easier to accumulate large numbers of photos and videos in the social and personal digital space. Their long-term existence is mostly driven by chance rather than by clear guidelines or rules for archiving them. Thus, unfortunately, cases of nonintended both the exposure and disappearance of personal photos happen much too often....
Conference Paper
Full-text available
Understanding a text, which was written some time ago, can be compared to translating a text from another language. Complete interpretation requires a mapping, in this case, a kind of time-travel translation between present context knowledge and context knowledge at time of text creation. In this paper, we study time-aware re-contextualization, the...
Conference Paper
One of the core challenges of automatically creating Social Web summaries is to decide which posts to remember, i.e., to consider for summary inclusion and which to forget. Keeping everything would overwhelm the user and would also neglect the often intentionally ephemeral nature of Social Web posts. In this paper, we analyze high-impact features t...
Conference Paper
Full-text available
Topic modeling has gained a lot of popularity as a means for identifying and describing the topical structure of textual documents and whole corpora. There are, however, many document collections such as qualitative studies in the digital humanities that cannot easily benefit from this technology. The limited size of those corpora leads to poor qua...
Conference Paper
Full-text available
The idea of the European project GLOCAL is to use events as the central concept for search, organization and combination of multimedia content from various sources. For this purpose methods for event detection and event matching as well as media analysis are developed. Considered events range from private, over local, to global events.
Conference Paper
Full-text available
A prerequisite for leveraging the vast amount of data available on the Web is Entity Resolution, i.e., the process of identifying and linking data that describe the same real-world objects. To make this inherently quadratic process applicable to large data sets, blocking is typically employed: entities (records) are grouped into clusters - the bloc...
Chapter
An important task in data integration and data cleaning is the identification of data that describe the same real-world object, such as an event, a person, or a movie. There are various techniques to tackle this problem. The typical methodology is to collect matching evidence, such as similarities between the entity strings, and based on them, gene...
Conference Paper
Full-text available
Web browser users return to Web pages for various reasons. Apart from pages visited due to backtracking, they typically have a number of favorite/important pages that they monitor or tasks that reoccur on an infrequent basis. In this paper, we introduce the architecture of a system that facilitates revisitations through the effective prediction of...
Conference Paper
Full-text available
Blocking methods are crucial for making the inherently quadratic task of Entity Resolution more efficient. The blocking methods proposed in the literature rely on the homogeneity of data and the availability of binding schema information; thus, they are inap-plicable to the voluminous, noisy, and highly heterogeneous data of the Web 2.0 user-genera...
Conference Paper
Full-text available
We have recently witnessed an enormous growth in the volume of structured and semi-structured data sets available on the Web. An important prerequisite for using and combining such data sets is the detection and merge of information that describes the same real-world entities, a task known as Entity Resolution. To make this quadratic task efficient...
Conference Paper
Full-text available
Entity resolution is the task of identifying entities that refer to the same real-world object. It has important applications in the context of digital libraries, such as citation matching and author disambiguation. Blocking is an established methodology for efficiently addressing this problem; it clusters similar entities together, and compares so...
Conference Paper
Entity linkage deals with the problem of identifying whether two pieces of information represent the same real world object. The traditional methodology computes the similarity among the entities, and then merges those with similarity above some specific threshold. We demonstrate LinkDB, an original entity storage and querying system that deals wit...
Conference Paper
Full-text available
Individuals contribute content on the Web at an unprecedented rate, accumulating immense quantities of (semi-) structured data. Wisdom of the Crowds theory advocates that such information (or parts of it) is constantly overwritten, updated, or even deleted by other users, with the goal of rendering it more accurate, or up-to-date. This is particula...
Conference Paper
Full-text available
Selecting and presenting content culled from multiple heterogeneous and physically distributed sources is a challenging task. The exponential growth of the web data in modern times has brought new requirements to such integration systems. Data is not any more produced by content providers alone, but also from regular users through the highly popula...
Article
Full-text available
Selecting and presenting content culled from multiple heteroge-neous and physically distributed sources is a challenging task. The exponential growth of the web data in modern times has brought new requirements to such integration systems. Data is not any more produced by content providers alone, but also from regular users through the highly popul...
Article
Full-text available
Entity linkage is central to almost every data integration and data cleaning scenario. Traditional techniques use some computed similarity among data structure to perform merges and then answer queries on the merged data. We describe a novel framework for entity linkage with uncertainty. Instead of using the linkage information to merge structures...
Conference Paper
Full-text available
We present the Entity Name System (ENS), an enabling infrastructure, which can host descriptions of named entities and provide unique identifiers, on large-scale. In this way, it opens new perspectives to realize entity-oriented, rather than keyword-oriented, Web information systems. We describe the architecture and the functionality of the ENS, al...
Conference Paper
Full-text available
Given the growing number of structured and evolving online repositories, the need for lightweight information integration has increased in the past years. We have developed an integration approach which relies on partial mappings for query rewriting and combines them with a controlled way of relaxation. In this paper we propose a novel approach f...
Chapter
The success of industrial and scientific research has always been dependent on new discoveries and innovations, but tighter budgets and increasing global competition push the pace with which innovation must happen nowadays. Bringing new products to the market before competitors do constitutes a crucial competitive advantage for many companies and o...
Article
Full-text available
Semantic annotation of content is a crucial building block of making the Semantic Web fly. The (semi-)automatic support of the underlying seman-tic knowledge supply chain requires contributions from different research dis-ciplines and well-defined pipelines, which step-by-step create such annotations from raw content objects. This paper presents an...
Article
Full-text available
Loosely structured heterogeneous information spaces are typically created by merging data from a variety of different applications and information sources. A common problem these information spaces need to address is that various data describe the same real-word entities (e.g., conferences, organizations). In this demo, we introduce necessity, an e...
Conference Paper
Light-weight information integration has to be able to effectively deal with settings where only partial mappings between the information sources to be integrated are defined. In our query rewriting approach, such partial mappings are used as the basis for defining several strategies to substitute elements in the query, and for introducing wildcard...
Conference Paper
Full-text available
Recognizing that information from different sources refers to the same (real world) entity is a crucial challenge in instance-level information integration, as it is a pre-requisite for combining the information about one entity from different sources. The required entity matching is time consuming and thus imposes a crucial limit for large-scale,...
Article
Full-text available
Purpose – The discovery of the “right” ontology or ontology part is a central ingredient for effective ontology re-use. The purpose of this paper is to present an approach for supporting a form of adaptive re-use of sub-ontologies, where the ontologies are deeply integrated beyond pure referencing. Design/methodology/approach – Starting from an ont...
Conference Paper
Full-text available
Heterogeneous information spaces are typically created by merging data from a variety of different applications and information sources. These sources often use different identifiers for data that describe the same real-word entity (for example an artist, a conference, an organization). In this paper we propose a new probabilistic Entity Linkage al...
Article
A common trend that can be currently observed in information management is a movement towards less central control and more flexibility and self-organization. This is true on the level of the infrastructure technologies as well as in information management and services, where personalized and community-based solutions such as Flickr or collaborativ...
Conference Paper
In this paper we present metadata based recommendation algorithms addressing two scenarios within social desktop communities: a) recommendation of resources from the co-worker's desktop, and b) recommendation of metadata for enriching the own annotation layer. Together with the algorithms we present first evaluation results as well as empirical eva...
Conference Paper
We present a flexible information integration approach which addresses the dynamic integration needs in a personal desktop environment where only partial mappings are defined between the sources to be integrated. Our approach is based on query rewriting using substitution rules. In addition to exploiting defined mappings, we employ substitution str...
Conference Paper
The discovery of the "right" ontology or ontology part is a central ingredient for effective ontology re-use. We present an approach for supporting a form of adaptive re-use of sub- ontologies, where the ontologies are deeply integrated beyond pure referencing. Starting from an ontology draft which reflects the intended modeling perspective, the on...
Article
E-Science (or "enhanced Science") demands innovative methods, services, and technology infrastructures that effectively support researchers in their daily working processes. In the past, e-Science mainly focussed on those scientific domains and tasks which create or use very large data sets, and require large amounts of computational power, i.e., t...
Chapter
The success of industrial and scientific research has always been dependent on new discoveries and innovations, but tighter budgets and increasing global competition push the pace with which innovation must happen nowadays. Bringing new products to the market before competitors do constitutes a crucial competitive advantage for many companies and o...
Conference Paper
Algorithms designed to support users in retrieving relevant information base their relevance computations on user profiles, in which representations of the users interests are maintained. This paper focuses on the use of supervised machine learning techniques to induce user profiles for Intelligent Information Access. The access must be personalize...
Chapter
Full-text available
Im Kontext des Semantic Web werden die inzwischen etablierten Web-Anwendungen mit neuen Herausforderungen konfrontiert. Um die neue Umgebung effektiv zu nutzen und das neue „Klientel“ Software-Agenten adäquat zu bedienen, sind einige Erweiterungen der traditionellen Web-Anwendungen notwendig. Dieses Papier diskutiert Ansätze, um mit diesen neuen He...
Conference Paper
Full-text available
Personalization today has wide spread use on many Web sites. Systems and applications store preferences and information about users in order to provide personalized access. However, these systems store user profiles in proprietary formats. Although some of these systems store similar information about the user, exchange or reuse of information is n...
Conference Paper
Mediating between available information objects and individual information