Frank Hopfgartner

Frank Hopfgartner
Universität Koblenz-Landau

PhD

About

192
Publications
22,977
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,882
Citations

Publications

Publications (192)
Article
Bias in news search engines has been shown to influence users' perceptions of a news topic and contribute to the polarisation of society. As a result, there is a need for news search engines that increase user awareness of biases in the search results. While technical approaches have been developed to mitigate biases in search, very few studies hav...
Chapter
Hit song prediction, one of the emerging fields in music information retrieval (MIR), remains a considerable challenge. Being able to understand what makes a given song a hit is clearly beneficial to the whole music industry. Previous approaches to hit song prediction have focused on using audio features of a record. This study aims to improve the...
Preprint
Full-text available
Hit song prediction, one of the emerging fields in music information retrieval (MIR), remains a considerable challenge. Being able to understand what makes a given song a hit is clearly beneficial to the whole music industry. Previous approaches to hit song prediction have focused on using audio features of a record. This study aims to improve the...
Article
Full-text available
Emails, much like communicative genres such as letters that predate them, are a rich source of data for researchers, but they are replete with privacy considerations. This paper explores the resulting friction between privacy concerns and email data access. Studies of email can often be centred on understanding patterns of behaviour and/or relation...
Article
Full-text available
Objectives Settings in identifying need for emergency care amongst those with suspected COVID-19 infection and identify factors which affect triage accuracy. ApproachAn observational cohort study of adults who contacted the NHS 111 telephone triage service provided by Yorkshire Ambulance Service between March and June 2020 with symptoms indicating...
Conference Paper
Programming courses, the entry of the digital world, can be overwhelming to freshers, especially for international students experiencing academic transition. To better understand and improve international students’ programming learning experience, we conducted two studies. The first study identified international students’ lack of intrinsic motivat...
Conference Paper
The BASE prototype aims to improve user awareness of biases in search engine results. It utilises existing resources and NLP tools to identify biases in news articles. It incorporates bias visualisation features to inform users of biases in each news article and at the search results level. It also incorporates results reranking features to allow u...
Article
Full-text available
Background COVID-19 infected millions of people and increased mortality worldwide. Patients with suspected COVID-19 utilised emergency medical services (EMS) and attended emergency departments, resulting in increased pressures and waiting times. Rapid and accurate decision-making is required to identify patients at high-risk of clinical deteriorati...
Article
Full-text available
Many disciplines, including the broad Field of Information (iField), offer Data Science (DS) programs. There have been significant efforts exploring an individual discipline's identity and unique contributions to the broader DS education landscape. To advance DS education in the iField, the iSchool Data Science Curriculum Committee (iDSCC) was form...
Conference Paper
NTCIR-16 saw the fourth edition of the Lifelog task, which aimed to foster comparative benchmarking of approaches to automatic and interactive information retrieval from multimodal lifelog archives. In this paper, we describe the test collection employed, along with the tasks, the submissions and the findings from this NTCIR16 Lifelog-4 LEST sub-ta...
Article
Toxic comment classification models are often found biased towards identity terms, i.e., terms characterizing a specific group of people such as “Muslim” and “black”. Such bias is commonly reflected in false positive predictions, i.e., non-toxic comments with identity terms. In this work, we propose a novel approach to debias the model in toxic com...
Article
Full-text available
Objective To assess accuracy of emergency medical service (EMS) telephone triage in identifying patients who need an EMS response and identify factors which affect triage accuracy. Design Observational cohort study. Setting Emergency telephone triage provided by Yorkshire Ambulance Service (YAS) National Health Service (NHS) Trust. Participants...
Article
Full-text available
Objective To assess accuracy of telephone triage in identifying need for emergency care among those with suspected COVID-19 infection and identify factors which affect triage accuracy. Design Observational cohort study. Setting Community telephone triage provided in the UK by Yorkshire Ambulance Service NHS Trust (YAS). Participants 40 261 adult...
Conference Paper
The increasing use of social media as an information source brings further challenges - social media platforms can be an excellent medium for disseminating public awareness and critical information, that can be shared across large populations. However, misinformation in social media can have immense implications on public health, risking the effect...
Conference Paper
Full-text available
This study examines the relationship between international students’ movement and their digital transition on Twitter. By using the Twitter API, timelines for 17 Saudi students studying in the UK were retrieved. An in-depth qualitative content analysis for these accounts was conducted for a two year period, before and after their move. The study id...
Article
Full-text available
Background Tools proposed to triage patient acuity in COVID-19 infection have only been validated in hospital populations. We estimated the accuracy of five risk-stratification tools recommended to predict severe illness and compared accuracy to existing clinical decision making in a prehospital setting. Methods An observational cohort study using...
Conference Paper
The unprecedented events of the COVID-19 pandemic have generated an enormous amount of information and populated the Web with new content relevant to the pandemic and its implications. Visual information such as images has been shown to be crucial in the context of scientific communication. Images are often interpreted as being closer to the truth...
Conference Paper
Matching large and heterogeneous Knowledge Graphs (KGs) has been a challenge in the Semantic Web research community. This work highlights a number of limitations with current matching methods, such as: (1) they are highly dependent on string-based similarity measures, and (2) they are primarily built to handle well-formed ontologies. These features...
Preprint
Full-text available
Background Emergency Medical Services (EMS) have experienced surges in demand as the COVID-19 pandemic has progressed with ambulances services in the UK declaring major incidents due to the risk of care being compromised. COVID-19 specific EMS telephone triage tools have been introduced to help manage demand. There has been no previous evaluation o...
Chapter
Full-text available
Museum websites have been designed to provide access for different types of users, such as museum staff, teachers and the general public. Therefore, understanding user needs and demographics is paramount to the provision of user-centred features, services and design. Various approaches exist for studying and grouping users, with a more recent empha...
Preprint
Toxic comment classification models are often found biased toward identity terms which are terms characterizing a specific group of people such as "Muslim" and "black". Such bias is commonly reflected in false-positive predictions, i.e. non-toxic comments with identity terms. In this work, we propose a novel approach to tackle such bias in toxic co...
Preprint
Full-text available
Study Objective Tools proposed to triage patient acuity in COVID-19 infection have only been validated in hospital populations. We estimated the accuracy of five risk-stratification tools recommended to predict severe illness and compare accuracy to existing clinical decision-making in a pre-hospital setting. Methods An observational cohort study...
Preprint
Full-text available
Objective: To assess accuracy of telephone triage in identifying patients who need emergency care amongst those with suspected COVID-19 infection and identify factors which affect triage accuracy. Design: Observational cohort study Setting: Community telephone triage in the Yorkshire and Humber, Bassetlaw, North Lincolnshire and North East Lincolns...
Article
Full-text available
During times of crisis, information access is crucial. Given the opaque processes behind modern search engines, it is important to understand the extent to which the “picture” of the Covid-19 pandemic accessed by users differs. We explore variations in what users “see” concerning the pandemic through Google image search, using a two-step approach....
Article
The first FATE Winter School, organized by the Cyprus Center for Algorithmic Transparency (CyCAT) provided a forum for both students as well as senior researchers to examine the complex topic of Fairness, Accountability, Transparency and Ethics (FATE). Through a program that included two invited keynotes, as well as sessions led by CyCAT partners a...
Conference Paper
Full-text available
KGMatcher is a scalable and domain independent matching tool that matches the schema (classes) of larger Knowledge Graphs by following a hybrid matching approach. KGMatcher is composed of an instance-based matcher which only uses annotated instances of knowledge graph classes to generate candidate class alignments, and a stringbased matcher. This y...
Chapter
Full-text available
Lifelogging can be described as the process by which individuals use various software and hardware devices to gather large archives of multimodal personal data from multiple sources and store them in a personal data archive, called a lifelog. The Lifelog task at NTCIR was a comparative benchmarking exercise with the aim of encouraging research into...
Conference Paper
In the last decade, a remarkable number of Knowledge Graphs (KGs) were developed, such as DBpedia, NELL and Google knowledge graph. These KGs are the core of many web-based applications such as query answering and semantic web navigation. The majority of these KGs are semi-automatically constructed, which has resulted in a significant degree of het...
Article
This paper aims to establish digital forensics and data exploration as a methodology for supporting archival practice and research into a filmmaker's creative processes. We approach this by exploring the digital legacy hard drives of the late artist Stephen Dwoskin (1939–2012), who is recognised as an influential filmmaker at the forefront of the s...
Conference Paper
Full-text available
MART (Micro-activity Retrieval Task) was a NTCIR-15 collaborative benchmarking pilot task. The NTCIR-15 MART pilot aimed to motivate the development of irst generation techniques for high-precision micro-activity detection and retrieval, to support the identiication and retrieval of activities that occur over short time-scales such as minutes, rath...
Conference Paper
This paper presents a mobile data collection method using a 360° camera developed in an ongoing qualitative doctoral research project. The method captures audio, participant action and the running environment visuals of a situation in an immersive way compared to traditional audio‐video capture. The method contributes towards an understanding of in...
Article
Full-text available
Nowadays, multiple applications benefit from user modeling and personalization with different purposes. In the context of smart cities, this is particularly important for imp roving the citizens’ daily experience in many areas such as transportation, traffic, energy consumption, urban infrastructure, leisure, public participation, etc. The ubiquito...
Article
Increasingly, people are making use of diverse digital services that create many types of personal data. The most recent addition to such services are self-tracking devices that are capable of creating very detailed personal activity records. The focus of this special issue is to explore how such activity records can be exploited to provide user-ce...
Conference Paper
Full-text available
In their transition to a new country, international students often feel lost, anxious or stressed. Saudi students in the UK in particular may face further challenges due to the cultural, social and religious differences that they experience. There is a lot of evidence that social media play a crucial role in this experience. By interviewing 12 Saud...
Chapter
In their transition to a new country, international students often feel lost, anxious or stressed. Saudi students in the UK in particular may face further challenges due to the cultural, social and religious differences that they experience. There is a lot of evidence that social media play a crucial role in this experience. By interviewing 12 Saud...
Conference Paper
There have been multiple calls for integrating topics related to fairness, accountability, transparency, ethics (FATE) and social justice into Data Science curricula, but little exploration of how this might work in practice. This paper presents the findings of a collaborative auto-ethnography (CAE) engaged in by a MSc Data Science teaching team ba...
Article
Purpose: This paper provides an overview of the special issue on lifelogging behaviour and practice. Lifelogging refers to the automatic capturing and recording of one's life activities, e.g., using mobile Apps, Wearable devices and sensor platforms. Societal Implications: Given the increased uptake of lifelogging devices, lifelogging is becoming a...
Chapter
While vast volumes of personal data are being gathered daily by individuals, the MMM community has not really been tackling the challenge of developing novel retrieval algorithms for this data, due to the challenges of getting access to the data in the first place. While initial efforts have taken place on a small scale, it is our conjecture that a...
Chapter
Lifelogging refers to the process of digitally capturing a continuous and detailed trace of life activities in a passive manner. In order to assist the research community to make progress in the organisation and retrieval of data from lifelog archives, a lifelog task was organised at NTCIR since edition 12. Lifelog-3 was the third running of the li...
Conference Paper
The transition from the home to the host country for international students has been always considered a sensitive period. Students face multiple social, cultural and academic challenges during this time. For Saudi students coming to the UK, the experience can be especially challenging, because they are moving from a conservative Muslim culture to...
Conference Paper
While vast volumes of personal data are being gathered daily by individuals, the MMM community has not really been tackling the challenge of developing novel retrieval algorithms for this data, due to the challenges of getting access to the data in the first place. While initial efforts have taken place on a small scale, it is our conjecture that a...
Conference Paper
Lifelog-3 was the third instance of the lifelog task at NTCIR. At NTCIR-14, the Lifelog-3 task explored three different lifelog data access related challenges, the search challenge, the annotation challenge and the insights challenge. In this paper we review the activities of participating teams who took part in the challenges and we suggest next s...
Chapter
The websites of Cultural Heritage institutions attract the full range of users, from professionals to novices, for a variety of tasks. However, many institutions are reporting high bounce rates and therefore seeking ways to better engage users. The analysis of transaction logs can provide insights into users’ searching and navigational behaviours a...
Article
Full-text available
Gamification is now a well-established technique in Human-Computer Interaction. However, research on gamification still faces a variety of empirical and theoretical challenges. Firstly, studies of gamified systems typically focus narrowly on understanding individuals. short-term interactions with the system, ignoring more difficult to measure outco...
Conference Paper
Full-text available
Lifelog-3 was the third instance of the lifelog task at NTCIR. At NTCIR-14, the Lifelog-3 task explored three different lifelog data access related challenges, the search challenge, the annotation challenge and the insights challenge. In this paper we review the activities of participating teams who took part in the challenges and we suggest next s...
Article
The spread of toxic content online has attracted a wealth of research into methods of automatic detection and classification in recent years. However, two limitations still exist: 1) the lack of support for multi-label classification; and 2) the lack of understanding of the impact of the typical unbalanced datasets on such tasks. In this work, we b...
Conference Paper
It is well known that international students' transition from their home to the host country is accompanied by many challenges. During the transition period, students are more likely to be depressed, anxious, lonely and socially disconnected. Social media, with its informational and communication characteristics, may be an increasingly important as...
Conference Paper
It is our great pleasure to welcome you to the UMAP 2019 Workshop on Explainable and Holisitic User Modeling (ExHUM). Our workshop took inspiration from the analysis of the recent Web dynamics: according to a recent claim by IBM, 90% of the data available today have been created in the last two years. Such an exponential growth of personal informat...
Conference Paper
NewsREEL Multimedia premiers 2018 as part of the MediaEval Benchmarking Initiative. The NewsREEL task combines recommendation algorithms with image and text analysis. Participants must predict engagement with news items based on text snippets and annotated images. Several major German news portals have supplied data. The algorithms are evaluated in...
Conference Paper
Thanks to recent advances in the field of ubiquitous computing, an increasing number of users now rely on tools and apps that allow them to track specific aspects of their lives. An example are step counters and activity trackers that are promoted as unobtrusive tools to monitor our fitness levels. Interestingly, although significant research and d...
Chapter
A/B testing is currently being increasingly adopted for the evaluation of commercial information access systems with a large user base since it provides the advantage of observing the efficiency and effectiveness of information access systems under real conditions. Unfortunately, unless university-based researchers closely collaborate with industry...
Article
Full-text available
Evaluation in empirical computer science is essential to show progress and assess technologies developed. Several research domains such as information retrieval have long relied on systematic evaluation to measure progress: here, the Cranfield paradigm of creating shared test collections, defining search tasks, and collecting ground truth for these...
Conference Paper
One of the key selling points of smart home devices is that they provide solutions tailored to our needs. Identifying this need, however, is not always trivial, especially when dealing with infants who are not yet able to express their wishes using clear words. In this paper, we present preliminary work on identifying infants’ needs based on catego...
Conference Paper
Full-text available
It is our great pleasure to welcome you to the UMAP 2018 HUM (Holistic User Modeling) Workshop. According to a recent claim by IBM, 90% of the data available today have been created in the last two years. This exponential growth of online information has given new life to research in the area of user modeling and personalization, since information...
Conference Paper
iSchools have their roots in the collection, storage, analysis, and dissemination of archived materials of human activities. We foresee that sensing data via lifelogging devices (or Internet of Things at large) will eventually shape its significant part in the coming years. Information Research and Learning with Lifelogging Devices (IRLLD) aims to...
Conference Paper
Full-text available
Gamification has been attracted much interest, not only in the HCI community, in the last few years. However, there is still a lack of insights and theory on the relationships between game design elements, motivation, domain context and user behavior. In this workshop we want to discover the potentials of data-driven gamification design optimizatio...
Conference Paper
Full-text available
Quantified Self (QS) field needs to start thinking of how situated needs may affect the use of self-tracking technologies. In this workshop we will focus on the idiosyncrasies of specific categories of users.
Conference Paper
News recommender systems provide users with access to news stories that they find interesting and relevant. As other online, stream-based recommender systems, they face particular challenges, including limited information on users’ preferences and also rapidly fluctuating item collections. In addition, technical aspects, such as response time and s...
Conference Paper
Recommender System research has evolved to focus on developing algorithms capable of high performance in online systems. This development calls for a new evaluation infrastructure that supports multi-dimensional evaluation of recommender systems. Today's researchers should analyze algorithms with respect to a variety of aspects including predictive...
Conference Paper
Full-text available
The importance of user modeling and personalization is taken for granted in several scenarios. According to this widespread paradigm, each user can be modeled through some (explicitly or implicitly gathered) information about her knowledge or about her preferences, in order to adapt the behavior of a generic intelligent system to her specific chara...
Conference Paper
Context-awareness has become a critical factor in improving the predictions of user interest in modern online TV recommendation systems. In addition to individual user preferences, existing context-aware approaches such as tensor factorization incorporate system-level contextual bias to increase predicting accuracy. We analyzed a user interaction d...
Article
Full-text available
In today’s society where audio-visual content such as professionally edited and user-generated videos is ubiquitous, automatic analysis of this content is a decisive functionality. Within this context, there is an extensive ongoing research about understanding the semantics (i.e., facts) such as objects or events in videos. However, little research...
Conference Paper
Due to increasing possibilities to create digital video, we are facing the emergence of large video archives that are made accessible either online or offline. Though a lot of research has been spent on video retrieval tools and methods, which allow for automatic search in videos, still the performance of automatic video retrieval is far from optim...
Article
Full-text available
The third workshop on Gamification for Information Retrieval (GamifIR) took place on the 21th of July 2016 in conjunction with SIGIR 2016 in Pisa, Italy. It was the first GamifIR held in conjunction with the SIGIR, the first and second GamifIR workshops were both colocated with ECIR. The workshop program included one invited keynote presentation, s...
Book
This book constitutes the proceedings of the International Summit on Electronic Healthcare, eHealth 360°, held in Budapest, Hungary, in June 2016. The 55 revised full papers presented along with 9 short papers were carefully reviewed and selected from 81 submissions. The papers represent the latest results from the co-located conferences as the tra...
Conference Paper
Increasingly, educators make use of learning-by-doing approaches to teach students of STEM programmes the skills that they need to become successful in careers in research and development. However, we argue that the technical challenges addressed in these programmes are often too limited and therefore do not support the students in gaining the more...
Conference Paper
Full-text available
Gamification has been widely accepted in the HCI community in the last few years. However, the current debate is focused on its short-term consequences, such as effectiveness and usefulness, while its side-effects, long-term criticalities and systemic impacts are rarely raised. This workshop will explore the gamification design space from a critica...
Conference Paper
In real-world scenarios, recommenders face non-functional requirements of technical nature and must handle dynamic data in the form of sequential streams. Evaluation of recommender systems must take these issues into account in order to be maximally informative. In this paper, we present Idomaar—a framework that enables the efficient multi-dimensio...
Conference Paper
In this position paper, we take the experimental approach of putting algorithms aside, and reflect on what recommenders would be for people if they were not tied to technology. By looking at some of the shortcomings that current recommenders have fallen into and discussing their limitations from a human point of view, we ask the question: if freed...
Conference Paper
Full-text available
While the Quantified Self (QS) community is described in terms of "self-knowledge through numbers" people are increasingly demanding value and meaning. In this workshop we aim at refocusing the QS debate on the value of data for providing new services.
Conference Paper
Successful news recommendation requires facing the challenges of dynamic item sets, contextual item relevance, and of fulfilling non-functional requirements, such as response time. The CLEF NewsREEL challenge is a campaign-style evaluation lab allowing participants to tackle news recommendation and to optimize and evaluate their recommender algorit...
Conference Paper
Full-text available
One of the main challenges faced by providers of interactive information access systems is to engage users in the use of their systems. The library sector in particular can benefit significantly from increased user engagement. In this short paper, we present a preliminary analysis of a university library system that aims to trigger users' extrinsic...
Conference Paper
Test collections have a long history of supporting repeatable and comparable evaluation in Information Retrieval (IR). However, thus far, no shared test collection exists for IR systems that are designed to index and retrieve multimodal lifelog data. In this paper we introduce the first test collection for personal lifelog data, which has been empl...
Conference Paper
Stronger engagement and greater participation is often crucial to reach a goal or to solve an issue. Issues like the emerging employee engagement crisis, insufficient knowledge sharing, and chronic procrastination. In many cases we need and search for tools to beat procrastination or to change people's habits. Gamification is the approach to learn...
Article
Full-text available
The news industry has gone through seismic shifts in the past decade with digital content and social media completely redefining how people consume news. Readers check for accurate fresh news from multiple sources throughout the day using dedicated apps or social media on their smartphones and tablets. At the same time, news publishers rely more an...
Chapter
With more and more wearable devices and smartphone apps being released that are capable of unobtrusively recording various aspects of our life, we are currently witnessing the emergence of a new trend. Followers of this trend rely on these apps and devices to track their every day activities and to gain insights into their personal well-being. Alth...
Article
In today's society where audio-visual content is ubiquitous, violence detection in movies and Web videos has become a decisive functionality, e.g., for providing automated youth protection services. In this paper, we concentrate on two important aspects of video content analysis: Time efficiency and modeling of concepts (in this case, violence mode...
Conference Paper
Full-text available
The news industry has gone through seismic shifts in the past decade with digital content and social media completely redefining how people consume news. Readers check for accurate fresh news from multiple sources throughout the day using dedicated apps or social media on their smartphones and tablets. At the same time, news publishers rely more an...