Dimitra Gkatzia

Dimitra Gkatzia
  • PhD in Computer Science
  • Professor (Associate) at Edinburgh Napier University

About

55
Publications
5,017
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
510
Citations
Current institution
Edinburgh Napier University
Current position
  • Professor (Associate)
Additional affiliations
January 2016 - present
Edinburgh Napier University
Position
  • Professor (Associate)
December 2014 - December 2015
Heriot-Watt University
Position
  • Research Associate
Education
September 2011 - June 2015
Heriot-Watt University
Field of study
  • Computer Science
September 2010 - August 2011
Heriot-Watt University
Field of study
  • Artificial Intelligence
October 2004 - September 2008
University of Piraeus
Field of study
  • Education Technology and Digital Systems

Publications

Publications (55)
Preprint
Automatic metrics are extensively used to evaluate natural language processing systems. However, there has been increasing focus on how they are used and reported by practitioners within the field. In this paper, we have conducted a survey on the use of automatic metrics, focusing particularly on natural language generation (NLG) tasks. We inspect...
Article
Pre-trained language models are a highly effective source of knowledge transfer for natural language processing tasks, as their development represents an investment of resources beyond the reach of most researchers and end users. The widespread availability of such easily adaptable resources has enabled high levels of performance, which is especial...
Article
Full-text available
This paper summarizes the structure and findings from the first Workshop on Troubles and Failures in Conversations between Humans and Robots . The workshop was organized to bring together a small, interdisciplinary group of researchers working on miscommunication from two complementary perspectives. One group of technology-oriented researchers was...
Conference Paper
Full-text available
The most effective way of communication between humans and robots is through natural language communication. However, there are many challenges to overcome before robots can effectively converse in order to collaborate and work together with humans. This paper introduces TaskMaster 1 a novel cross-platform spoken dialogue system (SDS) for human-rob...
Article
Full-text available
Earlier research has shown that few studies in Natural Language Generation (NLG) evaluate their system outputs using an error analysis, despite known limitations of automatic evaluation metrics and human ratings. This position paper takes the stance that error analyses should be encouraged, and discusses several ways to do so. This paper is not jus...
Chapter
This paper argues that future dialogue systems must retrieve relevant information from multiple structured and unstructured data sources in order to generate natural and informative responses as well as exhibit commonsense capabilities and flexibility in dialogue management. To this end, we firstly review recent methods in document-grounded dialogu...
Preprint
Full-text available
Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal as better alternatives arise. This problem is especially pertinent in natural language generation which requi...
Article
Full-text available
The Covid-19 pandemic required many aspects of life to move online. This accelerated a broader trend for increasing use of ICT and AI, with implications for both the world of work and career development. This article explores the potential benefits and challenges of including AI in career practice. It provides an overview of the technology, includi...
Preprint
Full-text available
Large scale adoption of large language models has introduced a new era of convenient knowledge transfer for a slew of natural language processing tasks. However, these models also run the risk of undermining user trust by exposing unwanted information about the data subjects, which may be extracted by a malicious party, e.g. through adversarial att...
Conference Paper
Full-text available
This paper proposes a novel task on commonsense-enhanced task-based dialogue grounded in documents and describes the Task2Dial dataset, a novel dataset of document-grounded task-based dialogues, where an Information Giver (IG) provides instructions (by consulting a document) to an Information Follower (IF), so that the latter can successfully compl...
Preprint
Full-text available
This paper proposes a novel task on commonsense-enhanced task-based dialogue grounded in documents and describes the Task2Dial dataset, a novel dataset of document-grounded task-based dialogues, where an Information Giver (IG) provides instructions (by consulting a document) to an Information Follower (IF), so that the latter can successfully compl...
Preprint
Full-text available
Deep learning-based language models have achieved state-of-the-art results in a number of applications including sentiment analysis, topic labelling, intent classification and others. Obtaining text representations or embeddings using these models presents the possibility of encoding personally identifiable information learned from language and con...
Preprint
Full-text available
We observe a severe under-reporting of the different kinds of errors that Natural Language Generation systems make. This is a problem, because mistakes are an important indicator of where systems should still be improved. If authors only report overall performance metrics, the research community is left in the dark about the specific weaknesses tha...
Article
Full-text available
Conversational systems aim to generate responses that are accurate, relevant and engaging , either through utilising neural end-to-end models or through slot filling. Human-to-human conversations are enhanced by not only the latest utterance of the interlocutor, but also by recalling and referring to relevant information about concepts/objects cove...
Article
Neural Referring Expression Generation (REG) models have shown promising results in generating expressions which uniquely describe visual objects. However, current REG models still lack the ability to produce diverse and unambiguous referring expressions (REs). To address the lack of diversity, we propose generating a set of diverse REs, rather tha...
Article
Full-text available
The proliferation of social media platforms changed the way people interact online. However, engagement with social media comes with a price, the users’ privacy. Breaches of users’ privacy, such as the Cambridge Analytica scandal, can reveal how the users’ data can be weaponized in political campaigns, which many times trigger hate speech and anti-...
Article
Decision-making is often dependent on uncertain data, e.g. data associated with confidence scores or probabilities. This article presents a comparison of different information presentations for uncertain data and, for the first time, measures their effects on human decision-making, in the domain of weather forecast generation. We use a game-based s...
Article
Full-text available
Data-to-text systems are powerful in generating reports from data automatically and thus they simplify the presentation of complex data. Rather than presenting data using visualisation techniques, data-to-text systems use natural (human) language, which is the most common way for human-human communication. In addition, data-to-text systems can adap...
Conference Paper
Full-text available
Decision-making is often dependent on uncertain data, e.g. data associated with confidence scores or probabilities. We present a comparison of different information presentations for uncertain data and, for the first time, measure their effects on human decision-making. We show that the use of Natural Language Generation (NLG) improves decision-mak...
Conference Paper
Full-text available
We propose a novel approach for handling first-time users in the context of automatic report generation from time-series data in the health domain. Handling first-time users is a common problem for Natural Language Generation (NLG) and interactive systems in general-the system cannot adapt to users without prior interaction or user knowledge. In th...
Preprint
Decision-making is often dependent on uncertain data, e.g. data associated with confidence scores or probabilities. We present a comparison of different information presentations for uncertain data and, for the first time, measure their effects on human decision-making. We show that the use of Natural Language Generation (NLG) improves decision-mak...
Conference Paper
Full-text available
We present a newly crowd-sourced data set of natural language references to objects anchored in complex urban scenes (In short: The REAL Corpus – Referring Expressions Anchored Language). The REAL corpus contains a collection of images of real-world urban scenes together with verbal descriptions of target objects generated by humans, paired with da...
Conference Paper
Full-text available
Navigation when running is exploratory, characterised by both starting and ending in the same location, and iteratively foraging the environment to find areas with the most suitable running conditions. Runners do not wish to be explicitly directed , or refer to navigation aids that cause them to stop running, such as maps. Such undirected navigatio...
Conference Paper
Full-text available
We present a novel approach for automatic report generation from time-series data, in the context of student feedback generation. Our proposed methodology treats content selection as a multi-label classification (MLC) problem, which takes as input time-series data (students' learning data) and outputs a summary of these data (feedback). Unlike prev...
Conference Paper
Full-text available
This work presents a new approach for modelling " unknown unknowns " in Natural Language Generation (NLG) systems. We first address the question of how to adapt to unknown users, using a combination of cluster-based user modelling and Pareto optimal Multi-Objective Optimisa-tion. Next, we present a corpus study on generating referring expressions (...
Conference Paper
Full-text available
Understanding and interpreting medical sensor data is an essential part of pre-hospital care in medical emergencies, but requires training and previous knowledge. In this paper, we describe ongoing work towards a medical decision support tool, which automatically generates textual summaries of underlying sensor data. In particular, we present resul...
Conference Paper
Full-text available
We present a novel approach for automatic report generation from time-series data, in the context of student feedback generation. Our proposed methodology treats content selection as a multi-label (ML) classification problem, which takes as input time-series data and outputs a set of templates, while capturing the dependencies between selected temp...
Conference Paper
Full-text available
We describe a statistical Natural Language Generation (NLG) method for summarisation of time-series data in the context of feedback generation for students. In this paper, we initially present a method for collecting time-series data from students (e.g. marks, lectures attended) and use example feedback from lecturers in a datadriven approach to co...
Article
Full-text available
The proposed work focuses on demonstrating how personal interests and preferences can be used to improve report generation from time-series data. We apply these ideas in the weather and health informatics domains. In this extended abstract, we propose a dynamic User Model approach to data-driven generation that adds personalisation to the generated...

Network

Cited By