Sheila Castilho

Sheila Castilho
Dublin City University | DCU · Adapt Centre

Research Fellow at Dublin City University

About

45
Publications
28,668
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
839
Citations

Publications

Publications (45)
Article
Full-text available
In the present study, we investigated the post-editese phenomenon, i.e., the unique features that set machine translated post-edited texts apart from human-translated texts. We used two literary texts, namely, the English children’s novel by Lewis Carroll Alice’s Adventures in Wonderland (AW) and Paula Hawkins’ popular book The Girl on the Train (T...
Preprint
Full-text available
In the present study, we investigate the post-editese phenomenon, i.e., the unique features that set machine translated post-edited texts apart from human-translated texts. We use two literary texts, namely, the English children's novel by Lewis Carroll Alice’s Adventures in Wonderland (AW) and Paula Hawkins' popular book The Girl on the Train (TGO...
Conference Paper
Full-text available
With official status in both Ireland and the EU, there is a need for high-quality English-Irish (EN-GA) machine translation (MT) systems which are suitable for use in a professional translation environment. While we have seen recent research on improving both statistical MT and neural MT for the EN-GA pair, the results of such systems have always b...
Preprint
Full-text available
The quality of machine translation has increased remarkably over the past years, to the degree that it was found to be indistinguishable from professional human translation in a number of empirical investigations. We reassess Hassan et al.'s 2018 investigation into Chinese to English news translation, showing that the finding of human-machine parit...
Article
Full-text available
The quality of machine translation has increased remarkably over the past years, to the degree that it was found to be indistinguishable from professional human translation in a number of empirical investigations. We reassess Hassan et al.'s 2018 investigation into Chinese to English news translation, showing that the finding of human–machine parit...
Conference Paper
Full-text available
Despite increasing efforts to improve evaluation of machine translation (MT) by going beyond the sentence level to the document level, the definition of what exactly constitutes a "document level" is still not clear. This work deals with the context span necessary for a more reliable MT evaluation. We report results from a series of surveys involvi...
Article
This book “The Human Factors in Machine Translation” edited by Chan Sin-Wai—also the series editor of Studies in Translation Technology—brings together twelve chapters covering a variety of topics such as localisation, terminology and crowdsourcing. The goal of the book is to give a comprehensive look at the role of the human translator in machine...
Conference Paper
Full-text available
The translation of ambiguous words still poses challenges for machine translation. In this work, we carry out a systematic quantitative analysis regarding the ability of different machine translation systems to disambiguate the source language conjunctions "but" and "and". We evaluate specialised test sets focused on the translation of these two co...
Article
Full-text available
In the context of recent improvements in the quality of machine translation (MT) output and new use cases being found for that output, this article reports on an experiment using statistical and neural MT systems to translate literature. Six professional translators with experience of literary translation produced English-to-Catalan translations un...
Preprint
Full-text available
We reassess a recent study (Hassan et al., 2018) that claimed that machine translation (MT) has reached human parity for the translation of news from Chinese into English, using pairwise ranking and considering three variables that were not taken into account in that previous study: the language in which the source side of the test set was original...
Article
Full-text available
This article reports a multifaceted comparison between statistical and neural machine translation (MT) systems that were developed for translation of data from massive open online courses (MOOCs). The study uses four language pairs: English to German, Greek, Portuguese, and Russian. Translation quality is evaluated using automatic metrics and human...
Chapter
Full-text available
The continuing growth in digital content means that there is now significantly more linguistic content to translate using more diverse workflows and tools than ever before. This growth necessitates broader requirements for Translation Quality Assessment (TQA) that include appropriate methods for the domain, text type, workflow, and end-user.With th...
Book
Full-text available
This is the first volume that brings together research and practice from academic and industry settings and a combination of human and machine translation evaluation. Its comprehensive collection of papers by leading experts in human and machine translation quality and evaluation who situate current developments and chart future trends fills a clea...
Chapter
In this chapter, we argue that education and training in translation quality assessment (TQA)is being neglected for most, if not all, stakeholders of the translation process, from translators, post-editors, and reviewers to buyers and end-users of translation products and services. Within academia, there is a lack of education and training opportun...
Chapter
Full-text available
In both research and practice, translation quality assessment is a complex task involving a range of linguistic and extra-linguistic factors. This chapter provides a critical overview of the established and developing approaches to the definition and measurement of translation quality in human and machine translation workflows across a range of res...
Conference Paper
Full-text available
This paper reports on a pilot experiment that compares two different machine translation (MT) paradigms in reading comprehension tests. To explore a suitable methodology, we set up a pilot experiment with a group of six users (with English, Spanish and Simplified Chinese languages) using an English Language Testing System (IELTS), and an eye-tracke...
Conference Paper
Full-text available
This paper reports on a pilot experiment that compares two different machine translation (MT) paradigms in reading comprehension tests. To explore a suitable methodology, we set up a pilot experiment with a group of six users (with English, Spanish and Simplified Chinese languages) using an English Language Testing System (IELTS), and an eye-tracke...
Conference Paper
Full-text available
The present work describes a multilingual corpus of online content in the educational domain, i.e. Massive Open Online Course material, ranging from course forum text to subtitles of online video lectures, that has been developed via large-scale crowdsourcing. The English source text is manually translated into 11 European and BRIC languages using...
Conference Paper
Full-text available
The limited availability of in-domain training data is a major issue in the training of application-specific neural machine translation models. Professional outsourcing of bilingual data collections is costly and often not feasible. In this paper we analyze the influence of using crowdsourcing as a scalable way to obtain translations of target in-d...
Article
As machine translation (MT) continues to be used increasingly in the translation industry, there is a corresponding increase in the need to understand MT quality and, in particular, its impact on end-users. To date, little work has been carried out to investigate the acceptability of MT output among end-users and, ultimately, how acceptable they fi...
Conference Paper
Full-text available
In this paper, we report on a pilot mixed-methods experiment investigating the effects on productivity and on the translator experience of integrating machine translation (MT) post-editing (PE) with voice recognition (VR) and translation dictation (TD). The experiment was performed with a sample of native Spanish participants. In the quantitative p...
Conference Paper
Full-text available
This paper reports on a comparative evaluation of phrase-based statistical machine translation (PBSMT) and neural machine translation (NMT) for four language pairs, using the PET interface to compare educational domain output from both systems using a variety of metrics, including automatic evaluation as well as human rankings of adequacy and fluen...
Article
Full-text available
This paper discusses neural machine translation (NMT), a new paradigm in the MT field, comparing the quality of NMT systems with statistical MT by describing three studies using automatic and human evaluation methods. Automatic evaluation results presented for NMT are very promising, however human evaluations show mixed results. We report increases...
Conference Paper
Full-text available
In this paper we study the impact of using images to machine-translate user-generated e-commerce product listings. We study how a multi-modal Neural Machine Translation (NMT) model compares to two text-only approaches: a conventional state-of-the-art attentional NMT and a Statistical Machine Translation (SMT) model. User-generated product listings...
Conference Paper
Full-text available
In this paper, we study how humans perceive the use of images as an additional knowledge source to machine-translate user-generated product listings in an e-commerce company. We conduct a human evaluation where we assess how a multi-modal neural machine translation (NMT) model compares to two text-only approaches: a conventional state-of-the-art at...
Article
Full-text available
****This article is under copyright. The publisher should be contacted for permission to re-use or reprint the material in any form.*** * * * * Abstract: In September 2015, the ADAPT Centre for Digital Content Technology carried out a focus group study of 70 translators at the European Commission’s Directorate-General for Translation (DGT). The ai...
Research
Full-text available
As questões relativas às práticas de leitura são muito discutidas nos meios acadêmicos por pesquisadores de múltiplas áreas do conhecimento. No entanto, o resultado dessas pesquisas dificilmente chega às escolas que, em sua maioria, propõem leituras que não são capazes de despertar nos alunos o gosto pela leitura pois, salvo raras exceções, desenvo...
Thesis
This research measures end-user acceptability of machine-translated enterprise content. In cooperation with industry partners, the acceptability of machine translated, post-edited and human translated texts, as well as source text were measured using a user-centred translation approach (Suojanen, Koskinen and Tuominen 2015). The source language was...
Article
Today’s companies are overwhelmed with the need to create a huge amount of content, faster, customized, and for numerous media platforms, in order to support their products. Struggling with managing this amount of information, companies have now realised that the strategic management of multilingual enterprise content has become essential. Strategi...
Conference Paper
Full-text available
We show that metrics derived from recording gaze while reading, are better proxies for machine translation quality than automated metrics. With reliable eyetracking technologies becoming available for home computers and mobile devices, such metrics are readily available even in the absence of representative held-out human translations. In other wor...
Conference Paper
Full-text available
This paper presents a freely available resource for research on handling negation and speculation in review texts. The SFU Review Corpus, consisting of 400 documents of movie, book, and consumer product reviews, was annotated at the token level with negative and speculative keywords and at the sentence level with their linguistic scope. We report s...
Article
Full-text available
Given the significant improvements in Machine Translation (MT) quality and the increasing demand for translations, post-editing of automatic translations is becoming a popular practice in the translation industry. It has been shown to allow for larger volumes of translations to be produced, saving time and costs. In addition, the post-editing of au...
Article
Full-text available
We present an approach for translating subtitles where standard time and space constraints are modeled as part of the gen-eration of translations in a phrase-based statistical machine translation system (PB-SMT). We propose and experiment with two promising strategies for jointly trans-lating and compressing subtitles from En-glish into Portuguese....
Conference Paper
Full-text available
With the increasing demand for fast and accurate audiovisual translation, subtitlers are starting to consider the use of translation technologies to support their work. An important issue that arises from the use of such technologies is measuring how much effort needs to be put in by the subtitler in post-editing (semi-)automatic translations. In t...

Network

Cited By