Jorge Gracia

Jorge Gracia
University of Zaragoza | UNIZAR · Departamento de Informatica e Ingenieria de Sistemas

PhD in Computer Science

About

80
Publications
14,386
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,168
Citations
Citations since 2016
30 Research Items
558 Citations
2016201720182019202020212022020406080100
2016201720182019202020212022020406080100
2016201720182019202020212022020406080100
2016201720182019202020212022020406080100
Introduction
My main research interests are Semantic Web, Ontology Matching, Multilingual Web of Data, Linguistic Linked Data, Query Interpretation, Web Intelligence. Currently I am researching on how to move language resources (with a special focus on dictionaries) from their data silos into the multilingual Web of Data and make them interoperable, to support a future generation of (Linked Data-aware) Natural Language Processing tools. I am also interested on methods and techniques for cross-lingual linking and cross-lingual information access.
Additional affiliations
September 2009 - present
Universidad Politécnica de Madrid
Position
  • PostDoc Position
October 2004 - December 2009

Publications

Publications (80)
Article
Full-text available
This article provides a comprehensive and up-to-date survey of models and vocabularies for creating linguistic linked data (LLD) focusing on the latest developments in the area and both building upon and complementing previous works covering similar territory. The article begins with an overview of some recent trends which have had a significant im...
Chapter
Full-text available
We describe the use of linguistic linked data to support a cross-lingual transfer framework for sentiment analysis in the pharmaceutical domain. The proposed system dynamically gathers translations from the Linked Open Data (LOD) cloud, particularly from Apertium RDF, in order to project a deep learning-based sentiment classifier from one language...
Conference Paper
Full-text available
We describe the use of linguistic linked data to support a cross-lingual transfer framework for sentiment analysis in the pharmaceutical domain. The proposed system dynamically gathers translations from the Linked Open Data (LOD) cloud, particularly from Apertium RDF, in order to project a deep learning-based sentiment classifier from one language...
Conference Paper
Full-text available
In this paper we describe the contributions made by the European H2020 project "Prêt-à-LLOD" ('Ready-to-use Multilingual Linked Language Data for Knowledge Services across Sectors') to the further development of the Linguistic Linked Open Data (LLOD) infrastructure. Prêt-à-LLOD aims to develop a new methodology for building data value chains applic...
Chapter
LD technologies allow metadata of datasets to be exposed on the Web in order to improve their automated discovery, sharing and reuse by humans and software agents. In this chapter we deal with the representation of metadata for LRs, with the idea of enabling their cataloguing, discovery and later reuse. We will distinguish two types of metadata: ge...
Chapter
This chapter introduces the Lexicon Model for Ontologies (lemon) as defined by the Ontolex W3C community group. The model was originally developed to enrich ontologies with lexical information expressing how the elements of the ontology including classes, properties and individuals are referred to in a given language. In this chapter we cover the c...
Chapter
In previous chapters, we discussed how to model linguistic data sets using the Resource Description Framework as a basis to publish them as linked data on the Web. In this chapter, we describe a methodology that can be followed in the transformation of legacy linguistic datasets into linked data. The methodology comprises of different tasks, includ...
Chapter
This chapter introduces the Linguistic Linked Open Data (LLOD) Cloud. In recent years, there has been increasing interest in publishing linguistic datasets following linked data principles. A number of community-driven activities, foremost organized by the Open Linguistics Working Group (OWLG), have fostered and supported the publication of open li...
Chapter
This chapter introduces preliminaries that are essential to follow the content in the remainder of this book. First of all, we introduce the core data model of the Semantic Web and linked data, that is the Resource Description Framework, RDF. This format was designed in the 1990s and its core purpose is to represent data and knowledge in a Web-comp...
Chapter
In this chapter we address the question of how links can be discovered between different datasets published as Linguistic Linked Open Data. We describe common patterns to represent links both between data that are on the same language (monolingual scenario) and between data in different languages (cross-lingual scenario). Further, we describe techn...
Chapter
The (re-)usability of NLP tools and language resources has long been recognized as a key challenge in the language resource and NLP communities. Reuse of resources, however, requires a minimum level of interoperability, and in this chapter, we focus on conceptual interoperability, i.e. harmonization between different annotation schemas by means of...
Chapter
This chapter describes how linguistic annotations can be represented in RDF. Web Annotation and NIF provide the means to reference text segments on the web. Yet, representing linguistic annotations requires appropriate vocabularies. We discuss relevant vocabularies and illustrate how they can be applied to support annotation at different levels.
Chapter
Text annotation consists in defining markables (elements to be annotated), their features (attributes and values of annotations) and relations between markables (e.g. syntactic dependencies or semantic links). In this chapter we describe the principles for annotating text data using RDF-compliant formalisms. These principles provide the basis for m...
Chapter
The Linguistic Linked Data (LLD) paradigm was introduced about 8 years ago by the Open Linguistics Working Group (OWLG). The original mission of this group was to (1) promote the use of open standards in linguistics; (2) act as a central point of reference and provide support for those interested in open linguistic data; (3) develop best practices...
Chapter
In this chapter we describe principles and architectures that support the development of NLP workflows and pipelines based on linked data technology. The benefit of NLP workflows that build on linked data standards is that they build on an open set of data models and Web technologies that can be implemented with standard functionality not requiring...
Chapter
Finding appropriate language resources for a particular research purpose or task is of crucial importance and represents a significant challenge at the same time. Currently, there are a number of distributed data repositories which contain metadata about many language resources. However, the metadata formats and metadata content is not harmonized a...
Chapter
In recent years, Digital Humanities (DH) has become an increasingly flourishing field of research, often posing novel research challenges that require extensions or revisions of existing technologies. One characteristic of this area is the great heterogeneity of scientific disciplines and user communities involved. This leads to heterogeneity of da...
Chapter
Wordnets are the most widely used lexical resources in natural language processing (NLP). There exist wordnets in more than 40 languages by now and all of these are connected to the original Princeton WordNet. The origins of linguistic linked data (LD) can thus in some sense be traced to the WordNet project. The implementation of the linking, howev...
Book
This is the first monograph on the emerging area of linguistic linked data. Presenting a combination of background information on linguistic linked data and concrete implementation advice, it introduces and discusses the main benefits of applying linked data (LD) principles to the representation and publication of linguistic resources, arguing that...
Article
Full-text available
We are pleased to introduce this special issue on the topic of “Towards the Multilingual Web of Data”, which we feel is a timely and valuable topic in our increasingly multilingual and interconnected world.[...]
Article
Full-text available
As the interest of the Semantic Web and computational linguistics communities in linguistic linked data (LLD) keeps increasing and the number of contributions that dwell on LLD rapidly grows, scholars (and linguists in particular) interested in the development of LLD resources sometimes find it difficult to determine which mechanism is suitable for...
Chapter
Ontolex-lemon is the de facto standard to represent lexica relative to ontologies and it can be used to encode term banks as RDF. A multi-lingual, multi-jurisdictional term bank of copyright-related concepts has been published as linked data based on the ontolex-lemon model. The terminology links information from WIPO (concepts and definitions), IA...
Article
Bilingual electronic dictionaries contain collections of lexical entries in two languages, with explicitly declared translation relations between such entries. Nevertheless, they are typically developed in isolation, in their own formats and accessible through proprietary APIs. In this paper we propose the use of Semantic Web techniques to make tra...
Book
This book constitutes the combined refereed proceedings of ISWC Satellite Wor shops KEKI and NLP&DBpedia 2016 which were held in conjunction with ISWC 2016 in Kobe, Japan, in October 2016. The 9 papers presented were carefully selected and reviewed from 20 submissions. They focus on the use of linguistic linked open data, the linguistic aspects of...
Book
This book constitutes the proceedings of the First International Conference on Language, Data and Knowledge, LDK 2017, held in Galway, Ireland, in June 2017. The 14 full papers and 19 short papers included in this volume were carefully reviewed and selected from 68 initial submissions. They deal with language data; knowledge graphs; applications in...
Conference Paper
Recently, a growing number of linguistic resources in different languages have been published and interlinked as part of the Linguistic Linked Open Data (LLOD) cloud. However, in comparison to English and other prominent languages, the presence of Chinese in such a cloud is still limited, despite the fact that Chinese is the most spoken language wo...
Conference Paper
Full-text available
The Open Linguistics Working Group (OWLG) brings together researchers from various fields of linguistics, natural language processing, and information technology to present and discuss principles, case studies, and best practices for representing, publishing and linking linguistic data collections. A major outcome of our work is the Linguistic Link...
Conference Paper
Full-text available
The experiments presented here exploit the properties of the Apertium RDF Graph, principally cycle density and nodes' degree, to automatically generate new translation relations between words, and therefore to enrich existing bilingual dictionaries with new entries. Currently, the Apertium RDF Graph includes data from 22 Apertium bilingual dictiona...
Article
Full-text available
Ontology localization is the task of adapting an ontology to a different cultural context, and has been identified as an important task in the context of the Multilingual Semantic Web vision. The key task in ontology localization is translating the lexical layer of an ontology, i.e., its labels, into some foreign language. For this task, we hypothe...
Conference Paper
Full-text available
Terminesp is a multilingual terminological resource with terms from a range of specialized domains. Along with definitions, notes, scientific denominations and provenance information, it includes translations from Spanish into a variety of languages. A linked data resource with these features would represent a potentially relevant source of knowled...
Conference Paper
META-SHARE is an infrastructure for sharing Language Resources (LRs) where significant effort has been made into providing carefully curated metadata about LRs. However, in the face of the flood of data that is used in computational linguistics, a manual approach cannot suffice. We present the development of the META-SHARE ontology, which transform...
Article
Full-text available
Problem-based learning has been applied over the last three decades to a diverse range of learning environments. In this educational approach, different problems are posed to the learners so that they can develop different solutions while learning about the problem domain. When applied to conceptual modelling, and particularly to Qualitative Reason...
Conference Paper
Recently, experts and practitioners in language resources have started recognizing the benefits of the linked data (LD) paradigm for the representation and exploitation of linguistic data on the Web. The adoption of the LD principles is leading to an emerging ecosystem of multilingual open resources that conform to the Linguistic Linked Open Data C...
Chapter
Full-text available
Linked Data technologies and methods are enabling the creation of a data network where pieces of data are interconnected on the Web using machine-readable formats such as Resource Description Framework (RDF). This paradigm offers great opportunities to connect and make available knowledge in different languages. However, in order to make this visio...
Article
Full-text available
Articulating thought in computerbased media is a powerful means for humans to develop their understanding of phenomena. We have created DynaLearn, an intelligent learning environment that allows learners to acquire conceptual knowledge by constructing and simulating qualitative models of how systems behave. DynaLearn uses diagrammatic representatio...
Conference Paper
Full-text available
CIDER-CL is the evolution of CIDER, a schema-based ontology alignment system. Its algorithm compares each pair of ontology entities by analysing their similarity at different levels of their ontological context (linguistic description, superterms, subterms, related terms, etc.). Then, such elementary similarities are combined by means of artificial...
Conference Paper
Full-text available
In this article, we argue that there is a growing number of linked datasets in different natural languages, and that there is a need for guidelines and mechanisms to ensure the quality and organic growth of this emerging multilingual data network. However, we have little knowledge regarding the actual state of this data network, its current practic...
Article
Full-text available
Lexica and terminology databases play a vital role in many NLP applications, but currently most such resources are published in application-specific formats, or with custom access interfaces, leading to the problem that much of this data is in “data silos” and hence difficult to access. The Semantic Web and in particular the Linked Data initiative...
Conference Paper
Full-text available
Recently, the Semantic Web has experienced significant advancements in standards and techniques, as well as in the amount of semantic information available online. Even so, mechanisms are still needed to automatically reconcile semantic information when it is expressed in different natural languages, so that access to Web information across languag...
Article
Full-text available
The Semantic Web is an extension of the traditional Web in which meaning of information is well defined, thus allowing a better interaction between people and computers. To accomplish its goals, mechanisms are required to make explicit the semantics of Web resources, to be automatically processed by software agents (this semantics being described b...
Article
The Web has witnessed an enormous growth in the amount of semantic information published in recent years. This growth has been stimulated to a large extent by the emergence of Linked Data. Although this brings us a big step closer to the vision of a Semantic Web, it also raises new issues such as the need for dealing with information expressed in d...
Conference Paper
DynaLearn (http://www.DynaLearn.eu) develops a cognitive artefact that engages learners in an active learning by modelling process to develop conceptual system knowledge. Learners create external representations using diagrams. The diagrams capture conceptual knowledge using the Garp3 Qualitative Reasoning (QR) formalism [2]. The expressions can be...
Conference Paper
In this paper we present the cast of pedagogical agents in the DynaLearn Intelligent Learning Environment. We describe the different character roles and how they interact with the learners. Our aim in using these characters is to increase the learners’ motivation.
Conference Paper
Motivation is a critical requirement for successful learning. Previous research has identified that animated pedagogical agents can increase motivation. Following these results, we present the cast of pedagogical agents in the DynaLearn Intelligent Learning Environment. Each of these agents is associated with one of the different support types avai...
Conference Paper
Full-text available
Conceptual modeling is a complex task that requires domain specific knowledge as well as a good command of modeling techniques. In this paper we propose an approach that aims to capture relevant knowledge from an online pool of conceptual models. This knowledge is brought to the user in order to assist the construction of new conceptual models. Wit...
Conference Paper
Full-text available
CIDER is a schema-based ontology alignment system. Its algorithm compares each pair of ontology terms by, firstly, extracting their ontological contexts up to a certain depth (enriched by using lightweight inference) and, secondly, combining different elementary ontology matching techniques. In its current version, CIDER uses artificial neural netw...
Conference Paper
Full-text available
Tagging systems are nowadays a common feature in web sites where user-generated content plays an important role. However, the lack of semantics and multilinguality hamper information retrieval process based on folksonomies. In this paper we propose an approach to bring semantics to multilingual folksonomies. This approach includes a sense disambigu...
Technical Report
Full-text available
This document discusses the core Semantic Technologies in DynaLearn: i) The semantic repository, which supports the online storage and access of qualitative reasoning models, ii) the grounding process, which establishes semantic equivalences between the concepts in the models and the concepts in a background knowledge source, and iii) the use of on...
Conference Paper
There is a need for software that supports learners in actively dealing with theoretical concepts by having them create models and perform concept prediction and explanation (e.g. [3,4,5]). DynaLearn seeks to address this by developing a domain independent Interactive Learning Environment (ILE) based on Qualitative Reasoning (QR) [1]. The QR vocabu...
Conference Paper
Full-text available
DynaLearn is an Interactive Learning Environment that facilitates a constructive approach to developing a conceptual understanding of how systems work. The software can be put in different interactive modes facilitating alternative learning experiences, and as such provides a toolkit for educational research.
Conference Paper
Full-text available
Conceptual modelling tools allow users to construct formal representations of their conceptualisations. These models are typically developed in isolation, unrelated to other user models, thus losing the opportunity of incorporating knowledge from other existing models or ontologies that might enrich the modelling process. We propose to apply Semant...
Technical Report
Full-text available
This document describes the Ontology-Based Feedback process addressed to improve the quality of the models developed by learners. The core idea is to compare the knowledge contained in a learner model with the knowledge contained in a reference model (made by an expert) in order to get feedback on the quality of the former one. This is carried out...