Ely MatosUniversidade Federal de Juiz de Fora · Faculdade de Letras
Ely Matos
PhD - Cognitive Linguistics (Computational)
About
36
Publications
2,650
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
146
Citations
Publications
Publications (36)
This paper presents MoCCA, a Model of Comparative Concepts for Aligning Constructicons under development by a
consortium of research groups building Constructicons of different languages including Brazilian Portuguese, English,
German and Swedish. The Constructicons will be aligned by using comparative concepts (CCs) providing
language-neutral defi...
In this paper we present a database management and annotation tool for running an enriched FrameNet database, the FrameNet Brasil WebTool. We demonstrate how the entity-based model of such a tool allows for the addition of two types of data-structure to FrameNet Brasil, both of which aimed at refining the granularity of the semantic representations...
Public data systems gather a series of different information about Brazilian citizens. Such information is inserted in the system both via the selection of parameterized options and via open text fields. In this paper we describe the effort of modeling semantic frames for the lexicon of the healthcare domain as a means of tagging the open text fiel...
Public data systems gather different information about Brazilian citizens. Such information is inserted in the system both via the selection of parameterized options and via open text fields. In this paper we describe the effort of modeling semantic frames for the lexicon of the healthcare domain as a means of tagging the open text fields in public...
This paper reports on negative results in a task of automatic identification of schematic clausal constructions and their elements in Brazilian Portuguese. The experiment was set up so as to test whether form and meaning properties of constructions, modeled in terms of Universal Dependencies and FrameNet Frames in a Constructicon, would improve the...
Since its foundation in the 1980's, Construction Grammar has been crossing the traditionally imposed borders. From superimposed levels of analysis to the lexicon-grammar continuum, the constructionist approach to language has been built by, quoting Charles Fillmore, "the insistence on seeing specific grammatical patterns as serving given semantic (...
This paper presents Lutma, a collaborative, semi-constrained, tutorial-based tool for contributing frames and lexical units to the Global FrameNet initiative. The tool parameterizes the process of frame creation, avoiding consistency violations and promoting the integration of frames contributed by the community with existing frames. Lutma is struc...
This paper presents Charon, a web tool for annotating multimodal corpora with FrameNet categories. Annotation can be made for corpora containing both static images and video sequences paired - or not - with text sequences. The pipeline features, besides the annotation interface, corpus import and pre-processing tools.
This paper argues in favor of the adoption of annotation practices for multimodal datasets that recognize and represent the inherently perspectivized nature of multimodal communication. To support our claim, we present a set of annotation experiments in which FrameNet annotation is applied to the Multi30k and the Flickr 30k Entities datasets. We as...
Frame Semantics includes context as a central aspect of the theory. Frames themselves can be regarded as a representation of the immediate context against which meaning is to be construed. Moreover, the notion of frame invocation includes context as one possible source of information comprehenders use to construe meaning. As the original implementa...
In this paper we present Scylla, a methodology for domain adaptation of Neural Machine Translation (NMT) systems that make use of a multilingual FrameNet enriched with qualia relations as an external knowledge base. Domain adaptation techniques used in NMT usually require fine-tuning and in-domain training data, which may pose difficulties for thos...
There are now so-called constructiCons in development for several different languages. Hence, there are also possibilities for multilingual application of these linguistic resources. Due to the complexity of comparing grammatical constructions in different languages, an approach of trying to establish direct construction equivalents in different la...
Multimodal aspects of human communication are key in several applications of Natural Language Processing, such as Machine Translation and Natural Language Generation. Despite recent advances in integrating multimodality into Computational Linguistics, the merge between NLP and Computer Vision techniques is still timid, especially when it comes to p...
A FrameNet Brasil é um projeto de lexicografia computacional que tem como objetivo usar frames para a descrição de significados de palavras. Ela gera uma rede de frames que se ligam com relacionamentos específicos. O processo de criação ou atualização de um frame pode passar por mais de um especialista da Linguística, em momentos diferentes, o que...
In constructionist theory, a constructicon is an inventory of constructions making up the full set of linguistic units in a language. In applied practice, it is a set of construction descriptions – a “dictionary of constructions”. The development of constructicons in the latter sense typically means combining principles of both construction grammar...
In constructionist theory, a constructicon is an inventory of constructions making up the full set of linguistic units in a language. In applied practice, it is a set of construction descriptions – a “dictionary of constructions”. The development of constructicons in the latter sense typically means combining principles of both construction grammar...
p class="Default"> Este trabalho apresenta uma proposta para representar computacionalmente construções do Português Brasileiro, no âmbito do Constructicon da FrameNet Brasil. Dessa forma, demonstra de que maneira as teorias irmãs da Semântica de Frames e da Gramática das Construções podem ser implementadas computacionalmente com vias a sustentar a...
This paper presents an application developed for automatically suggesting translation equivalents in a frame-based domain specific trilingual electronic dictionary covering the domains of the World Cup and Tourism. By comparing the syntactic and semantic affordances of a lexical unit in the source language with those shown by all lexical units evok...
FrameNet Project is being developed by ICSI at Berkeley, with the goal of documenting the English language lexicon based on Frame Semantics. For Brazilian Portuguese, the FrameNet-Br Project, hosted at UFJF, follows the same theoretical and methodological perspective. This work presents a service-based infrastructure that combines Semantic Web tech...
This paper reports on the development of a domain-specific multilingual electronic dictionary covering the domains of soccer, tourism and the World Cup: the Copa 2014 FrameNet Brasil project. Specifically, we discuss three points: (i) the definition of the tourism frames and their deployment as interlingual representations; (ii) the implementation...
This paper proposes three policies for the annotation of constructions in FrameNet Brasil, and, potentially, in other FrameNets. Annotation policies are defined so as to both avoid uncontrolled redundancy in the database and respect the theoretical and methodological foundations of Frame Semantics and Construction Grammar. The first policy is conce...
This paper presents the Copa 2014 FrameNet Brasil software (C-14/FN-Br): a frame-based trilingual electronic dictionary covering the domains of Football, Tourism and the World Cup. The dictionary relies on the infrastructure of FrameNet and is meant to be used by tourists, journalists and the staff involved in receiving foreign visitors. Vocabulary...
While in the field of Syntax techniques, algorithms and applications in Natural Language Processing are well known and relatively well established, the same situation does not hold for the field of Semantics. Aiming at contributing to the studies in Computational Semantics, this work implements ideas and insights offered by Cognitive Linguistics, w...
Scientific computing is a multidisciplinary field that goes beyond the use of computer as machine where researchers write simple texts, presentations or store analysis and results of their experiments. Because of the huge hardware/software resources invested in experiments and simulations, this new approach to scientific computing currently adopted...
This paper describes QDA ontology - Quality Driven Approach for e-Science Ontologies, composed of stages, activities, participants, artifacts and quality criteria. The development process is centered in an evolutionary model, therefore, cycles can be repeated at each ontology evolution. The proposal approach is illustrated with artifacts from Cell...
The amount of information generated by biological research has lead to an intensive use of models. Mathematical and computational modeling needs accurate description to share, reuse and simulate models as formulated by original authors. In this paper, we introduce the Cell Component Ontology (CelO), expressed in OWL-DL. This ontology captures both...
The amount of information generated by biological researches has lead to a huge volume of biomedical data Internet-available. However, data are distributed into heterogeneous biological data sources, with little or even none information organization. Therefore, integration and exchange of data and scientific applications within and among organizati...
The amount of information generated by biological research has lead to an intensive use of models. Mathematical and computational
modeling needs accurate description to share, reuse and simulate models as formulated by original authors. In this paper,
we introduce the Cell Component Ontology - CelO, expressed in OWL-DL. This ontology captures both...
A quantidade e variedade de informações geradas pelas ciências biológicas têm levado a um uso intenso de modelos na área. Neste contexto, tanto a modelagem matemática quanto a modelagem computacional necessitam de uma descrição acurada que permita que esses modelos sejam compartilhados, reutilizados e simulados tal como foram formulados por seus cr...
The amount of information generated by research in biological sciences has lead to an intensive use of models. Mathematical and computational modeling need accurate description to share and simulate models as formulated by original authors. CellML is a markup language that uses XML to describe such models in the context of biological process. It al...
Modularity is crucial for the in silico design and testing of biological systems [1]. Recently, we developed an online library of modular mathematical model components for synthetic biology [2] using the modular model exchange format CellML [3]. In addition to synthetic biology, where new biological constructs are being created, this library is now...
Our research focus on the use of ontologies to enrich the construction of new cell models. In this paper we introduce the Cell Component Ontology - CelO, an ontology expressed in OWL-DL. This ontology captures both the structure of a cell model and the properties of functional components. We are using this ontology in a Web project CelOWS where the...
The amount of e-Science software components deserves a sophisticated treatment. This article presents the MathWS, a broker, based on ontologies, to register, discover and execute mathematical web services. The Broker prototype uses innovative technologies such as OWL-S and MathML languages, and inference engines. Services can implement an algorithm...