Rodrigo Alarcón’s research while affiliated with National Autonomous University of Mexico and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (11)


Figure 1. Describe Overview
Figure 2. Components on the server side
Web Exploitation for Definition Extraction
  • Conference Paper
  • Full-text available

December 2009

·

247 Reads

·

11 Citations

·

Rodrigo Alarcon

·

·

We present a system to extract definitions from a term using the Web. Definitions are organized according to a typology and its context. The structural and functional design is described; emphasizing its relevant components: the extractor of definitional context to candidates using the Yahoo!'s BOSS API; the extractor of definitional contexts and a clustering module based on textual energy as a measure of similarity.

Download

Fig. 1. Ontology building with definition extraction 
Fig. 1. System architecture. 
Fig. 2. Example of the identification of DCs elements. 
Fig. 2. Definition extraction and systematic lexicalization during BADELE upgrade 
Description and evaluation of a definition extraction system for Spanish language

January 2009

·

236 Reads

·

6 Citations

In this paper we present a description and evaluation of a pattern-based approach for definition extraction in Spanish specialised texts. The system is based on the search for definitional verbal patterns related to four different kinds of definitions: analytical, extensional, functional and synonimical. This system could be helpful in the development of ontologies, databases of lexical knowledge, glossaries or specialised dictionaries.


Table 2. DVPs employed for definitional knowledge extraction 
Figure 2. ECODE architecture  
Figure 3. Identification of left position's constitutive elements  
Table 6. Evaluation of DCs elements identification 
Definitional verbal patterns for semantic relation extraction

June 2008

·

3,297 Reads

·

75 Citations

Terminology International Journal of Theoretical and Applied Issues in Specialized Communication

In this paper we present a description of the role of definitional verbal patterns for the extraction of semantic relations. Several studies show that semantic relations can be extracted from analytic definitions contained in machine-readable dictionaries (MRDs). In addition, definitions found in specialised texts are a good starting point to search for different types of definitions where other semantic relations occur. The extraction of definitional knowledge from specialised corpora represents another interesting approach for the extraction of semantic relations. Here, we present a descriptive analysis of definitional verbal patterns in Spanish and the first steps towards the development of a system for the automatic extraction of definitional knowledge. http://repositori.upf.edu/handle/10230/6177


Fig. 1. Example of the identification of DCs elements  
Table 3 . Precision & Recall results
Table 4 . Evaluation of DCs elements identification
ECODE: A definition extraction system

October 2007

·

205 Reads

·

3 Citations

Lecture Notes in Computer Science

Terminological work aims to identify knowledge about terms in specialised texts in order to compile dictionaries, glossaries or ontologies. Searching for definitions about the terms that terminographers intend to define is therefore an essential task. This search can be done in specialised corpus, where they usually appear in definitional contexts, i.e. text fragments where an author explicitly defines a term. We present a research focused on the automatic extraction of those definitional contexts. The methodology includes three different processes: the extraction of definitional patterns, the automatic filtering of non-relevant contexts, and the automatic identification of constitutive elements, i.e., terms and definitions. http://hdl.handle.net/10230/24631


Developing a Definitional Knowledge Extraction System

January 2007

·

243 Reads

·

6 Citations

One of the main goals of terminological work is the identification of knowledge about terms in specialised texts. In order to compile dictionaries, glossaries or ontologies, terminographers use to search definitions about the terms they intend to define. The search of definitions can be done in specialised corpus, where they usually appear in definitional contexts, i.e. text fragments where an author explicitly defines a term. In this paper we present a research focused on the automatic extraction of those definitional contexts. Our methodology includes three different processes: the extraction of definitional patterns, the automatic filtering of non definitional contexts, and the automatic identification of constitutive elements, i.e., terms and definitions.


Figure 1: Typology of definitions 
Table 2 : Tags for identification of DC units
Figure 3: Three descriptions of XML tags in the Body 
Towards the building of a corpus of definitional contexts

January 2006

·

78 Reads

·

6 Citations

A repository of Definitional Contexts, even more than to be conceived of as traditional corpora, is very valuable tool for lexicography. This paper describes a corpus of Definitional Contexts, from the definition of the term to the description of the applications of the corpus in lexicography. The intended applications for the work, and how these affected the corpus design, are described. In addition, the methodology for corpus building and the corpus structure are outlined, and the preliminary results of the work in progress are presented.



Figure 1. Analytic definition 
Towards the extraction of conceptual information from corpora

January 2003

·

50 Reads

·

8 Citations

From a computational linguistics point of view, specifically related to information extraction, terminology uses statistical and rule-based methods (Cabré et. al. 2001) to extract terms from specialised texts. Furthermore, terminology needs to identify the corresponding definitions of a specific term. Often, when an author introduces a new term, which is not well known to the readers, he/she provides the definition emphasising the new concept with a set of syntactic and typographical features. We thus call definitional context the structure consisting of the term, the definition and the emphatic features in a specialised text. In order to develop a tool capable of extracting definitional contexts from annotated corpora, an inventory of recurrent patterns used by authors to introduce concepts is necessary, as well as a computational linguistic technique capable of identifying concepts from specialised texts. One of the objectives of the Language Engineering Group, from the Instituto de Ingeniería, UNAM (National Autonomous University of Mexico) is to extract conceptual information from corpora. For this reason, the group is working on the study of recurrent patterns in definitional contexts, the elaboration of an annotated corpus on engineering, and the development of the tools required for conceptual information extraction. In this paper we present the process of developing a definitional context extraction tool. First, we present the minimal elements of a definitional context. Second, we define a typology of all the recurrent patterns found in definitional contexts of Spanish specialised texts. Third, we present the kinds of tagging necessary. And fourth, we briefly mention the characteristics that a search engine must have. 2 Definitional contexts Our investigation is based on previous efforts such as the systematic identification of definitions based on lexical and metalinguistic patterns (Pearson, 1998), the analysis of Explicit Metalinguistic Operations (Rodríguez, 1999) and the analysis of Knowledge-Rich Contexts (Meyer, 2001).



El rol de las predicaciones verbales en la extracción automática de conceptos

219 Reads

·

19 Citations

In specialised texts, there are some recurrent typographical and syntactic patterns that authors use to define a term. Some of these syntactic patterns are verbal predications that function as connectors between terms and definitions. In this paper we analyse the role of the verbal predications involved in definitional contexts of specialised texts in Spanish, in order to develop a system capable of extracting definitional contexts.


Citations (11)


... Many researchers have used DE for developing question answering systems [13], ontology generation [21], supporting terminology applications [10,14,15] and for developing e-learning applications [1,18]. ...

Reference:

An Explainable Machine Learning Approach for Definition Extraction
Towards the building of a corpus of definitional contexts

... Como se mencionaba anteriormente, las pvd son patrones verbales recurrentes que introducen definiciones (Alarcón et al., 2007). Los verbos que se suelen emplear a la hora de proponer la definición de un término suelen ser être, s'entendre, désigner, dénommer, etc. Estos verbos pueden aparecer en forma de participio pasado o en una construcción pasiva en francés (est entendu, dénommé, etc.), en contraposición al español, que suele redactarse en forma reflexiva con la partícula se (se define, se denomina, se entiende) (Alarcón y Sierra, 2017). Teniendo esto en cuenta, a continuación se presenta una ecuación de búsqueda basada en cql. ...

El rol de las predicaciones verbales en la extracción automática de conceptos

... non definitional contexts), by using a set of linguistic and contextual rules to determine those cases where no definitional contexts are found. -The automatic identification of their constituent elements (i.e. the term and its definition), by making use of a decision tree [2]. -Finally, an automatic ranking of the results, by using a set of heuristic rules to identify those candidates that follow a prototypical structure of terms and definitions. ...

Description and evaluation of a definition extraction system for Spanish language

... In general words, those studies employ definitional patterns as a common start point for the extraction of knowledge about terms. Our own related previous work on the linguistic description and analysis of DC contexts as well as their place in the task of elaboration a definitional knowledge extraction tool, can be found in Alarcón & Sierra (2006). ...

Reglas léxico-metalingüísticas para la extracción automática de contextos definitorios
  • Citing Article
  • January 2006

... Algunas investigaciones han mostrado, con base en análisis de extracción de información y corpus lingüísticos, que efectivamente se puede extraer información de tipo conceptual vía contextos definitorios(Sierra, Medina, Alarcón & Aguilar, 2003) y que las definiciones de una palabra pueden ser divididas de acuerdo con su naturaleza en: sinonímicas -como las que ...

Towards the extraction of conceptual information from corpora

... Sea T un texto con etiquetas POS (part of speech) que contiene un marcador de Capaldi, M. Considérese el contexto izquierdo C_i y derecho C_d de M en T; es decir, la cadena C_iMC_d ocurre en T. Si para otro término M' se cumple que C_iM'C_d ocurre en T, diremos que M' satisface el contexto de M y, por lo tanto, puede ser un marcador equivalente a M. Este método tomó como base las ideas presentadas por Marti Hearst en la identificación de hipónimos [9]. Además puede aplicarse este enfoque de inducción en la identificación de definiciones [10]. ...

Developing a Definitional Knowledge Extraction System

... Finalmente, cabe mencionar los pp, los cuales sirven para contextualizar diferentes aspectos de los términos y las definiciones, con alusiones a autoría, temporalidad, contextos de uso, frecuencia de uso, etc. (Alarcón et al., 2007;Sierra et al., 2008), lo que ayudaría a extraer información analítica sobre el término. ...

CORPUS DE CONTEXTOS DEFINITORIOS: UNA HERRAMIENTA PARA LA LEXICOGRAFÍA Y LA TERMINOLOGÍA

·

Rodrigo Alarcón

·

·

[...]

·

Itzia Baca

... Describe is an important work that has been developed in Mexico [16] and is capable of extracting DCs from web searches. The approach used in Describe separates the task in two modules: the first deals with online searches using 15 patterns surrounding a term in an exact text search, and the second module filters out results that do not contain definitions by using decision trees and syntactic tags. ...

Web Exploitation for Definition Extraction

... Par exemple, dans le fragment : En general, el miedo a la aguja es el más frecuente, (En générale, la peur de l'aiguille est la plus fréquente) on observe que le terme aguja est présent ainsi que le patron ser+determinante, mais évidemment il ne s'agit pas d'une définition. Un fragment de texte est un candidat à contexte définitoire (CCD) s'il contient un terme et un PD mais il n'est pas une définition (Sierra et Alarcón 2002). La très grande quantité d'informations récupérées (environ 3,000 résultats par terme), s'opposait aux critères d'évaluation de l'étude (lecture et interprétation directe). ...

Identification of Recurrent Patterns to Extract Definitory Contexts

Lecture Notes in Computer Science