Slim Mesfar

Institut des Hautes Etudes Commerciales de Carthage · Méthodes quantitatives

Associate professor

About

26 Publications
5,370 Reads
259 Citations
Additional affiliations
March 2005 - November 2008
University of Franche-Comté
Position
  • Doctoral student

Publications

Chapter
Language resources are a necessary component of language development in NLP. They are useful for any empirical language study, including linguistic analysis, translation, and disambiguation. The linguistic development environment NooJ (http://www.nooj4nlp.net/) allows formalizing complex linguistic phenomena such as compound words ge...
Chapter
Most question-answering systems are designed to answer short questions (with direct answers such as dates or locations), but little research has addressed complex questions. In this paper, we present a method for analyzing complex questions at the syntactic and morphological levels with a pattern-based structure. These linguist...
Chapter
Most question-answering systems today are designed to answer factoid or binary questions (seeking short, precise answers such as dates or locations); however, little research has been carried out on complex questions.
Book
This book constitutes the refereed proceedings of the 13th International Conference, NooJ 2019, held in Hammamet, Tunisia, in June 2019. NooJ is a linguistic development environment that allows linguists to formalize several levels of linguistic phenomena. NooJ provides linguists with tools to develop dictionaries, regular grammars, context-free gr...
Article
Given the continuous proliferation of online journalistic content and the changing political landscape in many Arab countries, we began this research to implement a media monitoring system for opinion mining in the political field. This system allows political actors, despite the large volume of online data, to be const...
Conference Paper
NooJ is a linguistic development environment that allows formalizing complex linguistic phenomena such as the generation, processing, and analysis of compound words. We take advantage of the strength of NooJ's linguistic engine to create a new large-coverage terminological dictionary of compound words for Modern Standard Arabic. Classi...
Conference Paper
Owing to their wide popularity and the easy access they give to published content, social media such as Facebook and Twitter have attracted media outlets seeking to disseminate their information (news, events …). Events are now shared on social media at a much faster pace. These events cover several topics like p...
Conference Paper
CompounDic is an Arabic dictionary of multiword expressions (MWEs) whose entries are divided into more than 20 domains. It lists MWEs only in their base form. With regard to syntactic and morphological flexibility, the lexicon covers two types of MWEs: fixed MWEs (no variation allowed) and semi-fixed MWEs (variation in their structural pattern). Arabic presents distinct...
Conference Paper
Given the continuous growth of online journalistic content and the changing political landscape in many Arab countries, we began our research on implementing a media monitoring system for opinion mining in the political field. This system allows political actors, despite the large volume of online data, to be constantly informe...
Article
Since 2006, we have been using NLP software to describe the differences between 17th-century English and contemporary English. Studying a corpus spanning the whole century (tales of English travellers in the Ottoman Empire in the 17th century, Mary Astell's essay A Serious Proposal to the Ladies and other literary texts) has enabled us to...
Conference Paper
Research on texts often needs to rely on the latest scientific methods to improve its results. For old manuscripts, which take us far back in time and often contain obscure passages, the latest software techniques may help. For the scrutiny of one of our manuscripts, an Arabic translation of a now lost Greek treatise, why not use the...
Conference Paper
This paper presents a cascade of morpho-syntactic tools for Arabic natural language processing. It begins with the description of a large-coverage formalization of the Arabic lexicon. The resulting electronic dictionary, named "El-DicAr", which stands for "Electronic Dictionary for Arabic", links inflectional, morphological, and syntactic-sema...
Article
The abundance of Arabic compound nouns in medical corpora requires their listing in electronic dictionaries. However, generating all potential inflected forms and recognizing the agglutinated forms attached to each entry require a special tokenization and inflection process, owing to the linguistic specificities of these lexical entries. T...
Conference Paper
In this paper, we propose an Arabic Question-Answering (Q-A) system called QASAL (Question-Answering system for Arabic Language). QASAL accepts as an input a natural language question written in Modern Standard Arabic (MSA) and generates as an output the most efficient and appropriate answer. The proposed system is composed of three modules: A ques...
Conference Paper
In this paper, we describe the use of an incremental method for constructing minimal, acyclic, deterministic FSTs. The approach consists in constructing a transducer in a single step by adding new strings one by one and minimizing the resulting automaton incrementally. Then, we present a new method to encode the morphological information associated w...
Conference Paper
Named entities (NEs) occur frequently in Arabic texts, and their recognition is essential. Recognizing and categorizing NEs requires both internal (morphological) and external (syntactic) evidence. This paper describes a system that combines a morphological parser and a syntactic parser, both built with the NooJ linguistic development environmen...
Article
1. INTEX and DELA system dictionaries. The INTEX linguistic engine (Silberztein 1993) used two sets of dictionaries that were designed at Prof. Maurice Gross's LADL laboratory: on the one hand, DELA-type dictionaries describe simple and compound words (Courtois, Silberztein 1990) within four types of dictionaries: DELAS: Electronic Dictionary of LAD...
Article
The amount of available information is growing enormously, especially with the expansion of the Web. The problem users face is not a lack of documents or information but a lack of time to find a short and precise answer among the variety of available documents. Search engines offer many links to web pages, but are not able to pro...
Article
This article describes the construction of a lexicon and a morphological description for Standard Arabic. The system uses finite-state technology to parse vowelled texts as well as partially vowelled and unvowelled ones. It is based on large-coverage morphological grammars covering all grammatical rules.
