Bernard Jacquemin

  • PhD - Natural Language Processing
  • Professor (Associate) at University of Lille

About

48
Publications
5,348
Reads
236
Citations
Current institution
University of Lille
Current position
  • Professor (Associate)
Additional affiliations
September 2012 - present
Université de Lille, France
Position
  • Professor (Associate)
September 2007 - July 2008
September 2006 - August 2007
University of Upper Alsace
Position
  • ATER Information and communication science
Education
February 2000 - December 2003
Sorbonne Nouvelle University
Field of study
  • Language sciences -- Linguistics and computer science
September 1998 - August 1999
Gustave Eiffel University
Field of study
  • Scientific and technical information -- Language engineering
September 1997 - August 1998
Catholic University of Louvain
Field of study
  • Language engineering

Publications (48)
Article
Full-text available
This article presents the results of a qualitative analysis of the use of the HAL platform by research laboratories. The analysis is based on semi-directive interviews with representatives from 50 laboratories affiliated to the ten Udice research universities. It focuses on the function that HAL fulfils for the laboratories, on its added value for...
Chapter
As part of the research programme e.thesaurus, l’orfèvrerie à l’épreuve de la modélisation 3D (goldsmithing put to the test of 3D modelling; CPER MAuVE programme: visual mediations, digital culture and creation), the conference of 11 and 12 May 2023, organised in partnership with the Groupe d’Étude de Recherche interdisciplinaire en information et COmmunication (GERiiCO-ULille–ULR 4073), ...
Article
Full-text available
The article presents the results of a study conducted as part of the HAL/LO project on a sample of 1,246 laboratories (= 1,035,612 deposits) attached to the ten major research universities that are members of the Udice association. The aim is a more detailed description of practices on HAL. 99% of the laboratories are present on HAL, with ...
Article
Full-text available
(1) Background: The 2002 Budapest Open Access Initiative recommended the self-archiving of scientific articles in open repositories, which has been described as the “green road” to open access. Twenty years later, only some researchers deposit their publications in open repositories; moreover, part of the repositories’ content is no...
Preprint
Full-text available
(1) Background: The 2002 Budapest Open Access Initiative recommended the self-archiving of scientific articles in open repositories as the “green road” to open access. Twenty years later, only some researchers deposit their publications in open repositories; moreover, part of the repositories’ content is not based on self-archived de...
Chapter
This volume brings together ten contributions to the conference Le corpus audiovisuel – Quelles approches ? Quels usages. The conference was held in June 2019 at Inalco (Institut National des Langues et Civilisations Orientales) in Paris and was organised by Odile Farge, Louise Ouvrard and Peter Stockinger of the Plidam team (EA 4514 - Pluralité des langues et des...
Book
Full-text available
Content fully available from the Conference program: https://mussi2018.sciencesconf.org/resource/page/id/10
Chapter
The organisation of free access to scientific data is one of France’s public research objectives. The commitment to open research data has been confirmed by the National Action Plan –, whose commitment is to build an open science ecosystem. On the ground, the policy of openness is accompanied by a strong incentive to implement good scientific pract...
Article
Full-text available
Purpose: This paper aims to show how Master’s theses can contribute to open scholarship and give reasons why this should be done. Design/methodology/approach: The paper provides an overview of published studies and, based on the experience at the University of Lille (France), describes some essential aspects for the processing and valorization of th...
Preprint
The TERRE-ISTEX project aims to identify scientific research dealing with specific geographical areas based on heterogeneous digital content available in scientific papers. The project is divided into three main work packages: (1) identification of the periods and places of empirical studies, and which reflect the publications resulting...
Article
Full-text available
The DOREMUS project aims at a better description of music by examining and merging data from three French institutions. This article gives an overview of the FRBRoo-based data model, which enables the automatic conversion and linking of data. It presents prototypes showing how the data ...
Conference Paper
Full-text available
The ANR project Dorémus aims to propose a generic model for describing musical information, so that this type of information can be made available and accessible; it is rich but poorly standardised, and therefore generally heterogeneous across the various existing descriptions. This objective must contend with three divergent constraints: the need for a model...
Conference Paper
Full-text available
This paper seeks to identify and explain how the municipal libraries in the territory of the Métropole Européenne Lilloise (MEL), as physical and symbolic spaces at the heart of the organisation of knowledge and the construction of learning, integrate “innovation” into their practices. Through the ...
Conference Paper
Full-text available
The multidisciplinary TALIE project is dedicated, in its “Texts” part, to promoting the tradition of the works of Greco-Roman Antiquity as it is represented in the old or heritage collections of the libraries of the Nord-Pas de Calais-Picardie region, in the form of the manuscripts bearing these works, but also of ...
Article
Full-text available
Within the Emergency Medical Service (EMS), Medical Regulation Assistants (MRAs) are on the front line when it comes to managing emergencies. The MRA is the first point of contact for callers. This role, at the heart of the functioning and prerogatives of the EMS, is at the crossroads of medical emergency services in the covered area. Even if...
Conference Paper
Full-text available
Problem/goal: The paper provides an overview and empirical evidence on the usability of electronic theses and dissertations (ETDs) and related research data for text and data mining (TDM) techniques. Research method/procedure: The first part of the paper is a review of recent publications and projects on the potential and usefulness of ETDs for TDM,...
Article
See https://hal.univ-lille3.fr/hal-01911939v2 Purpose – Print theses and dissertations have regularly been submitted together with complementary material, such as maps, tables, speech samples, photos or videos, in various formats and on different media. In the digital environment of open repositories and open data, these research results could b...
Article
Full-text available
As a collaborative work, the online encyclopaedia Wikipedia naturally leads contributors to work with one another and to confront their opinions. But no framework is provided to govern the collaboration, neither in the five fundamental principles nor in the wiki software. This article studies how the contributing community thinks up original ways to...
Conference Paper
Full-text available
The rapid expansion of what the notion of web 2.0 covers is spreading new uses across the Internet. Web information retrieval tools must now take into account the new phenomena linked to the emergence of blogs, wikis and other collaborative publishing sites. On the one hand, web 2.0 users create, at the same time...
Conference Paper
Full-text available
In Knowledge Management, variations in information expressions have proven a real challenge. In particular, classical semantic relations (e.g. synonymy) do not connect words with different parts-of-speech. The method proposed tries to address this issue. It consists in building a derivational resource from a morphological derivation tool together w...
Conference Paper
Full-text available
While social network analysis often focuses on graph structure of social actors, an increasing number of communication networks now provide textual content within social activity (email, instant messaging, blogging, collaboration networks). We present an open source visualization software, GraphDuplex, which brings together social structure and tex...
Article
Full-text available
The on-line encyclopaedia Wikipedia is a pragmatic patchwork, an assemblage of many singular points of view on a given subject, based on rules of clarity and communicability of public statements. Its success reflects a transformation of our relationship with knowledge. In this article the authors use a social map of the conflicts in the on-line enc...
Conference Paper
Full-text available
ABSTRACT. Web information retrieval tools must take into account the new phenomena linked to the emergence of blogs, wikis and other collaborative publications. Among these sites, the encyclopaedia Wikipedia is an important source of information. The quality of its information has, however, recently been called into question. Better ...
Article
"Semantic Atlas" is a mathematical and statistical model for visualising word senses according to relations between words. The model, which has been applied to proximity relations from a corpus, has shown its ability to distinguish word senses as the corpus's contributors understand them. We propose to use the model and a specialised corpus in order to crea...
Conference Paper
Full-text available
Wikipedia is nowadays a widely used encyclopedia, and one of the most visible sites on the Internet. Its strong principle of collaborative work and free editing sometimes generates disputes due to disagreements between users. In this article we study how the Wikipedian community resolves conflicts and which roles Wikipedians choose in this pr...
Chapter
Full-text available
Semantic Atlases (Atlas sémantiques) are a mathematical and statistical model for the visual representation of lexical semantics, based on examining the relations between words. Applying this model to relations of contextual proximity in a corpus showed that the model was able to denote the meaning of lexical units as it...
Article
Full-text available
In this article, we propose an automatic process to build multilingual lexico-semantic resources. The goal of these resources is to browse semantically the textual information contained in texts in different languages. This method uses a mathematical model called Atlas sémantiques in order to represent the different senses of each word. It uses the...
Raw Data
Full-text available
Statement of the English Wikipedia dispute resolution process, for comparison with the corresponding French process.
Patent
Full-text available
Systems and methods for indexing and searching the inner structure of a string over a language having a vocabulary and a grammar using bit vectors. The index preserves the inner grammatical structure of the string while allowing for a fast search. A single search provides immediate access to every level of a document, without having to re-search a s...
Patent
Full-text available
Systems and methods for indexing and searching the inner structure of a string over a language having a vocabulary and a grammar using bit vectors. The index preserves the inner grammatical structure of the string while allowing for a fast search. A single search provides immediate access to every level of a document, without having to re-search a...
Conference Paper
Full-text available
In textual knowledge management, statistical methods prevail. Nonetheless, some difficulties cannot be overcome by these methodologies. I propose a symbolic approach using a complete textual analysis to identify which analysis level can improve the answers provided by a system. The approach identifies word senses and relations between words and...
Article
Full-text available
This paper presents a lexical disambiguation system, initially developed for English and now adapted to French. This system associates a word with its meaning in a given context using electronic dictionaries as semantically annotated corpora in order to extract semantic disambiguation rules. We describe the rule extraction and application proces...
Article
In textual knowledge management, statistical methods prevail. Nonetheless, some difficulties cannot be overcome by these methodologies. I propose a symbolic approach using a complete textual analysis to identify which analysis level can improve the answers provided by a system. The approach identifies word senses and relations between words an...
Conference Paper
Full-text available
This paper presents an original way to add new data to a reference dictionary from several other lexical resources without losing any consistency. This operation is carried out in order to get lexical information classified by the sense of the entry. This classification makes it possible to enrich utterances (in QA: the queries) following the meaning...
Conference Paper
Full-text available
This paper presents an original methodology for question answering. We noticed that query expansion is often incorrect because the question is poorly understood. But good automatic understanding of an utterance depends on the length of its context, and questions are often short. This methodology proposes to analyse the documents and...
Thesis
Full-text available
This thesis presents an original method for identifying and structuring the information in documents and for querying it. Since linguistic methods improve the results of current systems, this approach is based on linguistic analyses and lexical resources. A high-level grammatical analysis (morphological, syntactic...
Conference Paper
Full-text available
External linguistic resources have been used for a very long time in information extraction. These methods enrich a document with data that are semantically equivalent, in order to improve recall. For instance, some of these methods use synonym dictionaries. These dictionaries enrich a sentence with words that have a similar meaning. However, these...
Article
Full-text available
This paper presents a lexical disambiguation system, initially developed for English and now adapted to French. This system associates a word with its meaning in a given context using electronic dictionaries as semantically annotated corpora in order to extract semantic disambiguation rules. We describe the rule extraction and application process a...
