About
48
Publications
5,348
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
236
Citations
Introduction
Skills and Expertise
Current institution
Additional affiliations
September 2012 - present
Université de Lille, France
Position
- Professor (Associate)
September 2007 - July 2008
September 2006 - August 2007
Publications
Publications (48)
This article presents the results of a qualitative analysis of the use of the HAL platform by research laboratories. The analysis is based on semi-directive interviews with representatives from 50 laboratories affiliated to the ten Udice research universities. It focuses on the function that HAL fulfils for the laboratories, on its added value for...
Dans le cadre du programme de recherche e.thesaurus, l’orfèvrerie à l’épreuve de la modélisation 3D (Programme CPER MAuVE Médiations visuelles, culture numérique et création), le colloque des 11 et 12 mai 2023, organisé en partenariat avec le Groupe d’Étude de Recherche interdisciplinaire en information et COmmunication (GERiiCO-ULille–ULR 4073), e...
L’article présente les résultats d’une étude menée dans le cadre du projet HAL/LO, sur un échantillon de 1 246 laboratoires (=1 035 612 dépôts) rattachés aux dix grandes universités de recherche et membres de l’association Udice. L’objectif est une description plus détaillée des pratiques sur HAL. 99 % des laboratoires sont présents sur HAL, avec u...
(1) Background: The 2002 Budapest Open Access Initiative recommended the self-archiving of scientific articles in open repositories, which has been described as the “green road” to open access. Twenty years later, only one part of the researchers deposits their publications in open repositories; moreover, one part of the repositories’ content is no...
(1) Background: The 2002 Budapest Open Access Initiative recommended on self-archiving of scientific articles in open repositories as the “green road” to open access. Twenty years later, only one part of the researchers deposits their publications in open repositories; moreover, one part of the repositories’ content is not based on self-archived de...
Cet ouvrage réunit dix contributions au colloque Le corpus audiovisuel – Quelles approches ? Quels usages. Ce colloque s’est tenu en juin 2019 à l’Inalco (Institut National des Langues et Civilisations Orientales) à Paris et a été organisé par Odile Farge, Louise Ouvrard et Peter Stockinger de l’équipe Plidam (EA 4514 - Pluralité des langues et des...
Content fully available from the Conference program: https://mussi2018.sciencesconf.org/resource/page/id/10
The organisation of free access to scientific data is one of France’s public research objectives. The commitment to open research data has been confirmed by the National Action Plan –, whose commitment is to build an open science ecosystem. On the ground, the policy of openness is accompanied by a strong incentive to implement good scientific pract...
Purpose
This paper aims to show how Master’s theses can contribute to open scholarship and give reasons why this should be done.
Design/methodology/approach
The paper provides an overview of published studies and, based on the experience at the University of Lille (France), describes some essential aspects for the processing and valorization of th...
The TERRE-ISTEX project aims to identify scientific research dealing with specific geographical territories areas based on heterogeneous digital content available in scientific papers. The project is divided into three main work packages: (1) identification of the periods and places of empirical studies, and which reflect the publications resulting...
Das DOREMUS Projekt strebt eine bessere Beschreibung von Musik an, indem es Daten dreier französicher Institutionen untersucht und zusammenführt. Der vorliegende Artikel gibt einen Überblick über das auf FRBRoo basierende Datenmodell, das die automatische Umwandlung und Verlinkung von Daten ermöglicht. Er stellt Prototypen vor, wie die Daten nach d...
Le projet ANR Dorémus vise à proposer un modèle générique de description de l'information musicale permettant de rendre disponible et accessible ce type d'information riche mais peu normée et donc généralement hétérogène entre les différentes descriptions existantes. Cet objectif doit faire face à trois contraintes divergentes : besoin d'un modèle...
Cette communication cherche à identifier et expliquer la manière dont les biblio-thèques municipales du territoire de la Métropole Européenne Lilloise (MEL), en tant qu'espaces physiques et symboliques au coeur de l'organisation des connais-sances et de la construction des savoirs, intègrent l'« innovation » dans leurs pra-tiques. À travers les rép...
Le projet pluridisciplinaire TALIE est dédié, dans sa partie « Textes », à la valori-sation de la tradition des oeuvres de l'Antiquité gréco-romaine telle qu'elle est re-présentée dans les fonds anciens ou patrimoniaux des bibliothèques de la région Nord-Pas de Calais-Picardie, sous la forme des manuscrits portant ces oeuvres, mais aussi des éditio...
Within the Emergency Medical Service (EMS), Medical Regulation Assistant (MRA) are in the front line when it comes to managing emergencies. MRA is the first point of contact to take charge of callers. This feature, in the heart of the functioning and prerogatives of EMS is at the crossroads of medical emergency services in the covered area. Even if...
Problem/goal The paper provides an overview and empirical evidence on the usability of electronic theses and dissertations (ETDs) and related research data for text and data mining (TDM) techniques. Research method/procedure The first part of the paper is a review of recent publications and projects on the potential and usefulness of ETDs for TDM,...
See https://hal.univ-lille3.fr/hal-01911939v2
Purpose – Print theses and dissertations have regularly been submitted together with complementary material, such as maps, tables, speech samples, photos or videos, in various formats and on different supports. In the digital environment of open repositories and open data, these research results could b...
As a collaborative work, the online encyclopaedia Wikipedia leads naturally the contributors to work with each other and to face their opinions. But no frame is provided to control the collaboration, neither in the five fundamental principles, nor from the wiki software. This article studies how the contributing community thinks up original ways to...
La rapide expansion de ce que recouvre la notion de web 2.0 propage sur Internet de nouveaux usages. Les outils de recherche d'information sur le web doivent maintenant te-nir compte des phénomènes nouveaux liés à l'apparition des blogs, wikis, et autres sites collaboratifs de publication. D'une part, les internautes du web 2.0 créent, en même temp...
In Knowledge Management, variations in information expressions have proven a real challenge. In particular, classical semantic relations (e.g. synonymy) do not connect words with different parts-of-speech. The method proposed tries to address this issue. It consists in building a derivational resource from a morphological derivation tool together w...
While social network analysis often focuses on graph structure of social actors, an increasing number of communication networks now provide textual content within social activity (email, instant messaging, blogging, collaboration networks). We present an open source visualization software, GraphDuplex, which brings together social structure and tex...
The on-line encyclopaedia Wikipedia is a pragmatic patchwork, an assemblage of many singular points of view on a given subject, based on rules of clarity and communicability of public statements. Its success reflects a transformation of our relationship with knowledge. In this article the authors use a social map of the conflicts in the on-line enc...
RÉSUMÉ. Les outils de recherche d'information sur le web doivent tenir compte des phénomènes nouveaux liés à l'apparition des blogs, wikis, et autres publications collaboratives. Parmi ces sites, l'encyclopédie Wikipédia constitue une source importante d'information. La qualité de ses informations a pourtant été récemment mise en cause. Mieux conna...
"Semantic Atlas" is a mathematic and statistic model to visualise word senses according to relations between words. The model, that has been applied to proximity relations from a corpus, has shown its ability to distinguish word senses as the corpus' contributors comprehend them. We propose to use the model and a specialised corpus in order to crea...
Wikipedia is nowadays a widely used encyclopedia, and one of the most visible sites on the Internet. Its strong principle of collaborative work and free editing sometimes generates disputes due to disagreements between users. In this article we study how the wikipedian community resolves the conflicts and which roles do wikipedian choose in this pr...
Les Atlas sémantiques sont un modèle mathématique et statistique de représentation vi-suelle de la sémantique lexicale basé sur l'examen des relations entre les mots. Une application de ce modèlè a des relations de proximité contextuelle dans un corpus a permis de montrer que le modèlé etait capable de dénoter le sens des unités lexicales tel qu'il...
In this article, we propose an automatic process to build multi-lingual lexico-semantic resources. The goal of these resources is to browse seman- tically textual information contained in texts of different languages. This method uses a mathematical model called Atlas semantiques in order to represent the different senses of each word. It uses the...
Statement of the English Wikipedia dispute resolution process for comparison to the French corresonding process.
Systems and methods for indexing and searching the inner structure of a string over a language having a vocabulary and a grammar using bit vectors. The index preserves the inner gramatical structure of the string while allowing for a fast search. A single search provides immediate access to every level of a document, without having to re-search a s...
Systems and methods for indexing and searching the inner structure of a string over a language having a vocabulary and a grammar using bit vectors. The index preserves the inner grammatical structure of the string while allowing for a fast search. A single search provides immediate access to every level of a document, without having to re-search a...
In textual knowledge management, statistical methods prevail. Nonetheless, some difficulties cannot be overcome by these methodologies. I propose a symbolic approach using a complete textual analysis to identify which analysis level can improve the the answers provided by a system. The approach identifies word senses and relation between words and...
This paper presents a lexical disambiguation system, initi ally developed for English and now adapted to French. This system associates a word with its meaning in a given context using electronic dictionaries as semantically annotated c orpora in order to extract semantic disambiguation rules. We describe the rule extraction and a pplication proces...
In textual knowledge management, statistical methods prevail. Nonetheless, some dif- ficulties cannot be overcome by these methodologies. I propose a symbolic approach using a complete textual analysis to identify which analysis level can improve the the answers provided by a system. The approach identifies word senses and relation between words an...
This paper presents an original way to add new data in a reference dictionary from several other lexical resources, without loosing any consistence. This operation is carried in order to get lexical information classified by the sense of the entry. This classification makes it possible to enrich utterances (in QA: the queries) following the meaning...
This paper presents an original methodology to consider question answering. We noticed that query expansion is often incorrect because of a bad understanding of the question. But the automatic good understanding of an utterance is linked to the context length, and the question are often short. This methodology proposes to analyse the documents and...
Cette thèse présente une méthode originale pour identifier et structurer l'information de documents et pour l'interroger. Comme les méthodes linguistiques améliorent les résultats des systèmes actuels, cette approche se base sur des analyses linguistiques et des ressources lexicales. Une analyse grammaticale de haut niveau (morphologique, syntaxiqu...
External linguistic resources have been used for a very long time in information extraction. These methods enrich a document with data that are semantically equivalent, in order to improve recall. For instance, some of these methods use synonym dictionaries. These dictionaries enrich a sentence with words that have a similar meaning. However, these...
This paper presents a lexical disambiguation system, initially developed for English and now adapted to French. This system associates a word with its meaning in a given context using electronic dictionaries as semantically annotated corpora in order to extract semantic disambiguation rules. We describe the rule extraction and application process a...