Jean-Luc MinelUniversity Paris Nanterre · Linguistics
Jean-Luc Minel
HDR (Habiltitation to supervise PhD )
About
160
Publications
15,515
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
619
Citations
Introduction
Jean-Luc Minel is professor emeritus in Linguistics studies at University of Paris Nanterre (France). He is a researcher at the MoDyCo laboratory accredited by the CNRS in which he currently develops models and tools to process texts relying on Deep Learning approaches. He was head of MoDyCo laboratory from 2008 and 2017. He is presently the chairman of scientific committee of the Very Large Digital Infrastructure Huma-Num and scientific advisor at Haut Conseil de l’évaluation de la recherche
Skills and Expertise
Publications
Publications (160)
This article delves into the debates, both in print and digital media, surrounding copper mining and its associated concerns, particularly those related to hardrock mining. Guided by agenda-building, pragma-dialectics, and stakeholder theories, this research employs topic modeling to scrutinize the media strategies and arguments employed by key sta...
Cet article vise à analyser la publicisation des débats sur les orientations de politiques publiques, les choix sociotechniques et les solutions alternatives avancés par des acteurs et actrices, pour faire face aux situations de sécheresse et de pénurie en eau. Il prend pour cas d’étude l’État d’Arizona (États-Unis), dans lequel ces situations se m...
De 2019 à 2023, des représentants des Archives nationales, du laboratoire Dicen-IDF du Conservatoire national des arts et métiers et du Centre Jean-Mabillon de l’École nationale des chartes se sont réunis au sein du séminaire « Les nouveaux paradigmes de l’archive », mené avec le soutien du Laboratoire d’excellence HASTEC. Ils y ont accueilli plus...
This research focuses, in the media public sphere, on the modalities of construction and progressive visibility of persistent symptoms of certain forms of Covid-19, designated by the category “Long Covid”. The aim is to analyze the process according to which this particular form of the disease appeared in the media in June 2020, was the subject of...
This study examined research centre Biosphere 2 (B2) coverage by US newspapers between 1984 (as stories of conception before construction emerged) and 2019 (at the time this research was conducted) in order to uncover news diffusion relative to B2 in public media across historic eras and amid shifts in stakeholders over time. The analysis focussed...
How can researchers identify suitable research data repositories for the deposit of their research data? Which repository matches best the technical and legal requirements of a specific research project? For this end and with a humanities perspective the Data Deposit Recommendation Service (DDRS) has been developed as a prototype. It not only serve...
Les politiques d’accès ouvert aux données culturelles des musées concernent désormais la mission de diffusion et de partage des collections et des connaissances. Cet article étudie les régimes d’accès et de circulation médiatique du patrimoine numérisé mis à disposition sous la forme de données culturelles en privilégiant une entrée par la médiatio...
This article focuses on the development of an instrumented methodology for modeling and analyzing the circulation message flows concerning air quality on the social network Twitter. This methodology aims at describing and representing, on the one hand, the modes of circulation and distribution of message flows on this social media and, on the other...
Environmental health is an emerging and hotly debated topic that covers several fields of study such as pollution in urban or rural environments and the consequences of these changes on health populations. In this field of intersectorial forces, the complexity of stakeholders’ logics is realized in the production, use and communication of data and...
In this paper, we first present a model to represent message flows and their contents on Twitter, then a model and an instrumented methodology to describe and analyze these flows and their distribution among the various stakeholders. The aim is to explore the engagement and interactions between different types of stakeholders. We apply our methodol...
In this paper, we first present a model to represent message flows and their contents on Twitter, then a model and an instrumented methodology to describe and analyze these flows and their distribution among the various stakeholders. The aim is to explore the engagement and interactions between different types of stakeholders. We apply our methodol...
In this paper, we first present a representation of message flows and their contents on Twitter, then an instrumented methodology to describe and analyze these flows and their distribution among the various stakeholders. The aim is to explore the engagement and interactions between different types of stakeholders. We apply our methodology and tools...
Interoperability is analyzed here from a conceptual, technical, and cultural standpoint. The authors examine institutional policies and practices in the heritage field with respect to digitization and the provision of cultural resources in digital format by probing possible forms of interoperability. They then showcase these forms and the visibilit...
Cet article est centre sur les strategies institutionnelles et les pratiques de mediation patrimoniale des musees adossees aux agencements socio-techniques du LOD. Dans le secteur culturel, les enjeux d’image, de visibilite et de diffusion des objets patrimoniaux poses par la mise en œuvre du web de donnees ouvert sont mis en evidence. Les modes d’...
Au cours des vingt dernières années, les musées ont largement développé des politiques d’accès aux ressources culturelles en s’appuyant sur des supports et des technologies numériques, qu’il s’agisse d’écrans installés dans les expositions, de sites web dédiés, d’applications développées sur des smartphones, de l’utilisation des réseaux sociaux num...
Comment le monde des musées va-t-il évoluer dans les prochaines années ? Quelle pourrait être sa place sur Internet ? Existe-t-il des différences marquantes entre les musées européens, américains, asiatiques et africains ? La muséologie, entendue ici comme l'ensemble des théories et réflexions critiques liées au champ muséal, a connu des développem...
The goal of this paper is to analyze messages sent on the Twitter socialnetwork during the MuseumWeek event. This analysis relies on quantitative and qualitative studies, which were benchmarked with the MuseumWeek event.
The goal of this paper is to analyze messages sent on the Twitter social network during the Museum Week event. This analysis relies on quantitative and qualitative studies, which were benchmarked with the Museum Week event.
The goal of this paper is to present a tool-based methodology which has been developed to analyze messages sent on the Twitter social network. This methodology implements quantitative and qualitative analyses, which were benchmarked with the “MuseumWeek” event.
In this paper we elaborate over the use of sequential supervised learning methods on the task of hedge cue scope detection. We address the task using a learning methodology that proposes the use of an iterative, error-based approach to improve classification performance. We analyze how the incorporation of syntactic constituent information to the l...
This paper studies the dynamics of linkage between cultural communication and innovative digital technologies. We hypothesize that the study of the communication policies, publishing practices and digital publication of museums can provide a way to analyze recent developments in forms of mediation of works and their related contents. A quantitative...
La protection des données personnelles est un sujet complexe porteur d'enjeux majeurs dans la société. Inscrit dans l'information d'actualité, ce sujet sociétal est largement commenté et discuté dans les médias. Cette communication aborde le traitement médiatique dont il fait l'objet dans l'espace public médiatisé contemporain sur Internet. Nous av...
Cette communication présente une approche instrumentée du traitement médiatique de l'information dans un contexte monolingue. L'objectif est une analyse contrastive de 3 types de médias en France, la presse généraliste, la presse professionnelle et économique, et les blogs de journalistes, sur la problématique des données personnelles et des techno...
We present a linguistic approach for the automatic processing of "media events" . In this paper, we focus on the predicative nouns of events. We present our approach for a structured knowledge base that we developed from the linguistic framework of S-H.Lee and G.Gross.
La présentation aborde les point suivants : La protection des données personnelles à l'ère des technologies nomades : un construit social et politique complexe, Les transformations des formes de la communication médiatique dans l'espace public contemporain, La publicisation de la question de la protection des données personnelles dans les médias et...
Notre proposition s'inscrit dans le cadre de recherches menées au carrefour des Sciences de l'Information et de la Communication, des Sciences du Langage et plus spécifiquement du Traitement Automatique des Langues (Abney 2011) et de la linguistique textuelle (Adam 2011), et de l'analyse textuelle en sociologie (Demazière et al. 2006). Ce travail s...
In this paper, we present a framework and a system that extracts "salient" events relevant to a query from a large collection of documents, and which also enables events to be placed along a timeline. Each event is represented by a sentence extracted from the collection. We have conducted some experiments showing the interest of the method for this...
L'enjeu est de discuter des formes renouvelées de médiation culturelle dans les pratiques éditoriales des musées. Nous considérons que ces mutations sont de nature hybride, à la fois socioculturelle, socio-technique et informationnelle. Elles s'insèrent dans le contexte du rôle croissant joué par les industries de la communication et les " industri...
In this paper, we present a framework and a system that extracts "salient" events relevant to a query from a large collection of documents, and which also enables events to be placed along a timeline. Each event is represented by a sentence extracted from the collection. We have conducted some experiments showing the interest of the method for this...
In this work we present a system for the automatic annotation of opinions in Spanish texts. We focus mainly in the definition of a TFS-style model for the predicates of opinion and their arguments, in the creation of a lexicon of opinion predicates and in two additional variants for identifying the source of opinions. The original system extracts o...
In this paper we present an iterative methodology to improve classifier performance by incorporating linguistic knowledge, and propose a way to incorporate domain rules into the learning process. We applied the methodology to the tasks of hedge cue recognition and scope detection and obtained competitive results on a publicly available corpus.
Our work deals with calendar information as it is expressed in natural language (NL), that is to say through textual units such as prepositional phrases or noun phrases (e.g. in the 90s, at the beginning of the XVth century). We call these textual units Calendar Expressions (CE). Our work aims at showing how Information Retrieval systems can benefi...
Parmi les multiples approches de l'analyse du discours - analytique critique, théorique- nous allons, dans cet article, proposer une approche de dominance linguistique et textuelle. En fait, nous allons combiner langue, texte et discours, en étudiant comment une expression linguistique bien précise (du type " cette hypothèse est ...", " ce résultat...
Many organizations are in charge of global security management. This paper outlines and argues for the construction of a theoretical and methodological framework in order to critically assess the new technopolitics currently being developed in the field of global security and which are materialized in standards. The main purpose is to design both a...
This paper is part of a study focusing on the terminological and socio-organizational analysis of a corpus of 18 national and international standards, written in English, in the domains of business continuity activity management and risk management. The aim is to determine whether lobbying by certain countries seeking to impose their own national s...
In the domain of the international standardization of social security, the organization standards – called management standards– of security are published (or are in the process of being published) by NGOs authorized to conduct standardization activities. These texts deal with the security and protection of people and goods; they are characterized...
The work discussed in this paper was commissioned by the French National Research Agency (ANR). The NOTSEG project studies standardization and global security. It aims to identify influence factors at play during the drafting phase of national and international standards. In this paper we present KONTRAST, an ontological contrastive glossary create...
Résumé. Cet article décrit la conception d'une méthodologie instrumentée, articulée en différentes étapes, qui vise à analyser les variations terminologiques dans des textes industriels. Nous cherchons notamment à identifier, dans les textes de normes, l'origine et les modes d'élaboration des concepts, ainsi que les variations diachroniques des ter...
Cette contribution vise à tenir en tension deux cadres d'analyse généralement disjoints dans les études dont l'objet est un site Web, ou pour être plus précis, une forme techno-sémiotique partielle, si l'on tient compte du fait que, en général, ces formes techno-sémiotiques ne sont qu'un des nombreux éléments d'une arborescence dont la racine est d...
Cet article décrit un cadre théorique et d'analyse afin de rendre compte des modes d'élaboration et des caractéristiques des textes internationaux de normalisation. Les normes sont étudiées comme la cristallisation d'éléments de savoirs et de pratiques qui relèvent de dimensions socioculturelle, juridique, industrielle et technique. Nous montrons q...
Ce rapport vise à analyser les utilisations de la plate-forme Isidore entre le 1er mai et le 15 octobre 2011 et en nous appuyant sur les observations et les enseignements qu'il est possible d'en tirer, nous proposerons des pistes de réflexion en vue d'élaborer une feuille de route de pilotage, qui dépasse le simple cadre de l'évolution technique de...
The availability of millions of digital resources, and of the means for processing, analysing, signalling, and exposing them, should make it possible to facilitate the emergence of new research objects and new questions. After a brief description of Isidore, this paper proposes first results in observing the Isidore platform because it presents all...
Many organizations are in charge of global security management. This paper outlines and argues for the construction of a theoretical and methodological framework in order to critically assess the new technopolitics currently being developed in the field of global security and which are materialized in standards. The main purpose is to design both a...
Unlike most approaches in the field of temporal expressions annotation, we consider that temporal adverbials could be relevant
units from the point of view of Information Retrieval. We present here the main principles of our semantic modeling approach
to temporal adverbial units. It comprises two steps: functional modeling (using a small number of...
This paper details the method used to augment an epistemic modality corpus (the Bioscope corpus), incorporating results from the lexical and syntactic analysis of its sentences. The features resulting from these analyses were consolidated in a single data structure, that can be used for interactive experimentation on the corpus. Some visualization...
In this paper we present the main kernel approaches to the problem of relation extraction from unstructured texts. After a brief introduction to the problem and its characterization as a classication task, we present a survey of the methods and techniques used, and the results obtained. We nally suggest some future lines of work, such as the use of...
In this article we review the problem of generating of goal-oriented knowledge systems in linguistics. The problem belongs to a new research area, which is called “cognitive informatics”. The article focuses on computer coding of goal-oriented knowledge systems using as an example the typology of Russian-to-French translation difficulties.
L'article décrit une méthode et des ressources pour manipuler des données temporelles, à la fois pour un utilisateur final qui souhaite interroger un portail avec des filtres temporels et, en amont, pour le peuplement d'ontologies. Le système assiste les personnes ayant à charge la saisie d'information dans une ontologie en leur permettant d'exprim...
The linguistic resources presented in this paper are designed for the recognition and semantic tagging of calendar expressions in French. While existing resources generally put the emphasis on describing calendar bases pointed out by calendar expressions (which are considered as named entities), our approach tries to explicit how references to cale...
As a text, each job advertisement expresses rich information about the occupation at hand, such as competence needs (i.e. required degrees, field knowledge, task expertise or technical skills). To facilitate the access to this information, the SIRE project conducted a corpus based study of how to articulate HR expert ontologies with modern semi-sup...
Technologies de l'information et intelligences collectives analyse les évolutions et les problèmes liés au développement des technologies de l'information. Les conditions de production, de consommation et de circulation des savoirs sont en voie de transformation. Il s'agit de définir les mutations des pratiques de lecture-écriture en prenant la mes...
We present our work on the identification of opinions and its components: the source, the topic and the message. We describe a rule-based system for which we achieved a recall of 74% and a precision of 94%. Experimentation with machine-learning techniques for the same task is currently underway
In this paper we address the problem of accessing text information by text
navigation. We present an approach to text navigation conceived as a cognitive process
exploiting linguistic information present in texts. We claim that the navigational knowledge
involved in this process can be modeled in a declarative way with the Sextant language. Since
t...
In this paper we present a new approach to the expression of certainty and uncertainty in scientific experimental articles. This will permit to ascertain the validity of knowledge extracted from biological literature and used to automatically populate a domain ontology. We argue that lexical terms such as show, find, observe... express a semantic c...
Le filtrage sémantique, comme la majorité des processus qui relèvent du TAL,
nécessite l'existence de plateformes logicielles. En effet, l'explosion en volume des
données textuelles disponibles sous forme numérique a entraîné, entre autres, la
multiplication et la croissance des équipes composées de développeurs, de
chercheurs et d'usagers trav...
Les mutations qui accompagnent le développement des moteurs de recherche sur Internet et la mise en place d'entrepôts de données textuelles bouleversent les pratiques de lecture, professionnelles autant que privées, individuelles mais également collectives.
Toutes ces pratiques, qui mettent en jeu l'écriture, la lecture, et la navigation dans les...
Temporal expressions that refer to a part of a calendar area in terms of common calendar divisions are studied. Our claim is that such a "cal- endar expression" (CE) can be described by a succession of operators operating on a calendar base (CB). These operators are categorized: a pointing operator that transform a CB into a CE; a focalizing/shifti...
This paper presents our work on the detection of temporal information in web pages. The pages examined within the scope of this study were taken from the tourism sector and the temporal information in question is thus particular to this area. The differences that exist between extraction from plain textual data and extraction from the web are broug...
We present an approach to text navigation conceived as a cognitive process exploiting linguistic information present in texts. We claim that the navigational knowledge in-volved in this process can be modeled in a declarative way with the Sextant language. Since Sextant refers exhaustively to specific linguistic phenomena, we have defined a customi...
In this paper, we describe NaviTexte, a software devoted to text navigation. First, we explain our conception of text navigation,
which exploits linguistic information in texts to offer dynamic reading paths to a reader. Second, we describe a text representation
specially defined to support our approach. Then we present a language for modeling navi...
In this paper, we present our approach to text navigation conceived like a cognitive process, which exploits navigation specific knowledge. We draw up the hypothesis that such knowledge can be designed in a declarative way with our language SEXTANT. Finally, several experimentations are described. MOTS-CLÉS : navigation textuelle assistée, modélisa...
PowerPoint formate-t-il la pensée ? Peut-on numériser l'actualité ? Y a-t-il des univers littéraires sur Internet ? L'informatique peut-elle transformer l'apprentissage ? Ou plus généralement, comment l'arrivée des médias informatisés peut-elle conduire à définir et à propager de façon particulière le rapport entre écriture et pratiques ?
Issu d'u...
PowerPoint formate-t-il la pensée ? Peut-on numériser l'actualité ? Y a-t-il des univers littéraires sur Internet ? L'informatique peut-elle transformer l'apprentissage ? Ou plus généralement, comment l'arrivée des médias informatisés peut-elle conduire à définir et à propager de façon particulière le rapport entre écriture et pratiques ?
Issu d'u...