
Fabien Lucien Gandon
- PhD, HDR
- Research Director at National Institute for Research in Computer Science and Control
Director of the Wimmics Lab
About
326 Publications
95,840 Reads
4,248 Citations
Introduction
Research Director in Informatics and Computer Science at Inria and leader of the Wimmics team at the Sophia-Antipolis Research Center (Inria, I3S). Inria representative at the World Wide Web Consortium (W3C). His professional interests include the Web, Semantic Web, Social Web, Ontologies, Knowledge Engineering and Modelling, Mobility, Privacy, Context-Awareness, Semantic Social Networks / Semantic Analysis of Social Networks, Intraweb, and Distributed Artificial Intelligence.
Current institution
Additional affiliations
January 2003 - December 2003
July 2014 - present
January 2012 - present
Publications (326)
Over the last 70 years, we, humans, have created an economic market where attention is being captured and turned into money thanks to advertising. During the last two decades, leveraging research in psychology, sociology, neuroscience and other domains, Web platforms have brought the process of capturing attention to an unprecedented scale. With th...
Seq-to-seq generative models recently gained attention for solving the relation extraction task. By approaching this problem as an end-to-end task, they surpassed encoder-based-only models. Little research investigated the effects of the output syntaxes on the training process of these models. Moreover, a limited number of approaches were proposed...
Over recent years, we witnessed an astonishing growth in production and consumption of Linked Data (LD), which contains valuable information to support decision-making processes in various application domains. In this context, data visualization plays a decisive role in making sense of the large volumes of data created every day and in effectively...
The advancement and deployment of artificially intelligent agents brought numerous benefits in knowledge and data gathering and processing. However, one of the key challenges in deploying such agents in an open environment like the Web is their interoperability as they currently mostly run in silos. In this paper we report on a simulation and evaluat...
We present the WASABI Song Corpus, a large corpus of songs enriched with metadata extracted from music databases on the Web, and resulting from the processing of song lyrics and from audio analysis. More specifically, given that lyrics encode an important part of the semantics of a song, we focus here on the description of the methods we proposed t...
In this paper, we present the WeKG-MF Knowledge Graph constructed from open weather observations published by the Météo-France institution. WeKG-MF relies on a semantic model that formalizes knowledge about meteorological observational data. The model is generic enough to be adopted and extended by meteorological data providers to publish and integrate...
A large number of semantic Web knowledge bases have been developed and published on the Web. To help the user identify the knowledge bases relevant for a given problem, and estimate their usability, we propose a declarative indexing framework and an associated visualization Web application, KartoGraphI. It provides an overview of important charact...
Relational Graph Convolutional Networks (RGCNs) are commonly used on Knowledge Graphs (KGs) to perform black box link prediction. Several algorithms have been proposed to explain their predictions. Evaluating performance of explanation methods for link prediction is difficult without ground truth explanations. Furthermore, there can be multiple exp...
This work combines semantic reasoning and machine learning to create tools that allow curators of the visual art collections to identify and correct the annotations of the artwork as well as to improve the relevance of the content-based search results in these collections. The research is based on the Joconde database maintained by French Ministry...
Background: Artificial intelligence methods applied to electronic medical records (EMRs) hold the potential to help physicians save time by sharpening their analysis and decisions, thereby improving the health of patients. On the one hand, machine learning algorithms have proven their effectiveness in extracting information and exploiting knowledge...
To study and predict meteorological phenomena and to include them in broader studies, the ability to represent and exchange meteorological data is of paramount importance. A typical approach to integrating and publishing such data today is to formalize a knowledge graph relying on Linked Data and semantic Web standard models and practices. In this...
The unprecedented mobilization of scientists in response to the COVID-19 pandemic has generated an enormous number of scholarly articles that is impossible for a human being to keep track of and explore without appropriate tool support. In this context, we created the Covid-on-the-Web project, which aims to assist the access, querying, and sense mak...
Equivalence links are the cornerstone of Linked Data and their integration. However, it is not easy to establish and manipulate them, since the Web is always evolving with datasets emerging and disappearing. Inconsistencies may also be present on the Web, leading to erroneous assertions and inferences. We propose a method to identify owl:sameAs rel...
Association rule mining often leads the analyst into a rough rummaging process to identify rules that are relevant to understand specific problems. We propose a visualization interface to assist the rule selection process and evaluate it on an RDF knowledge graph derived from the COVID-19 Open Research Dataset. The user interface supports data expl...
Since 2017, the goal of the two-million song WASABI database has been to build a knowledge graph linking collected metadata (artists, discography, producers, dates, etc.) with metadata generated by the analysis of both the songs’ lyrics (topics, places, emotions, structure, etc.) and audio signal (chords, sound, etc.). It relies on natural language...
Song lyrics contain repeated patterns that have been proven to facilitate automated lyrics segmentation, with the final goal of detecting the building blocks (e.g., chorus, verse) of a song text. Our contribution in this article is twofold. First, we introduce a convolutional neural network (CNN)-based model that learns to segment the lyrics based...
The training curriculum for medical doctors requires the intensive and rapid assimilation of a lot of knowledge. To help medical students optimize their learning path, the SIDES 3.0 national French project aims to extend an existing platform with intelligent learning services. This platform contains a large number of annotated learning resources, f...
For data sources to provide reliable linked data, they need to indicate information about the (un)certainty of their data based on the views of their consumers. In addition, uncertainty information in terms of the Semantic Web also has to be encoded into a readable, publishable, and exchangeable format to increase the interoperability of syste...
Scientists are harnessing their multidisciplinary expertise and resources to fight the COVID-19 pandemic. Aligned with this mind-set, the Covid-on-the-Web project aims to allow biomedical researchers to access, query and make sense of COVID-19 related literature. To do so, it adapts, combines and extends tools to process, analyze and enrich the "CO...
A Linked Data crawler performs a selection to focus on collecting linked RDF (including RDFa) data on the Web. From the perspectives of throughput and coverage, given a newly discovered and targeted URI, the key issue of Linked Data crawlers is to decide whether this URI is likely to dereference into an RDF data source and therefore it is worth dow...
The Web reaches more than 3 billion direct users. However, for several years now it has been used not only by humans but also by machines. This article explains how a Web of Data and a Semantic Web are being woven on the Web to describe everything that can be identified, and to exchange between machines and at the...
Recent W3C recommendations for the Web of Things (WoT) and the Social Web are turning hypermedia into a homogeneous information fabric that interconnects heterogeneous resources: devices, people, information resources, abstract concepts, etc. The integration of multi-agent systems with such hypermedia environments now provides a means to distribute...
Although there are many medical standard vocabularies available, it remains challenging to properly identify domain concepts in electronic medical records. Variations in the annotations of these texts in terms of coverage and abstraction may be due to the chosen annotation methods and the knowledge graphs, and may lead to very different performance...
The integration of systems of autonomous agents in Web of Things (WoT) environments is a promising approach to provide and distribute intelligence in world-wide pervasive systems. A central problem then is to enable autonomous agents to discover heterogeneous resources in large-scale, dynamic WoT environments. This is true in particular if an envir...
This book highlights novel research in Knowledge Discovery and Management (KDM), gathering the extended, peer-reviewed versions of outstanding papers presented at the annual conferences EGC’2017 & EGC’2018. The EGC conference cycle was founded by the International French-speaking EGC society (“Extraction et Gestion des Connaissances”) in 2003, and...
In this keynote I will mention a number of works from the research team Wimmics that has been studying the challenges in bridging social semantics and formal semantics on the Web. These contributions address some of the challenges in connecting AIs to the Web.
The open nature of the Web exposes it to the many imperfections of our world. As a result, before we can use knowledge obtained from the Web, we need to represent that fuzzy, vague, ambiguous and uncertain information. Current standards of the Semantic Web and Linked Data do not support such a representation in a formal way and independently of any...
As the Web of Linked Open Data is growing the problem of crawling that cloud becomes increasingly important. Unlike normal Web crawlers, a Linked Data crawler performs a selection to focus on collecting linked RDF (including RDFa) data on the Web. From the perspectives of throughput and coverage, given a newly discovered and targeted URI, the key i...
Electronic medical records (EMR) contain key information about the different symptomatic episodes that a patient went through. They carry a great potential in order to improve the well-being of patients and therefore represent a very valuable input for artificial intelligence approaches. However, the explicit knowledge directly available through th...
To help in making sense of the ever-increasing number of data sources available on the Web, in this article we tackle the problem of enabling automatic discovery and querying of data sources at Web scale. To pursue this goal, we suggest to (1) provision rich descriptions of data sources and query services thereof, (2) leverage the power of Web sear...
There is no credibility insurance measure for the information provided by the Web. In most cases, information cannot be checked for accuracy. Semantic Web technologies aimed to give structure and sense to information published on the Web and to provide us with a machine-readable data format for interlinked data. However, Semantic Web standards do n...
This article is a collective position paper from the Wimmics research team, expressing our vision of how Web graph data technologies should evolve in the future in order to ensure a high-level of interoperability between the many types of applications that produce and consume graph data. Wimmics stands for Web-Instrumented Man-Machine Interactions,...
The two-volume set of LNCS 11778 and 11779 constitutes the refereed proceedings of the 18th International Semantic Web Conference, ISWC 2019, held in Auckland, New Zealand, in October 2019. The ISWC conference is the premier international forum for the Semantic Web / Linked Data Community.
The total of 74 full papers included in this volume was sel...
Web based e-Education systems are an important kind of information systems that benefited from Web standards for content, implementation, deployment and integration. An e-Education system requires the collaboration of many actors in a complete ecosystem: public authorities (e.g. Ministry) and knowledge engineers, who build official reference standa...
This paper is a survey of the research topics in the field of Semantic Web, Linked Data and Web of Data. This study looks at the contributions of this research community over its first twenty years of existence. Compiling several bibliographical sources and bibliometric indicators, we identify the main research trends and we reference some of thei...
In recent years, Web APIs have become a de facto standard for exchanging machine-readable data on the Web. Despite this success, however, they often fail in making resource descriptions interoperable due to the fact that they rely on proprietary vocabularies that lack formal semantics. The Linked Data principles similarly seek the massive publicatio...
In recent years, Web APIs have become a de facto standard for exchanging machine-readable data on the Web. Despite this success though, they often fail in making resource descriptions interoperable due to the fact that they rely on proprietary vocabularies that lack formal semantics. The Linked Data principles similarly seek the massive publication...
Current evaluation methods of exploratory search systems are still incomplete as they are not fully based on a suitable model of the exploratory search process: as such they cannot be used to determine if they effectively support exploratory search behaviors and tasks. Aiming to elaborate evaluation methods based on an appropriate model of explorat...
As part of the SMILK Joint Lab, we studied the use of Natural Language Processing to: (1) enrich knowledge bases and link data on the web, and conversely (2) use this linked data to contribute to the improvement of text analysis and the annotation of textual content, and to support knowledge extraction. The evaluation focused on brand-related infor...
Web APIs are a prominent source of machine-readable information that remains insufficiently connected to the Web of Data. To enable automatic combination of Linked Data (LD) interfaces and Web APIs, we present the SPARQL Micro-Service architecture. A SPARQL micro-service is a lightweight SPARQL endpoint that provides access to a small, resource-cen...
Web APIs (Application Programming Interfaces) are a common means for Web portals and data producers to enable HTTP-based, machine-processable access to their data. They are a prominent source of information pertaining to topics as diverse as scientific information, social networks, entertainment or finance. The methods of Linked Data (Heath and Bi...
Argumentative persuasion usually employs one of three persuasion strategies: Ethos, Pathos or Logos. Several approaches have been proposed to model persuasive agents; however, none of them explored how the choice of a strategy impacts the mental states of the debaters and the argumentation process. We conducted a field experiment with real deba...
Web APIs are a prominent source of machine-readable information. We hypothesize that harnessing the Semantic Web standards to enable automatic combination of Linked Data and non-RDF Web APIs data could trigger novel cross-fertilization scenarios. To achieve this goal, we define the SPARQL Micro-Service architecture. A SPARQL micro-service is a ligh...
In this paper the authors focus on context-aware adaptation for linked data on mobile. They split up the problem in two sub-questions: how to declaratively describe context at RDF presentation level, and how to overcome context imprecisions and incompleteness when selecting the proper context description at runtime. The authors answer their two-fol...
Web based e-Education systems are an important kind of information systems that benefited from Web standards for implementation, deployment and integration. In this paper we propose and evaluate a semantic Web approach to support the features and interoperability of a real industrial e-Education system in production. We show how ontology-based know...
In this paper, we propose a proof of concept for the ontological representation of normative requirements as Linked Data on the Web. Starting from the LegalRuleML ontology, we present an extension of this ontology to model normative requirements and rules. Furthermore, we define an operational formalization of the deontic reasoning over these conce...
In addition to the existing standards dedicated to representation or querying, Semantic Web programmers could really benefit from a dedicated programming language enabling them to directly define functions on RDF terms, RDF graphs or SPARQL results. This is especially the case, for instance, when defining SPARQL extension functions. The ability to...
We develop the theory of a possibilistic framework for OWL 2 axiom testing against RDF datasets, as an alternative to statistics-based heuristics. The intuition behind it is to evaluate the credibility of OWL 2 axioms based on the evidence available in the form of a set of facts contained in a chosen RDF dataset. To achieve it, we first define the...
This editorial introduces the special issue based on the best papers from ESWC 2015. And since ESWC’15 marked 15 years of Semantic Web research, we extended this editorial to a position paper that reflects the path that we, as a community, traveled so far with the goal of transforming the Web of Pages to a Web of Resources. We discuss some of the k...
In everyday life discussion, people try to persuade each other about the goodness of their viewpoint regarding a certain topic. This persuasion process is usually affected by several elements, like the ability of the speaker in formulating logical arguments, her confidence with respect to the discussed topic, and the emotional solicitation that cer...
In this article, we propose an approach for building a knowledge base from texts in the cosmetics domain. This is a special case, for a fixed domain, of the problem of extracting relations from texts. To solve this problem, we propose a semi-supervised approach for the extracti...
We define and provide real cases of "Web-Augmented Interactions" (WAI) with the world, a new family of interactions designed to exploit resources from the Web to improve the users' experience with the devices surrounding them.
In many social networks, people interact based on their relationship network. Community detection algorithms are then useful to reveal the sub-structures of a network. Identifying these users’ communities can help us assist their life-cycle. However, in certain kinds of online communities such as question-and-answer (Q&A) sites or forums, people in...
Exploratory search has an unclear and open-ended definition. The complexity of the task and the difficulty of defining this activity are reflected in the limits of existing evaluation methods for exploratory search systems. In order to improve them, we intend to design an evaluation method based on a user-centered model of exploratory search. In th...
Argumentation is a mechanism that supports different forms of reasoning, such as decision making and persuasion, and is always cast in the light of critical thinking. In recent years, several computational approaches to argumentation have been proposed to detect conflicting information, take the best decision with respect to the available knowledge,...
The web was originally conceived as decentralized and universal, but during its popularization, its big value was built on centralized servers and nonuniversal access. A key element to redecentralize the web is to be able to generate trustable, secure, and accountable updates among autonomous participants without a central server. The authors belie...
The DBpédia and SemanticPedia projects illustrate how data can be reused in many applications thanks to languages and description schemas, which are explained here.
The extraction and the disambiguation of knowledge guided by textual resources on the web is a crucial process to advance the Web of Linked Data. The goal of our work is to semantically enrich raw data by linking the mentions of named entities in the text to the corresponding known entities in knowledge bases. In our approach multiple aspects are c...
In this paper we present an ongoing work on building a repository of knowledge about objects typically found in homes, their usual locations and usage. We extract an RDF knowledge base by automatically reading text on the Web and applying simple inference rules. The obtained common sense object relations are ready to be used in a domestic robotic s...
This position paper provides an overview of the OCKTOPUS project whose goal is to increase the social and economic benefit of user-generated content, by transforming it into knowledge which can be shared and reused broadly.
In the Semantic Web context, OWL ontologies represent the conceptualization of domains of interest while the corresponding assertional knowledge is given by the heterogeneous Web resources referring to them. Being strongly decoupled, ontologies and assertions can be out of sync. An ontology can be incomplete, noisy and sometimes inconsistent with...
In many social networks, people interact based on their interests. Community detection algorithms are then useful to reveal the sub-structures of a network and in particular interest groups. Identifying these users' communities and the interests that bind them can help us assist their life-cycle. Certain kinds of online communities such as question...