Catherine Faron-Zucker

Catherine Faron-Zucker
Université Côte d'Azur

PhD, Université Paris 6, and HDR, Université Côte d'Azur, in Computer Science

About

180
Publications
24,796
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,757
Citations
Additional affiliations
September 2002 - present
University of Nice Sophia Antipolis
January 2002 - November 2017
Université Côte d'Azur
Position
  • Professor
Education
September 1994 - October 1998
Sorbonne Université
Field of study
  • Artificial Intelligence

Publications

Publications (180)
Chapter
In recent years, the growing application of Knowledge Graphs to new and diverse domains has created the need to make these resources accessible and understandable by users with increasingly diverse backgrounds. Visualization techniques have been widely employed as means to facilitate the exploration and comprehension of such data sources. Moreover,...
Article
Full-text available
Background Artificial intelligence methods applied to electronic medical records (EMRs) hold the potential to help physicians save time by sharpening their analysis and decisions, thereby improving the health of patients. On the one hand, machine learning algorithms have proven their effectiveness in extracting information and exploiting knowledge...
Article
Full-text available
Ontology alignment plays a key role in the management of heterogeneous data sources and metadata. In this context, various ontology alignment techniques have been proposed to discover correspondences between the entities of different ontologies. This paper proposes a new ontology alignment approach based on a set of rules exploiting the embedding s...
Article
Full-text available
Harnessing worldwide biodiversity data requires integrating myriad pieces of information, often sparse and incomplete, into a global, coherent data space. To do so, projects like the Global Biodiversity Information Facility, Catalog of Life and Encyclopedia of Life have set up platforms that gather, consolidate, and centralize billions of records f...
Chapter
Full-text available
The training curriculum for medical doctors requires the intensive and rapid assimilation of a lot of knowledge. To help medical students optimize their learning path, the SIDES 3.0 national French project aims to extend an existing platform with intelligent learning services. This platform contains a large number of annotated learning resources, f...
Conference Paper
Full-text available
Although there are many medical standard vocabularies available, it remains challenging to properly identify domain concepts in electronic medical records. Variations in the annotations of these texts in terms of coverage and abstraction may be due to the chosen annotation methods and the knowledge graphs, and may lead to very different performance...
Chapter
Full-text available
The difficulty of representing and organizing knowledge in reasonably complete ways raises at least two research questions: “how to check that particular relations are systematically used not just whenever possible but whenever relevant for knowledge providers?” and “how to extend best practices, ontology patterns or methodologies advocating the sy...
Chapter
Full-text available
Ontology alignment plays a key role in achieving interoperability on the semantic Web. Inspired by the success of word embedding techniques in several NLP tasks, we propose a new ontology alignment approach based on the combination of word embedding and the radius measure. We tested our system on the OAEI (http://oaei.ontologymatching.org/) confere...
Article
Full-text available
During the last decade, Web APIs (Application Programming Interface) have gained significant traction to the extent that they have become a de-facto standard to enable HTTP-based, machine-processable data access. Despite this success, however, they still often fail in making data interoperable, insofar as they commonly rely on proprietary data mode...
Chapter
Full-text available
Electronic medical records (EMR) contain key information about the different symptomatic episodes that a patient went through. They carry a great potential in order to improve the well-being of patients and therefore represent a very valuable input for artificial intelligence approaches. However, the explicit knowledge directly available through th...
Conference Paper
Full-text available
To help in making sense of the ever-increasing number of data sources available on the Web, in this article we tackle the problem of enabling automatic discovery and querying of data sources at Web scale. To pursue this goal, we suggest to (1) provision rich descriptions of data sources and query services thereof, (2) leverage the power of Web sear...
Conference Paper
Full-text available
Dans le domaine de la recherche d'information et du traite-ment automatique du langage, la tâche de classification de textes est devenue une tâche cruciale. Dans cet article, nous partageons notre expérience de la classification de textes dans un contexte industriel et présentons une évalu-ation comparative de différents algorithmes de classificati...
Conference Paper
Full-text available
Ces dernières années, de nombreuses entreprises s'orientent vers l'intégration du développement d'ontologies au sein de leurs processus pour mieux organiser les connaissances mises en jeu et améliorer les performances de leur traitement automatique. Dans cet article, nous décrivons notre travail de modélisation d'ontologie dans le domaine du sourci...
Preprint
Full-text available
This article is a collective position paper from the Wimmics research team, expressing our vision of how Web graph data technologies should evolve in the future in order to ensure a high-level of interoperability between the many types of applications that produce and consume graph data. Wimmics stands for Web-Instrumented Man-Machine Interactions,...
Chapter
Full-text available
RDF-based data integration is often hampered by the lack of methods to translate data locked in heterogeneous silos into RDF representations. In this paper, we tackle the challenge of bridging the gap between the Semantic Web and NoSQL worlds, by fostering the development of SPARQL interfaces to heterogeneous databases. To avoid defining yet anothe...
Chapter
Web based e-Education systems are an important kind of information systems that benefited from Web standards for content, implementation, deployment and integration. An e-Education system requires the collaboration of many actors in a complete ecosystem: public authorities (e.g. Ministry) and knowledge engineers, who build official reference standa...
Article
Full-text available
In recent years, Web APIs have become a de facto standard for exchanging machinereadable data on the Web. Despite this success, however, they often fail in making resource descriptions interoperable due to the fact that they rely on proprietary vocabularies that lack formal semantics. The Linked Data principles similarly seek the massive publicatio...
Preprint
Full-text available
In recent years, Web APIs have become a de facto standard for exchanging machine-readable data on the Web. Despite this success though, they often fail in making resource descriptions interoperable due to the fact that they rely on proprietary vocabularies that lack formal semantics. The Linked Data principles similarly seek the massive publication...
Article
Full-text available
As part of the SMILK Joint Lab, we studied the use of Natural Language Processing to: (1) enrich knowledge bases and link data on the web, and conversely (2) use this linked data to contribute to the improvement of text analysis and the annotation of textual content, and to support knowledge extraction. The evaluation focused on brand-related infor...
Chapter
Data published on the Web following the Linked Data principles has resulted in a global data space called the Web of Data. These principles led to semantically interlink and connect different resources at data level regardless their structure, authoring, location, etc. The tremendous and continuous growth of the Web of Data also implies that now it...
Chapter
Data published on the Web following the Linked Data principles has resulted in a global data space called the Web of Data. These principles led to semantically interlink and connect different resources at data level regardless their structure, authoring, location, etc. The tremendous and continuous growth of the Web of Data also implies that now it...
Chapter
Full-text available
Web APIs are a prominent source of machine-readable information that remains insufficiently connected to the Web of Data. To enable automatic combination of Linked Data (LD) interfaces and Web APIs, we present the SPARQL Micro-Service architecture. A SPARQL micro-service is a lightweight SPARQL endpoint that provides access to a small, resource-cen...
Article
Full-text available
As the biodiversity community increasingly adopts Semantic Web (SW) standards to represent taxonomic registers, trait banks or museum collections, some questions come up relentlessly: How to model the data? For what goals? Can the same model fulfill different goals? So far, the community has mostly considered the SW standards through their most sal...
Article
Full-text available
Web APIs (Application Programming Interface) are a common means for Web portals and data producers to enable HTTP-based, machine-processable access to their data. They are a prominent source of information*1 pertaining to topics as diverse as scientific information, social networks, entertainment or finance. The methods of Linked Data (Heath and Bi...
Conference Paper
Full-text available
Web APIs are a prominent source of machine-readable information. We hypothesize that harnessing the Semantic Web standards to enable automatic combination of Linked Data and non-RDF Web APIs data could trigger novel cross-fertilization scenarios. To achieve this goal, we define the SPARQL Micro-Service architecture. A SPARQL micro-service is a ligh...
Conference Paper
Full-text available
Educational quizzes are a powerful and popular tool to test the knowledge acquired by a learner and also to deepen her/his knowledge about a specific subject in an informal and entertaining way. Their production is a time-consuming task that can be automated by taking advantage of existing knowledge bases available on the Web of Linked Open Data (L...
Chapter
Educational quizzes are useful not only to evaluate or test the knowledge acquired by a learner, but also to help her/him to deepen knowledge about a specific domain or topic in an informal and entertaining way. The production of quizzes is a time-consuming task that can be automated by taking advantage of existing knowledge bases available on the...
Conference Paper
Web based e-Education systems are an important kind of information systems that benefited from Web standards for implementation, deployment and integration. In this paper we propose and evaluate a semantic Web approach to support the features and interoperability of a real industrial e-Education system in production. We show how ontology-based know...
Book
This book constitutes the refereed proceedings of the 21th International Conference on Knowledge Engineering and Knowledge Management, EKAW 2018, held in Nancy, France, in November 2018. The 36 full papers presented were carefully reviewed and selected from 104 submissions. The papers cover all aspects of eliciting, acquiring, modeling, and managin...
Conference Paper
Full-text available
Taxonomic registers are key tools to help us comprehend the diversity of nature. Publishing such registers in the Web of Data, following the standards and best practices of Linked Open Data (LOD), is a way of integrating multiple data sources into a world-scale, biological knowledge base. In this paper, we present an ongoing work aimed at the publi...
Conference Paper
In addition to the existing standards dedicated to representation or querying, Semantic Web programmers could really benefit from a dedicated programming language enabling them to directly define functions on RDF terms, RDF graphs or SPARQL results. This is especially the case, for instance, when defining SPARQL extension functions. The ability to...
Article
The increase in the amount of structured data published on the Web using the principles of Linked Data means that now it is more likely to find resources on the Web of Data that represent real life concepts. Discovering and recommending resources on the Web of Data related to a given resource is still an open research area. This work presents a fra...
Article
We develop the theory of a possibilistic framework for OWL 2 axiom testing against RDF datasets, as an alternative to statistics-based heuristics. The intuition behind it is to evaluate the credibility of OWL 2 axioms based on the evidence available in the form of a set of facts contained in a chosen RDF dataset. To achieve it, we first define the...
Article
Full-text available
Dans cet article, nous proposons une approche pour construire une base de connaissances à partir de textes dans le domaine de la cosmétique. Il s’agit d’un cas particulier pour un domaine fixé du problème de l’extraction de relations à partir de textes. Dans le but de résoudre ce problème, nous proposons une approche semi-supervisée pour l’extracti...
Article
Full-text available
Dans cet article, nous proposons une approche pour construire une base de connaissances à partir de textes dans le domaine de la cosmétique. Il s’agit d’un cas particulier pour un domaine fixé du problème de l’extraction de relations à partir de textes. Dans le but de résoudre ce problème, nous proposons une approche semi-supervisée pour l’extracti...
Article
In many social networks, people interact based on their relationship network. Community detection algorithms are then useful to reveal the sub-structures of a network. Identifying these users' communities can help us assist their life-cycle. However, in certain kinds of online communities such as question-And-Answer (Q&A) sites or forums, people in...
Conference Paper
In the education field, in order to achieve learning goals, it is necessary to define learning paths that foresee a gradual and incremental acquisition of certain knowledge and skills that students should acquire. In this paper we analyze the educational progressions of the French educational system, we show how to formalize them through a web onto...
Article
Collaborative tagging which is the keystone of the social practices of web 2.0 has been highly developed in the last few years. In this paper, we propose a new method to analyze user profiles according to their tagging activity in order to improve resource recommendation. We base upon association rules which is a powerful method to discover interes...
Conference Paper
In the education field, in order to achieve learning goals, it is necessary to define learning paths that foresee a gradual and incre- mental acquisition of certain knowledge and skills that students should acquire. In this paper we analyze the educational progressions of the French educational system, we show how to formalize them through a web on...
Article
Dans cet article nous commençons par présenter le langage STTL (SPARQL Template Transformation Language) pour la transformation de graphes RDF, conçu comme une extension du langage SPARQL.Nous présentons ensuite une plate-forme qui implémente ce langage pour permettre la conception de navigateurs Web offrant une navigation hypertextuelle en HTML da...
Conference Paper
Full-text available
The Web of Data is an interconnected global dataspace in which discovering resources related to a given resource and recommend relevant ones is still an open research area. This work describes a new recommendation algorithm based on structured data published on the Web (Linked Data). The algorithm exploits existing relationships between resources b...
Conference Paper
In this paper we address the general research question of How can we express constraints on RDF data and how can we check that an RDF graph satisfies some given constraints? and we focus on expressing constraints defining OWL 2 profiles and checking these constraints for OWL validation. We propose an approach based on the SPARQL Template Transforma...
Conference Paper
Full-text available
Accessing legacy data as virtual RDF stores is a key issue in the building of the Web of Data. In recent years, the MongoDB database has become a popular actor in the NoSQL market, making it a significant potential contributor to the Web of Linked Data. Therefore, in this paper we address the question of how to access arbitrary MongoDB documents wi...
Conference Paper
This position paper provides an overview of the OCKTOPUS project whose goal is to increase the social and economic benefit of user-generated content, by transforming it into knowledge which can be shared and reused broadly.
Conference Paper
Full-text available
While the data deluge accelerates, most of the data produced remains locked in deep Web databases. For the linked open data to benefit from the potential represented by this huge amount of data, it is crucial to come up with solutions to expose heterogeneous databases as linked data. The xR2RML mapping language is an endeavor towards this goal: it...
Conference Paper
XSLT is a language for transforming XML documents into other XML documents. Despite its 16 years long life, the RDF Semantic Web language still waits its transformation language. Some propositions have been done, relying on and extending XSLT, but none of them became widely used. In this paper, we present a radically new transformation language for...
Book
This book constitutes the proceedings of the 22th International Conference on Conceptual Structures, ICCS 2016, held in Annecy, France, in July 2016. The 14 full papers and 5 short papers presented in this volume were carefully reviewed and selected from 40 submissions. They are organized around the following topical sections: time representation;...
Article
Full-text available
In many social networks, people interact based on their interests. Community detection algorithms are then useful to reveal the sub-structures of a network and in particular interest groups. Identifying these users' communities and the interests that bind them can help us assist their life-cycle. Certain kinds of online communities such as question...
Article
In many social networks, people interact based on their interests. Community detection algorithms are then useful to reveal the sub-structures of a network and in particular interest groups. Identifying these users’ communities and the interests that bind them can help us assist their life-cycle. Certain kinds of online communities such as question...
Conference Paper
Full-text available
In this article we present a generic template and software solution for developers to support the many cases where we need to transform RDF. It relies on the SPARQL Template Transformation Language (STTL) which enables Semantic Web developers to write specific yet compact RDF transformers toward other languages and formats.We first briefly recall t...
Conference Paper
Given the increasing amount of structured data published on the Web, many possibilities are open for creating new types of games that use resources from the Web of Data. In particular, if we consider the subcategory of Serious Games in which the object of the game is to educate the user through the interactive discovery of real-life concepts (assoc...
Article
Full-text available
Axiom scoring is a critical task both for the automatic enrichment/learning and for the automatic validation of knowledge bases and ontologies. We designed and developed an axiom scoring heuristic based on possibility theory, which aims at overcoming some limitations of scoring heuristics based on statistical inference and taking into account the o...