Vânia Vidal’s research while affiliated with Federal University of Ceará and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (39)


An Autonomous Domain-independent Framework Based on LLMs for Text-to-SPARQL
  • Article

March 2025

·

6 Reads

International Journal of Semantic Computing

·

Vania M. P. Vidal

·

·

Large language models (LLMs) currently are the state of the art for pre-trained language models. LLMs have been applied to many tasks, including question and answering over Knowledge Graphs (KGs) and text-to-SPARQL, that is, the translation of Natural Language questions to SPARQL queries. With such motivation, this paper first describes preliminary experiments to evaluate the ability of ChatGPT to answer Natural Language questions over KGs. Based on these experiments, the paper introduces Auto-KGQA, an autonomous domain-independent framework based on LLMs for text-to-SPARQL. The framework selects fragments of the KG, which the LLM uses to translate the user’s Natural Language question to a SPARQL query on the KG. The paper describes preliminary experiments with Auto-KGQA with ChatGPT that indicate that the framework substantially reduced the number of tokens passed to ChatGPT without sacrificing performance. Finally, the paper includes an evaluation of Auto-KGQA in a publicly available benchmark, which showed that the framework is competitive, achieving an improvement of 13.2% in accuracy with respect to the state of the art and a reduction of 51.12% in the number of tokens passed to the LLM.



ContextEKG_Explorer: Uma Ferramenta Interativa para Exploração Contextual da Visão Semântica em Sistemas de Grafo de Conhecimento Empresarial

October 2024

·

1 Read

Um sistema de Grafo de Conhecimento Empresarial (Enterprise Knowledge Graph ou EKG) é um paradigma baseado em tecnologias da Web Semântica e Grafos de Conhecimento para integrar fontes de dados heterogêneas. Um EKG fornece uma visão ontológica e unificada, para que as aplicações tenham acesso integrado aos dados através da visão semântica. A exploração de dados na visão semântica requer ferramentas que apresentem grafos de forma compreensível, facilitando a interpretação e a tomada de decisão. Este artigo apresenta a ContextEKG Explorer1, uma ferramenta gráfica interativa para a exploração de dados na visão semântica de um EKG. A visão semântica é organizada em uma hierarquia de três níveis, simplificando a exploração das entidades em múltiplos contextos. A ferramenta oferece visualizações intuitivas das entidades e navegação fluida entre esses contextos.


A Data Design Pattern for Building and Exploring Semantic Views of Enterprise Knowledge Graphs

October 2024

·

5 Reads

Vânia M. P. Vidal

·

Renato Freitas

·

·

[...]

·

An Enterprise Knowledge Graph (EKG) is a robust foundation for knowledge management, data integration, and advanced analytics across organizations. It achieves this by offering a semantic view that semantically integrates various data sources within an organization’s data lake. This paper introduces a novel data design pattern (DDP) aimed at constructing and managing the semantic view of an EKG. The proposed DDP logically organizes data into three hierarchical levels, facilitating the maintenance and the versatile exploration of the semantic view in various contexts. Furthermore, this paper details an interactive graphical interface developed to supports context-sensitive navigation of the semantic view, enhancing user interaction and resource utilization.


Fig. 1. Scheme of the KG used in the experiments
Experiments with text-to-SPARQL based on ChatGPT
  • Conference Paper
  • Full-text available

February 2024

·

128 Reads

·

19 Citations

Download

LiRB: Um Navegador Leve Baseado em Texto para Knowledge Graphs RDF

September 2023

·

8 Reads

·

2 Citations

Neste artigo apresentamos LIght RDF Browser (LiRB), uma interface leve para a navegação interativa sobre Knowledge Graphs (KGs) RDF. LiRB apresenta uma interface Web baseada em texto, proporcionando uma experiência de navegação semelhante às páginas Web tradicionais, promovendo uma menor curva de aprendizagem aos usuários. Como diferenciais, LiRB utiliza consultas simples e rápidas, sendo ideal para a navegação de KGs massivos e/ou virtuais. Além disso, a ferramenta também permite a abstração de relacionamentos N-ários para representar a semântica de propriedades de relacionamentos, além da apresentação de consultas salvas. Por fim, LiRB também dá suporte ao recurso de exibição de timelines de eventos de recursos do KG.


Construção do Grafo de Conhecimento Semântico de Dados Abertos de Pessoas Jurídicas

November 2022

·

135 Reads

This article presents the construction of a semantic knowledge graph of the open data of legal entities (SKG:CNPJ). SKG:CNPJ is obtained from the semantic integration of four data sources: RFB, IBGE, Correios, TCU and CEIS. The main objective of SKG:CNPJ is to provide a semantic layer, so that applications can have integrated access to the data source data through the Semantic Layer. The data and metadata of the SKG:CNPJ are made available on a Semantic Portal (Semantic-CNPJ) for query and visualization. The article also describes the use of SKG:CNPJ for construction of semantic queries.


Figure 3. Confusion matrices for activities segmented on CAD-120.
Hybrid sensor data streams segmentation methods for HAR. The columns adopt: ST -Semantic Technology. The lines adopt: RSD -Raw Sensor Data; OOU -Object of Use; HP -Human Posture; and L -Location
SeAct: Semantic Adaptive Segmentation of Sensor Data Streams for Human Activity Recognition

September 2022

·

54 Reads

Pervasive computing delivers services based on user needs through smart environments that incorporate and integrate everyday objects discreet and non-intrusive. Personal applications provide the data collected by sensors for Human Activity Recognition. The main limitation is that these activities need to be continuously segmented for HAR. Furthermore, a growing problem is related to the disambiguation of activities since some actions generated by the same sensors belong to different activities. This paper proposes a hybrid method, SeAct, which dynamically adjusts segment size, combining machine learning and semantic inference. Experiments with CAD-120 data sets and a state-of-the-art hybrid method improve recognition accuracy and precision.


Publication and maintenance of RDB2RDF views externally materialized in enterprise knowledge graphs

July 2022

·

37 Reads

·

2 Citations

International Journal of Web Information Systems

Purpose Enterprise knowledge graphs (EKG) in resource description framework (RDF) consolidate and semantically integrate heterogeneous data sources into a comprehensive dataspace. However, to make an external relational data source accessible through an EKG, an RDF view of the underlying relational database, called an RDB2RDF view, must be created. The RDB2RDF view should be materialized in situations where live access to the data source is not possible, or the data source imposes restrictions on the type of query forms and the number of results. In this case, a mechanism for maintaining the materialized view data up-to-date is also required. The purpose of this paper is to address the problem of the efficient maintenance of externally materialized RDB2RDF views. Design/methodology/approach This paper proposes a formal framework for the incremental maintenance of externally materialized RDB2RDF views, in which the server computes and publishes changesets, indicating the difference between the two states of the view. The EKG system can then download the changesets and synchronize the externally materialized view. The changesets are computed based solely on the update and the source database state and require no access to the content of the view. Findings The central result of this paper shows that changesets computed according to the formal framework correctly maintain the externally materialized RDB2RDF view. The experiments indicate that the proposed strategy supports live synchronization of large RDB2RDF views and that the time taken to compute the changesets with the proposed approach was almost three orders of magnitude smaller than partial rematerialization and three orders of magnitude smaller than full rematerialization. Originality/value The main idea that differentiates the proposed approach from previous work on incremental view maintenance is to explore the object-preserving property of typical RDB2RDF views so that the solution can deal with views with duplicates. The algorithms for the incremental maintenance of relational views with duplicates published in the literature require querying the materialized view data to precisely compute the changesets. By contrast, the approach proposed in this paper requires no access to view data. This is important when the view is maintained externally, because accessing a remote data source may be too slow.


MediBot: An Ontology-Based Chatbot to Retrieve Drug Information and Compare its Prices

September 2021

·

170 Reads

·

11 Citations

Journal of Information and Data Management

In this article, we present the MediBot. MediBot is a chatbot for querying drugs information. The presented system acted as a single access point for natural and simplified information retrieval of drugs, prices, and its risks. The chatbot has two modes of operation: Quick Response and Interactive modes. The first answers questions asked in natural language, while the second has three interactive tasks, namely Browser, Query, and Price Comparison. We present here the system architecture, the Linked Data Mashup’s construction process, and Chatbot MediBot’s activities modes, focusing on the new Price Comparison’s task. This task presents the best prices for medicines and their best potential substitutes extracted in real-time from the Web with the help of the information obtained from a linked data mashup.


Citations (23)


... We will demonstrate but also explain our system using two scenarios. In the first scenario, we use two demonstration instances whereas one is pre-loaded with a simple example ontology from [1] and another one with selected popular ontologies. The user can use a simple database explorer GUI to get an overview on the different types of terms that are loaded in a dedicated collection each (Classes, ObjectProperties, DatatypeProperties, etc.). ...

Reference:

POTS - A Polyparadigmatic Ontology Term Search with Fine-Grained Context Steering using Hyper-Level Vector Spaces
A Framework for Question Answering on Knowledge Graphs Using Large Language Models
  • Citing Chapter
  • January 2025

... The preservation and accessibility of lexicographic data is crucial for standardizing Query in Natural Language: "What is the gender of Apfel in German?" Figure 1: Conversational lexicography: enabling natural language queries to KGs by automatically generating SPARQL code, eliminating the need for manual query writing language understanding, supporting linguistic research, documenting cultural diversity (Gregson et al., 2015), and crucially, increasing interoperability in language technology. Recent advancements in Large Language Models (LLMs) have opened new pathways for creating natural language interfaces to KGs, potentially democratizing access to this structured linguistic knowledge (Avila et al., 2024). ...

Experiments with text-to-SPARQL based on ChatGPT

... [Sellami and Zarour 2022] introduziram o KeyFSI, uma interface de navegação facetada, que visa facilitar a exploração de dados em um GC. Por fim, [Avila and Vidal 2023] apresentaram LiRB, uma interface baseada nas tradicionais páginas da Web que permite uma exploração de GCs. LiRB atua sobre endpoints SPARQL para acessar os dados. ...

LiRB: Um Navegador Leve Baseado em Texto para Knowledge Graphs RDF
  • Citing Conference Paper
  • September 2023

... Semantic annotations can help refining data retrieval because they support extending queries to identify data that is relevant to the query, but which is described with different terms.Ávila et al. [13] explored the semantic linkage using the SKOS predicate. Their research proposed semantic enrichment using SPARQL queries and SKOS vocabulary. ...

Ligações Semânticas Utilizando Predicados SKOS
  • Citing Conference Paper
  • October 2017

... As principais publicações que serviram de referência para a realização deste trabalho foram as de [Lopes et al. 2016], denominada "integração de dados na saúde pública", a de [Victorino et al. 2017], "big data com dados governamentais"; a de [Alencar et al. 2018], "vitrine de Currículos Lattes", e o de [Araújo et al. 2017], "integração de bases em cidades inteligentes". A escolha para estas publicações se justifica por eles trazerem contribuições para a maioria dos principais conceitos abordados no presente trabalho. ...

Construção de Linked Data Mashup para Integração de Dados da Saúde Pública
  • Citing Conference Paper
  • October 2016

... Terms that are not found in these ontologies and are specific to our domain will be created and publicly available. Once we have created the domain ontology containing all terms used in our database schema, we will use approaches such as the one presented by Vidal and colleagues (Vidal et al. 2022) to create an Enterprise Knowledge Graph (EKG) from our relational database and integrate it with other related projects, such as one Atlas Digital da América Lusa (Atlas Digital da América Lusa 2020). The EKG will, therefore, guarantee the availability and integration of data in other datasets and systems, and can be consulted by humans and machines. ...

Publication and maintenance of RDB2RDF views externally materialized in enterprise knowledge graphs
  • Citing Article
  • July 2022

International Journal of Web Information Systems

... Knowledge Graphs (KG) are semantic networks that represent information in a graph structure, with entities as nodes and relationships as edges (Heist et al., 2020), built from diverse data to integrate and organize knowledge (Paulheim, 2016). They can be applied in areas such as the labor market (Popping, 2003), education methods (cao, 2023), and medicine (Vidal Rolim et al., 2021), and are valued in Artificial Intelligence (AI) for their clarity and flexibility (Shen et al., 2022). An example is the combination of KG with AI technologies, such as Microsoft's Azure OpenAI, which further enhances their potential by facilitating the integration and analysis of large volumes of data more efficiently and accurately (Sarica et al., 2020). ...

UM ENFOQUE INCREMENTAL PARA CONSTRUÇÃO DO GRAFO DE CONHECIMENTO DO SUS
  • Citing Chapter
  • January 2021

... Avila (2021) [6] developed MediBot, a chatbot that provides drug information, including prices and potential substitutes. The bot works in two modes: Quick Response mode and Interactive mode. ...

MediBot: An Ontology-Based Chatbot to Retrieve Drug Information and Compare its Prices

Journal of Information and Data Management

... Essas plataformas visam consolidar os dados dos pacientes, permitindo decisões clínicas mais rápidas e embasadas. Por exemplo, [Rolim et al. 2020] integram fontes de dados clínicos e administrativos, facilitando a tomada de decisões em tempo real e melhorando a coordenação do cuidado. Outro exemplo é o ONTOVID [Gomes et al. 2022], que constrói um grafo semântico para rastrear casos e óbitos por COVID-19, fortalecendo o gerenciamento de surtos com dados em tempo real. ...

Um Enfoque Incremental para Construção do Grafo de Conhecimento do SUS
  • Citing Conference Paper
  • September 2020

... An exception is [5] where the bot generation is semi-automated but it requires a mandatory and extensive annotation process while we focus more on a scalable approach able to generate chatbots with no human intervention if so desired. Generation of chatbots from other types of data sources like APIs [7,16], web pages [6], knowledge graphs [2] or even software designs [12] has also been explored and some of their ideas could be exploited as well for tabular data. However, we did not find any existing solution for automatic generation of intent-based chatbots from tabular datasets that we could compare with this work. ...

CONQUEST: A Framework for Building Template-Based IQA Chatbots for Enterprise Knowledge Graphs

Lecture Notes in Computer Science