Guilin Qi's research while affiliated with University of Alaska Southeast and other places

Publications (243)

Article
The rise of the Internet and social media (i.e. reviews, forum discussions, blogs and social networks) constituted an interesting source to detect user opinion trends. This study examines the global publication output on opinion mining and sentiment analysis from documents published in 2000 to 2020. Bibliometric indicators on the trends, most cited...
Article
The emerging neural topic models make topic modeling more easily adaptable and extendable in unsupervised text mining. However, the existing neural topic models are difficult to retain representative information of the documents within the learnt topic representation. Fortunately, Deep Mutual Information Estimation (DMIE), which maximizes the mutua...
Article
Discovering the semantic relationships among heterogeneous ontologies has been one of the core research topics in Semantic Web. As ontology matching systems inevitably adopt heuristic strategies, wrong mappings are often contained in final alignments. Most methods for mapping revision depend on dealing with logical incoherence. However, erroneous m...
Article
Full-text available
The massive number of research articles on the Web makes it troublesome for researchers to identify related works that could meet their preferences and interests. Consequently, various network representation learning-based models have been proposed to produce citation recommendations. Nevertheless, these models do not exploit semantic relations and...
Conference Paper
The few-shot problem is an urgent challenge for single-table text-to-SQL. Existing methods ignore the potential value of unlabeled data, and merely rely on a coarse-grained Meta-Learning (ML) algorithm that neglects the differences of column contributions to the optimization object. This paper proposes a Meta Self-Training text-to-SQL (MST-SQL) met...
Article
During the last decades, recommender systems are becoming quite popular since they provide great assistance to users on social networks and library websites. Unfortunately, the large volume of data combined with sparsity makes personalization a difficult task. In this regard, several models were introduced in the literature that suffers from the co...
Article
Knowledge graph (KG) embedding encodes the entities and relations from a KG into low-dimensional vector spaces to support various applications such as KG completion, question answering, and recommender systems. In real world, knowledge graphs (KGs) are dynamic and evolve over time with addition or deletion of triples. However, most existing models...
Article
Measuring the similarity between legal documents to find prior documents from a massive collection that are similar to a current document is an essential component in legal assistant systems. This type of system can automatically link related legal documents to ensure that the same situations are treated identically in judicial practice. Most exist...
Article
Background: With the continuous spread of COVID-19, information about the worldwide pandemic is exploding. It's necessary and significant to organize large information. As the key branch of AI, Knowledge Graph (KG) is helpful to structure, reason and understand data. Objective: To improve the utilization value of the information and effectively...
Article
Full-text available
Intent detection and slot filling are the two most essential tasks of natural language understanding (NLU). Deep neural models have produced impressive results on these tasks. However, the predictive accuracy of these models heavily depends upon a massive amount of supervised data. In many applications collecting high-quality labeled data is a very...
Preprint
Full-text available
The emerging neural topic models make topic modeling more easily adaptable and extendable in unsupervised text mining. However, the existing neural topic models is difficult to retain representative information of the documents within the learnt topic representation. In this paper, we propose a neural topic model which incorporates deep mutual info...
Preprint
BACKGROUND With the continuous spread of COVID-19, information about the worldwide pandemic is exploding. Therefore, it is necessary and significant to organize such a large amount of information. As the key branch of artificial intelligence, a knowledge graph (KG) is helpful to structure, reason, and understand data. OBJECTIVE To improve the util...
Article
Multi-modal entity linking plays a crucial role in a wide range of knowledgebased modal-fusion tasks, i.e., multi-modal retrieval and multi-modal event extraction. We introduce the new ZEro-shot Multi-modal Entity Linking (ZEMEL) task, the format is similar to multi-modal entity linking, but multi-modal mentions are linked to unseen entities in the...
Article
As the core building blocks of the Semantic Web, ontologies provide shared vocabularies and conceptual knowledge for specific application fields. At the same time, ontologies can restrict their individuals and relationships through their semantic schema. However, logical conflicts of an ontology are often inevitably unavoidable in actual applicatio...
Article
Full-text available
This paper introduces a natural language understanding (NLU) framework for argumentative dialogue systems in the information-seeking and opinion building domain. The proposed framework consists of two sub-models, namely intent classifier and argument similarity. Intent classifier model stack BiLSTM with attention mechanism on top of pre-trained BER...
Chapter
Extracting knowledge from Wikipedia has attracted much attention in recent ten years. One of the most valuable kinds of knowledge is type information, which refers to the axioms stating that an instance is of a certain type. Current approaches for inferring the types of instances from Wikipedia mainly rely on some language-specific rules. Since the...
Article
Query graph construction aims to construct the correct executable SPARQL on the KG to answer natural language questions. Although recent methods have achieved good results using neural network-based query graph ranking, they suffer from three new challenges when handling more complex questions: 1) complicated SPARQL syntax, 2) huge search space, an...
Preprint
Event extraction (EE) plays an important role in many industrial application scenarios, and high-quality EE methods require a large amount of manual annotation data to train supervised learning models. However, the cost of obtaining annotation data is very high, especially for annotation of domain events, which requires the participation of experts...
Article
Full-text available
Existing visual scene understanding methods mainly focus on identifying coarse-grained concepts about the visual objects and their relationships, largely neglecting fine-grained scene understanding. In fact, many data-driven applications on the web (e.g. newsreading and e-shopping) require to accurately recognize much less coarse concepts as entiti...
Preprint
Full-text available
Query graph building aims to build correct executable SPARQL over the knowledge graph for answering natural language questions. Although recent approaches perform well by NN-based query graph ranking, more complex questions bring three new challenges: complicated SPARQL syntax, huge search space for ranking, and noisy query graphs with local ambigu...
Preprint
The ability to generate natural-language questions with controlled complexity levels is highly desirable as it further expands the applicability of question generation. In this paper, we propose an end-to-end neural complexity-controllable question generation model, which incorporates a mixture of experts (MoE) as the selector of soft templates to...
Chapter
Document-level event extraction (DEE) now draws a huge amount of researchers’ attention. Not only the researches on sentence-level event extraction have obtained a great progress, but researchers realize that an event is usually described by multiple sentences in a document especially for fields such as finance, medicine, and judicature. Several do...
Chapter
Multimodal tasks are gradually attracting the attention of the research community, and the lack of multimodal event extraction datasets restricts the development of multimodal event extraction. We introduce the new Multimodal Event Extraction Dataset (MEED) to fill the gap, we define event types and argument roles that can be used on multimodal dat...
Preprint
Full-text available
Single-table text-to-SQL aims to transform a natural language question into a SQL query according to one single table. Recent work has made promising progress on this task by pre-trained language models and a multi-submodule framework. However, zero-shot table, that is, the invisible table in the training set, is currently the most critical bottlen...
Preprint
Full-text available
Formal query building is an important part of complex question answering over knowledge bases. It aims to build correct executable queries for questions. Recent methods try to rank candidate queries generated by a state-transition strategy. However, this candidate generation strategy ignores the structure of queries, resulting in a considerable num...
Article
Distant supervision relation extraction (DSRE) trains a classifier by automatically labeling data through aligning triples in the knowledge base (KB) with large-scale corpora. Training data generated by distant supervision may contain many mislabeled instances, which is harmful to the training of the classifier. Some recent methods show that releva...
Preprint
Event detection (ED) aims at detecting event trigger words in sentences and classifying them into specific event types. In real-world applications, ED typically does not have sufficient labelled data, thus can be formulated as a few-shot learning problem. To tackle the issue of low sample diversity in few-shot ED, we propose a novel knowledge-based...
Article
Single-table text-to-SQL aims to transform a natural language question into a SQL query according to one single table. Recent work has made promising progress on this task by pre-trained language models and a multi-submodule framework. However, zero-shot table, that is, the invisible table in the training set, is currently the most critical bottlen...
Article
Continual relation extraction is an important task that focuses on extracting new facts incrementally from unstructured text. Given the sequential arrival order of the relations, this task is prone to two serious challenges, namely catastrophic forgetting and order-sensitivity. We propose a novel curriculum-meta learning method to tackle the above...
Article
Defect prediction, the task of predicting the presence of defects in source code artifacts, has broad application in software development. Defect prediction faces two major challenges, label scarcity, where only a small percentage of code artifacts are labeled, and data imbalance, where the majority of labeled artifacts are non-defective. Moreover,...
Article
Full-text available
The early concept of knowledge graph originates from the idea of the Semantic Web, which aims at using structured graphs to model the knowledge of the world and record the relationships that exist between things. Currently publishing knowledge bases as open data on the Web has gained significant attention. In China, CIPS(Chinese Information Process...
Article
Knowledge graph (KG) embedding techniques represent entities and relations as low-dimensional and continuous vectors. This enables KG machine learning models to be easily adapted for KG reasoning, completion, and querying tasks. However, learned dense vectors are inefficient for large-scale similarity computations. Learning-to-hash is to a method t...
Chapter
Recent advances in relational information extraction have allowed to automatically construct large-scale knowledge graphs (KGs). Nevertheless, an automatic process entails that a significant amount of uncertain facts are introduced into KGs. Uncertain knowledge graphs (UKGs) such as NELL and Probase model this kind of uncertainty as confidence scor...
Article
The variety and plethora of research papers available on the Web motivated researchers to propose models that could assist users with personalized citation recommendations. In recent years, citation recommendation models using Network Representation Learning (NRL) methods have shown promising results. Nevertheless, existing NRL-based models are lim...
Article
Full-text available
Link trustworthiness evaluation is a crucial task for information networks to evaluate the probability of a link being true in a heterogeneous information network (HIN). This task can significantly influence the effectiveness of downstream analysis. However, the performance of existing evaluation methods is limited, as they can only utilize incompl...
Preprint
Full-text available
This paper introduces a natural language understanding (NLU) framework for argumentative dialogue systems in the information-seeking and opinion building domain. Our approach distinguishes multiple user intents and identifies system arguments the user refers to in his or her natural language utterances. Our model is applicable in an argumentative d...
Preprint
Continual relation extraction is an important task that focuses on extracting new facts incrementally from unstructured text. Given the sequential arrival order of the relations, this task is prone to two serious challenges, namely catastrophic forgetting and order-sensitivity. We propose a novel curriculum-meta learning method to tackle the above...
Article
Question answering is an effective method for obtaining information from knowledge bases (KB). In this paper, we propose the Neural-Symbolic Complex Question Answering (NS-CQA) model, a data-efficient reinforcement learning framework for complex question answering by using only a modest number of training samples. Our framework consists of a neural...
Article
Large-scale knowledge graphs such as Wikidata and DBpedia have become a powerful asset for semantic search and question answering. However, most of the knowledge graph construction works focus on organizing and discovering textual knowledge in a structured representation, while paying little attention to the proliferation of visual resources on the...
Chapter
Full-text available
Based on Semantic Web technologies, knowledge graphs help users to discover information of interest by using live SPARQL services. Answer-seekers often examine intermediate results iteratively and modify SPARQL queries repeatedly in a search session. In this context, understanding user behaviors is critical for effective intention prediction and qu...
Preprint
Complex question-answering (CQA) involves answering complex natural-language questions on a knowledge base (KB). However, the conventional neural program induction (NPI) approach exhibits uneven performance when the questions have different types, harboring inherently different characteristics, e.g., difficulty level. This paper proposes a meta-rei...
Preprint
Question answering is an effective method for obtaining information from knowledge bases (KB). In this paper, we propose the Neural-Symbolic Complex Question Answering (NS-CQA) model, a data-efficient reinforcement learning framework for complex question answering by using only a modest number of training samples. Our framework consists of a neural...
Preprint
A compelling approach to complex question answering is to convert the question to a sequence of actions, which can then be executed on the knowledge base to yield the answer, aka the programmer-interpreter approach. Use similar training questions to the test question, meta-learning enables the programmer to adapt to unseen questions to tackle poten...
Preprint
Question generation over knowledge bases (KBQG) aims at generating natural-language questions about a subgraph, i.e. a set of (connected) triples. Two main challenges still face the current crop of encoder-decoder-based methods, especially on small subgraphs: (1) low diversity and poor fluency due to the limited information contained in the subgrap...
Preprint
Full-text available
Automatic charge prediction task aims to determine the final charges based on fact descriptions of criminal cases, which is a vital application of legal assistant systems. Conventional works usually depend on fact descriptions to predict charges while ignoring the legal schematic knowledge, which makes it difficult to distinguish confusing charges....
Article
Full-text available
Recommender systems have been used since the beginning of the Web to assist users with personalized suggestions related to past preferences for items or products including books, movies, images, research papers and web pages. The availability of millions research articles on various digital libraries makes it difficult for a researcher to find rele...
Chapter
Automatic charge prediction task aims to determine the final charges based on fact descriptions of criminal cases, which is a vital application of legal assistant systems. Conventional works usually depend on fact descriptions to predict charges while ignoring the legal schematic knowledge, which makes it difficult to distinguish confusing charges....
Article
Intent determination and slot filling are two prominent research areas related to natural language understanding (NLU). In a multi-turn NLU system, contextual information from dialogue history is exploited to mitigate the ambiguity of user utterance. State-of-the-art models employ memory networks to encode dialogue context, which is used by neural...
Conference Paper
Full-text available
Based on Semantic Web technologies, knowledge graphs help users to discover information of interest by using live SPARQL services. Answer-seekers often examine intermediate results iteratively and modify SPARQL queries repeatedly in a search session. In this context, understanding user behaviors is critical for effective intention prediction and qu...
Preprint
Based on Semantic Web technologies, knowledge graphs help users to discover information of interest by using live SPARQL services. Answer-seekers often examine intermediate results iteratively and modify SPARQL queries repeatedly in a search session. In this context, understanding user behaviors is critical for effective intention prediction and qu...
Article
Researchers face millions of research papers on various digital libraries. Therefore, finding relevant research work that meets the preferences of a researcher is a challenging task. Hence, different paper recommendation models have been proposed to address this issue. However, these models lack in exploiting prominent information factors, namely:...
Article
Full-text available
In recent years, lots of knowledge graphs built from Wikipedia, the largest multilingual online encyclopedia, have been published on the Web to support various applications. However, since non-English data in Wikipedia are sparse, some projects work on knowledge graph construction from multiple non-English online encyclopedias, but many technical d...
Conference Paper
A compelling approach to complex question answering is to convert the question to a sequence of actions, which can then be executed on the knowledge base to yield the answer, aka the programmer-interpreter approach. Use similar training questions to the test question, meta-learning enables the programmer to adapt to unseen questions to tackle poten...
Article
Full-text available
The aim of visual relation detection is to provide a comprehensive understanding of an image by describing all the objects within the scene, and how they relate to each other, in < object-predicate-object > form; for example, < person-lean on-wall > . This ability is vital for image captioning, visual question answering, and many other applications...
Article
Sentiment analysis in various languages has been a hot research topic with several applications. Most of the existing models have been reported to work well with widely used language. Were the lass directly applying these models to poor-quality corpora often leads to low results. Thus, to deal with these shortcoming we propose a cross-lingual senti...
Preprint
Full-text available
Knowledge graph models world knowledge as concepts, entities, and the relationships between them, which has been widely used in many real-world tasks. CCKS 2019 held an evaluation track with 6 tasks and attracted more than 1,600 teams. In this paper, we give an overview of the knowledge graph evaluation tract at CCKS 2019. By reviewing the task def...
Conference Paper
Full-text available
Traditional Chinese medicine (TCM) is an important intangible cultural heritage of China. To enhance the services of TCM, many works focus on constructing various types of TCM knowledge graphs according to the concrete requirements such as information retrieval. However, most of them ignored several key issues. One is temporal information that is v...
Chapter
Much attention has recently been given to knowledge graphs embedding by exploiting latent and semantic relations among entities and incorporating the structured knowledge they contain into machine learning. Most of the existing graph embedding models can only encode a simple model of the data, while few models are designed for ontology rich knowled...
Chapter
Knowledge graph embedding aims to represent structured entities and relations as continuous and dense low-dimensional vectors. With more and more embedding models being proposed, it has been widely used in many tasks such as semantic search, knowledge graph completion and intelligent question and answer. Most knowledge graph embedding models focus...
Chapter
In this paper, we study the Dispute Generation (DG) problem from the plaintiff allegation (PA) and the defendant argument (DA) in a law document. We are the first to formulate DG as a text-to-text natural language generation (NLG) problem. Since the logical relationships between a PA and a DA are rather difficult to identify, existing models cannot...
Chapter
Large-scale knowledge graphs such as Wikidata and DBpedia have become a powerful asset for semantic search and question answering. However, most of the knowledge graph construction works focus on organizing and discovering textual knowledge in a structured representation while paying little attention to the proliferation of visual resources on the...
Article
Full-text available
Knowledge graphs (KGs) contain rich resources that represent human knowledge in the world. There are mainly two kinds of reasoning techniques in knowledge graphs, symbolic reasoning and statistical reasoning. However, both of them have their merits and limitations. Therefore, it is desirable to combine them to provide hybrid reasoning in a knowledg...