PreprintPDF Available
Preprints and early-stage research may not have been peer reviewed yet.

Abstract and Figures

With the explosive growth of artificial intelligence (AI) and big data, it has become vitally important to organize and represent the enormous volume of knowledge appropriately. As graph data, knowledge graphs accumulate and convey knowledge of the real world. It has been well-recognized that knowledge graphs effectively represent complex information; hence, they rapidly gain the attention of academia and industry in recent years. Thus to develop a deeper understanding of knowledge graphs, this paper presents a systematic overview of this field. Specifically, we focus on the opportunities and challenges of knowledge graphs. We first review the opportunities of knowledge graphs in terms of two aspects: (1) AI systems built upon knowledge graphs; (2) potential application fields of knowledge graphs. Then, we thoroughly discuss severe technical challenges in this field, such as knowledge graph embeddings, knowledge acquisition, knowledge graph completion, knowledge fusion, and knowledge reasoning. We expect that this survey will shed new light on future research and the development of knowledge graphs.
Content may be subject to copyright.
Knowledge Graphs: Opportunities and
Challenges
Ciyuan Peng1, Feng Xia2*, Mehdi Naseriparsa3
and Francesco Osborne4
1Institute of Innovation, Science and Sustainability, Federation
University Australia, Ballarat, 3353, VIC, Australia.
2School of Computing Technologies, RMIT University,
Melbourne, 3000, VIC, Australia.
3Global Professional School, Federation University Australia,
Ballarat, 3353, VIC, Australia.
4Knowledge Media Institute, The Open University, Milton
Keynes, MK7 6AA, UK.
*Corresponding author(s). E-mail(s): f.xia@ieee.org;
Contributing authors: ciyuan.p@outlook.com;
m.naseriparsa@federation.edu.au;francesco.osborne@open.ac.uk;
Abstract
With the explosive growth of artificial intelligence (AI) and big data,
it has become vitally important to organize and represent the enor-
mous volume of knowledge appropriately. As graph data, knowledge
graphs accumulate and convey knowledge of the real world. It has
been well-recognized that knowledge graphs effectively represent com-
plex information; hence, they rapidly gain the attention of academia
and industry in recent years. Thus to develop a deeper understanding of
knowledge graphs, this paper presents a systematic overview of this field.
Specifically, we focus on the opportunities and challenges of knowledge
graphs. We first review the opportunities of knowledge graphs in terms
of two aspects: (1) AI systems built upon knowledge graphs; (2) poten-
tial application fields of knowledge graphs. Then, we thoroughly discuss
severe technical challenges in this field, such as knowledge graph embed-
dings, knowledge acquisition, knowledge graph completion, knowledge
fusion, and knowledge reasoning. We expect that this survey will shed
new light on future research and the development of knowledge graphs.
1
arXiv:2303.13948v1 [cs.AI] 24 Mar 2023
2Knowledge Graphs: Opportunities and Challenges
Keywords: Knowledge graphs, artificial intelligence, graph embedding,
knowledge engineering, graph learning
1 Introduction
Knowledge plays a vital role in human existence and development. Learning
and representing human knowledge are crucial tasks in artificial intelligence
(AI) research. While humans are able to understand and analyze their sur-
roundings, AI systems require additional knowledge to obtain the same abilities
and solve complex tasks in realistic scenarios (Ji et al,2021). To support these
systems, we have seen the emergence of many approaches for representing
human knowledge according to different conceptual models. In the last decade,
knowledge graphs have become a standard solution in this space, as well as a
research trend in academia and industry (Kong et al,2022).
Knowledge graphs are defined as graphs of data that accumulate and con-
vey knowledge of the real world. The nodes in the knowledge graphs represent
the entities of interest, and the edges represent the relations between the
entities (Hogan et al,2021;Cheng et al,2022b). These representations uti-
lize formal semantics, which allows computers to process them efficiently and
unambiguously. For example, the entity “Bill Gates” can be linked to the entity
“Microsoft” because Bill Gates is the founder of Microsoft; thus, they have
relationships in the real world.
Due to the great significance of knowledge graphs in processing heteroge-
neous information within a machine-readable context, a considerable amount
of research has been conducted continuously on these solutions in recent years
(Dai et al,2020b). The proposed knowledge graphs are widely employed in
various AI systems recently (Ko et al,2021;Mohamed et al,2021), such as rec-
ommender systems, question answering, and information retrieval. They are
also widely applied in many fields (e.g., education and medical care) to benefit
human life and society. (Sun et al,2020;Bounhas et al,2020).
Therefore, knowledge graphs have seized great opportunities by improving
the quality of AI systems and being applied to various areas. However, the
research on knowledge graphs still faces significant technical challenges. For
example, there are major limitations in the current technologies for acquiring
knowledge from multiple sources and integrating them into a typical knowledge
graph. Thus, knowledge graphs provide great opportunities in modern society.
However, there are technical challenges in their development. Consequently,
it is necessary to analyze the knowledge graphs with respect to their oppor-
tunities and challenges to develop a better understanding of the knowledge
graphs.
To deeply understand the development of knowledge graphs, this survey
extensively analyzes knowledge graphs in terms of their opportunities and
challenges. Firstly, we discuss the opportunities of knowledge graphs in terms
of two aspects: AI systems whose performance is significantly improved by
Knowledge Graphs: Opportunities and Challenges 3
knowledge graphs and application fields that benefit from knowledge graphs.
Then, we analyze the challenges of the knowledge graph by considering the
limitations of knowledge graph technologies. The main contributions of this
paper are as follows:
Survey on knowledge graphs. We conduct a comprehensive survey of
existing knowledge graph studies. In particular, this work thoroughly ana-
lyzes the advancements in knowledge graphs in terms of state-of-the-art
technologies and applications.
Knowledge graph opportunities. We investigate potential opportunities
for knowledge graphs in terms of knowledge graph-based AI systems and
application fields that utilize knowledge graphs. Firstly, we examine the ben-
efits of knowledge graphs for AI systems, including recommender systems,
question-answering systems, and information retrieval. Then, we discuss the
far-reaching impacts of knowledge graphs on human society by describing
current and potential knowledge graph applications in various fields (e.g.,
education, scientific research, social media, and medical care).
Knowledge graph challenges. We provide deep insights into significant
technical challenges facing knowledge graphs. In particular, we elaborate
on limitations concerning five representative knowledge graph technologies,
including knowledge graph embeddings, knowledge acquisition, knowledge
graph completion, knowledge fusion, and knowledge reasoning.
The rest of the paper is organized as follows. Section 2provides an overview
of knowledge graphs, including the definitions and the categorization of exist-
ing research on knowledge graphs. To examine the opportunities of knowledge
graphs, Section 3and Section 4introduce relevant AI systems and application
fields, respectively. Section 5details the challenges of knowledge graphs based
on the technologies. Finally, we conclude this paper in Section 6.
2 Overview
In this section, the definition of knowledge graphs is provided first; then, we
categorize significant state-of-the-art research in this area.
2.1 What are Knowledge Graphs?
A knowledge base is a typical data set that represents real-world facts and
semantic relations in the form of triplets. When the triplets are represented as a
graph with edges as relations and nodes as entities, it is considered a knowledge
graph. Generally, the knowledge graph and knowledge base are regarded as
the same concept and are used interchangeably. In addition, the schema for a
knowledge graph can be defined as an ontology, which shows the properties of
a specific domain and how they are related. Therefore, one essential stage of
knowledge graph construction is ontology construction.
4Knowledge Graphs: Opportunities and Challenges
Fig. 1 An example of a knowledge graph. In this knowledge graph, (e1, r1, e2) is a triplet
that indicates e1and e2are connected by relation r1.
In 2012, Google first put forward Knowledge Graph by introducing their
knowledge base called Google Knowledge Graph (Ehrlinger and ,2016).
Afterward, many knowledge graphs are introduced and adopted such as:
DBpedia, a knowledge graph that intends to discover semantically mean-
ingful information form Wikipedia and convert it into an effective well-
structured ontological knowledge base in DBpedia (Auer et al,2007).
Freebase, a knowledge graph which is built upon multiple sources that
provides a structured and global resource of information (Bollacker et al,
2008).
Facebook’s entity graph, a knowledge graph that converts the unstructured
content of the user profiles into meaningful structured data (Ugander et al,
2011).
Wikidata, a cross-lingual document-oriented knowledge graph which sup-
ports many sites and services such as Wikipedia (Vrandeˇci´c and Kr¨otzsch,
2014).
Yago, is a quality knowledge base that contains a huge number of entities and
their corresponding relationships. These entities are extracted from multiple
sources such as Wikipedia and WordNet (Rebele et al,2016).
WordNet, is a lexical knowledge base to measure the semantic similar-
ity between words. The knowledge base contains a number of hierarchical
concept graphs to analyse the semantic similarity (Pedersen et al,2004).
A knowledge graph is a directed graph composed of nodes and edges, where
one node indicates an entity (a real object or abstract concept), and the edge
between the two nodes conveys the semantic relation between the two enti-
ties (Bordes et al,2011). Resource Description Framework (RDF) and Labeled
Property Graphs (LPGs) are two typical ways to represent and manage knowl-
edge graphs (arber et al,2018;Baken,2020). The fundamental unit of a
knowledge graph is the triple (subject, predicate, object) (or (head, relation,
tail)), i.e., (Bill Gates, founderOf, Microsoft). Since the relation is not neces-
sarily symmetric, the direction of a link matters. Therefore, a knowledge graph
can also be seen as a directed graph in which the head entities point to the
tail entities via the relation’s edge.
Knowledge Graphs: Opportunities and Challenges 5
Fig. 2 Research on knowledge graphs.
Fig. 1depicts an example of a simple knowledge graph. As shown in Fig. 1,
nodes e1and e2darkened in color are connected by relation r1, which goes
from e1to e2. Therefore, e1,e2, and r1can form the triplet (e1, r1, e2), in
which e1and e2are the head and tail entities, respectively.
2.2 Current Research on Knowledge Graphs
In recent years, knowledge graphs have gained extensive research interest.
Plenty of studies have focused on exploring knowledge graphs. This paper
conducts a comprehensive survey on knowledge graphs and lists seven impor-
tant categories of current research on this topic. Fig. 2illustrates a schema
of the most popular research lines regarding knowledge graphs. Among them,
AI systems are services that utilize knowledge graphs for their foundation,
and application fields are domains where knowledge graphs reach. These two
research lines are listed for discussing the opportunities of knowledge graphs.
Another five research lines are five main knowledge graph technologies corre-
sponding to five tasks. In this paper, we introduce these five technologies and
emphasize their limitations to give useful insights into the major challenges of
the knowledge graphs.
Knowledge Graph Embedding: Knowledge graph embedding is one
of the central research issues. This task aims to map entities and relations
of a knowledge graph to a low-dimensional vector space so that it captures
the semantics and the structure of the knowledge graph efficiently (Dai et al,
2020b). Then, the obtained feature vector can be effectively learned by machine
learning models. Three main triplet fact-based embedding methods are as
follows: (a) tensor factorization-based, (b) translation-based, and (c) neural
network-based methods (Dai et al,2020b).
Knowledge Acquisition: Knowledge acquisition, which focuses on mod-
eling and constructing knowledge graphs, is another crucial research direction
of knowledge graph study. Typically, the knowledge is imported from struc-
tured sources by employing mapping languages, such as R2RML (Rodriguez-
Muro and Rezk,2015). Furthermore, the knowledge could be extracted from
6Knowledge Graphs: Opportunities and Challenges
unstructured documents (e.g., news, research papers, and patents) by adopt-
ing relation, entity, or attribute extraction methods (Liu et al,2020;Yu et al,
2020;Yao et al,2019).
Knowledge Graph Completion: Although there are many methods for
constructing knowledge graphs, it is still unfeasible to create comprehensive
representations of all the knowledge in a field. Most knowledge graphs still
lack a good number of entities and relationships. Thereby, significant efforts
have been made for completing knowledge graphs. Knowledge graph comple-
tion aims to improve the quality of knowledge graphs by predicting additional
relationships and entities. The first task typically adopts link prediction tech-
niques to generate triplets and then assigns the triplets plausibility scores (Ji
et al,2021). The second task employs entity prediction methods for obtaining
and integrating further information from external sources.
Knowledge Fusion: Knowledge fusion is also an important research
direction that focuses on capturing knowledge from different sources and inte-
grating it into a knowledge graph (Nguyen et al,2020). The knowledge fusion
approaches are useful for both generating and completing knowledge graphs.
Recently, entity alignment has been the primary method for implementing
knowledge fusion tasks.
Knowledge Reasoning: Tremendous research efforts have focused on rea-
soning to enrich the knowledge graphs, which aims to infer new facts based on
existing data (Minervini et al,2020). In particular, new relations between two
unconnected entities are inferred, forming new triplets. Also, by reasoning out
the false facts, knowledge reasoning has the ability to identify erroneous knowl-
edge. The main methods for knowledge reasoning include logic rule-based,
distributed representation-based, and neural network-based methods (Chen
et al,2020b).
AI Systems: Nowadays, knowledge graphs are widely utilized by AI sys-
tems (Liang et al,2022), such as recommenders, question-answering systems,
and information retrieval tools. Typically, the richness of information within
knowledge graphs enhances the performance of these solutions. Therefore,
many studies have focused on taking advantage of knowledge graphs to improve
AI systems’ performance.
Application Fields: Knowledge graphs have numerous applications in
various fields, including education, scientific research, social media, and med-
ical care (Li et al,2020b). A variety of intelligent applications are required to
improve the standard of human life.
Differing from other works, this paper focuses on surveying the opportuni-
ties and challenges of knowledge graphs. In particular, knowledge graphs meet
great opportunities by improving the quality of AI services and being applied in
various fields. On the contrary, this paper regards the limitations of knowledge
graph technologies as the challenges. Therefore, we will discuss the techni-
cal limitations regarding knowledge graph embeddings, knowledge acquisition,
knowledge graph completion, knowledge fusion, and knowledge reasoning.
Knowledge Graphs: Opportunities and Challenges 7
3 Knowledge Graphs for AI Systems
This section explains the opportunities by analyzing the advantages that
knowledge graphs bring for improving the functionalities of AI Systems.
Specifically, there are a couple of systems, including recommender systems,
question-answering systems, and information retrieval tools (Guo et al,2020;
Zou,2020), which utilize knowledge graphs for their input data and benefit
the most from knowledge graphs. In addition to these systems, other AI sys-
tems, such as image recognition systems (Chen et al,2020a), have started to
consider the characteristic of knowledge graphs. However, the application of
knowledge graphs in these systems is not widespread. Moreover, these systems
do not directly optimize performance by utilizing knowledge graphs for the
input data. Therefore, the advantages that knowledge graphs bring for recom-
mender systems, question-answering systems, and information retrieval tools
are discussed in detail to analyze the opportunities of knowledge graphs. Typ-
ically, these solutions greatly benefit from adopting knowledge graphs that
offer high-quality representations of the domain knowledge. Table 1 presents
a summary of the AI systems that we will discuss below.
3.1 Recommender Systems
With the continuous development of big data, we observe the exponential
growth of information. In the age of information explosion, it becomes chal-
lenging for people to receive valid and reliable information (Shokeen and Rana,
2020;Monti et al,2021;omez et al,2022). Specifically, online users may feel
confused when they want to select some items they are interested in among
thousands of choices. To tackle this issue, we saw the emergence of several
recommender systems to provide users with more accurate information. Typ-
ically, recommender systems learn the preference of target users for a set of
items (Wan et al,2020;Zheng and Wang,2022) and produce a set of suggested
items with similar characteristics. Recommender systems are fruitful solutions
to the information explosion problem and are employed in various fields for
enhancing user experience (Quijano-S´anchez et al,2020).
3.1.1 Traditional Recommender Systems
There are two traditional methods for developing recommender systems,
including content-based and collaborative filtering-based (CF-based) methods.
Shu et al. (Sun et al,2019b) and Guo et al. (Guo et al,2020) have compared
and summarised these two approaches.
Content-based Recommender Systems: The content-based recom-
mender systems first analyze the content features of items (e.g., descriptions,
documents). These items are previously scored by the target users (Guo et al,
2020;Xia et al,2014b). Then, the recommender systems learn the user inter-
ests by employing machine learning models. Thus, these systems are able to
effectively recommend trending items to the target users according to their
preferences. Some recommender systems utilize the content of the original
8Knowledge Graphs: Opportunities and Challenges
Table 1 AI systems using knowledge graphs.
AI Systems Approaches Techniques on knowledge graphs
Recommender systems KPRN (Wang et al,2019c) Entity-relation path generation based on user-item
interaction
RippleNet (Wang et al,2018b) Preference propagation
MKR (Wang et al,2019a) Laten user-item interaction
MKGAT (Sun et al,2020) Neighbor information extraction; relation reasoning
Ripp-MKR (Wang et al,2021) Preference propagation; laten user-item interaction
RKG (Shu and Huang,2021) User preferenfce lists-based knowledge graph con-
struction
Question-answering systems MHPGM (Bauer et al,2018) Multiple hop relation reasoning
PCQA (Shin et al,2019) Predicate constraints-based relation extraction
KEQA (Huang et al,2019) Simple question-based triplet construction
EmbedKGQA (Saxena et al,
2020)
Knowledge graph embedding-based multi-hop
question answering
Information retrieval EQFE (Dalton et al,2014) Query knowledge graph-based feature expansion
Knowledge graph based Infor-
mation Retrieval Technology
(Wang et al,2018a)
Query-document knowledge graph construction
CKG (Wise et al,2020) Document knowledge graph construction
EDRM(Liu et al,2018) Integration of semantics from knowledge graphs
and entities from queries and documents represen-
tations of their entities
Knowledge Graphs: Opportunities and Challenges 9
query result to discover highly-related items for the users that may interest
them (Naseriparsa et al,2019b). These systems employ machine learning tech-
niques or statistical measures such as correlation to compute the highly-similar
items to those that are visited by the users (Naseriparsa et al,2019a). Another
group of content-based recommender systems employs lexical references such
as dictionaries to utilize semantic relationships of the user query results to
recommend highly semantically-related items to the users that may directly
satisfy their information needs (Naseriparsa et al,2018;Sun et al,2017).
CF-based Recommender Systems: CF-based recommender systems
suggest items to the users based on the information of user-item interaction
(Chen et al,2020c). CF-based recommender systems infer the user prefer-
ence by clustering similar users instead of extracting the features of the items
(Wang et al,2019b). However, we face data sparsity and cold start problems
in traditional CF-based systems. In general, users can only rate a few items
among a large number of items, which leads to preventing many items from
receiving appropriate feedback. Therefore, the recommender systems do not
effectively learn user preferences accurately because of data sparsity (Bai et al,
2019;Xia et al,2014a). On the other hand, the cold start problem makes
it even more difficult to make recommendations when the items or users are
new because there is no historical data or ground truth. Moreover, because
abundant user information is required for achieving effective recommendations,
CF-based recommender systems face privacy issues. How to achieve personal-
ized recommendations while protecting the privacy of users is still an unsolved
problem.
3.1.2 Knowledge Graph-based Recommender Systems
To address inherent problems of traditional approaches, the community has
produced several hybrid recommender systems, which consider both item
features and the distribution of user scores. Most of these solutions adopt
knowledge graphs for representing and interlinking items (Palumbo et al,
2020). Specifically, Knowledge graph-based recommender systems integrate
knowledge graphs as auxiliary information and leverage users and items net-
works to learn the relationships of items-users, items-items, and users-users
(Palumbo et al,2018).
Fig 3presents an example of knowledge graph-based movie recommenda-
tion. Here we can see that the movies “Once Upon A Time in Hollywood”
and “Interstellar” are recommended to three users according to a knowledge
graph that contains the nodes of users, films, directors, actors, and genres. The
knowledge graph is thus used to infer latent relations between the user and
the recommended movies.
Recently, a great deal of research has been conducted to utilize knowl-
edge graphs for recommendation tasks. For instance, Wang et al. (Wang et al,
2019c) introduced KPRN. KPRN is a recommender system that generates
entity-relation paths according to the user-item interaction and constructs a
knowledge graph that consists of the users, items, and their interaction. It
10 Knowledge Graphs: Opportunities and Challenges
Fig. 3 An example of knowledge graph-based recommender system.
then infers the user preference based on the entity-relation path. The user-item
interaction, which is extracted from knowledge graphs, improves the quality of
the recommendations and allows the presentation of the recommended results
in a more explainable manner. Wang et al. (Wang et al,2019a) also applied
multi-task knowledge graph representation (MKR) for recommendation tasks.
MKR models knowledge graphs based on the user-item interaction. It is worth
noting that MKR focuses on the structural information of knowledge graphs for
learning the latent user-item interaction. Sun et al. (Sun et al,2020) proposed
a Multi-modal Knowledge Graph Attention Network (MKGAT) for achieving
precise recommendations. MKGAT constructs knowledge graphs based on two
aspects: (1) it enriches entity information by extracting the information of the
neighbor entities; (2) it scores the triplets to construct the reasoning relations.
Finally, they applied knowledge graphs that are enriched with structured data
to recommender systems.
Wang et al. (Wang et al,2018b) presented the RippleNet model, which
incorporates knowledge graphs into recommendation tasks by preference prop-
agation. RippleNet firstly regards users’ historical records as the basis of a
knowledge graph. Then, it predicts the user preference list among candidate
items based on the knowledge graph links. Based on both RippleNet and
MKR models, Wang et al. (Wang et al,2021) applied the Ripp-MKR model.
Ripp-MKR combines the advantages of preference propagation and user-item
interaction to dig the potential information of knowledge graphs. Shu et al.
(Shu and Huang,2021) proposed RKG, which achieves recommendation by
referring to the user preference-based knowledge graph. RKG first obtains
users’ preference lists; then, it analyzes the relations between the user’s pre-
ferred items and the items which are to be recommended. Therefore, the
model effectively learns the score of the candidate items for recommendation
according to the candidate items’ relationship with the user’s preferred items.
Many studies have utilized ontological knowledge base information to
improve retrieving results from various data sources (Farf´an et al,2009). Wu
et al. (Wu et al,2013) adopted the ontological knowledge base to extract
Knowledge Graphs: Opportunities and Challenges 11
highly semantically similar sub-graphs in graph databases. Their method effec-
tively recommends semantically relevant sub-graphs according to ontological
information. Farf et al. (Farf´an et al,2009) proposed the XOntoRank, which
adopts the ontological knowledge base to facilitate the data exploration and
recommendation on XML medical records.
Compared with the traditional recommender systems, knowledge graph-
based recommender systems have the following advantages:
Better Representation of Data: Generally, the traditional recommender
systems suffer from data sparsity issues because users usually have experi-
ence with only a small number of items. However, the rich representation of
entities and their connections in knowledge graphs alleviate this issue.
Alleviating Cold Start Issues: It becomes challenging for traditional
recommender systems to make recommendations when there are new users
or items in the data set. In knowledge graph-based recommender systems,
information about new items and users can be obtained through the rela-
tions between entities within knowledge graphs. For example, when a new
Science-Fiction movie such as “Tenet” is added to the data set of a movie
recommender system that employs knowledge graphs, the information about
“Tenet” can be gained by its relationship with the genre Science-Fiction
(gaining triplet (Tenet, has genre of, Sci-Fi)).
The Explainability of Recommendation: Users and the recommended
items are connected along with the links in knowledge graphs. Thereby, the
reasoning process can be easily illustrated by the propagation of knowledge
graphs.
3.2 Question-answering Systems
Question answering is one of the most central AI services, which aims to
search for the answers to natural language questions by analyzing the semantic
meanings (Dimitrakis et al,2020;Das et al,2022). The traditional question-
answering systems match the textual questions with the answers in the
unstructured text database. In the search process, the semantic relationship
between the question and answer is analyzed; then, the system matches the
questions and answers with the maximum semantic similarity. Finally, the
system outputs the answer. However, the answers are obtained by filtrating
massive unstructured data, which deteriorates the efficiency of the traditional
question-answering systems due to analyzing an enormous search space. To
solve this issue, a lot of research focuses on employing structured data for
question answering, particularly knowledge graph-based question-answering
systems (Singh et al,2020;Qiu et al,2020).
The sophisticated representation of information in knowledge graphs is a
natural fit for question-answering systems. Knowledge graph-based question-
answering systems typically analyze the user question and retrieve the portion
of knowledge graphs for answering. The answering task is facilitated either
by using similarity measures or by producing structured queries in standard
12 Knowledge Graphs: Opportunities and Challenges
Fig. 4 The illustration of knowledge graph based question-anwsering systems.
formats (e.g., SPARQL). Fig 4presents an example of the knowledge graph-
based question-answering system. The system answer “Shakespeare” is a node
that is linked to the node “Romeo”. The node “Romeo” is extracted from the
question.
There are two main types of questions in this space: simple and multi-
hop questions, respectively. Simple questions are answered only by referring
to a single triplet, while multi-hop questions require combining multiple enti-
ties and relations. Focusing on simple questions, Huang et al. (Huang et al,
2019) proposed a knowledge graph embedding-based question-answering sys-
tem (KEQA). They translated the question and its corresponding answer into
a single triplet. For instance, the question Which film acted by Leonardo”
and one of its answers “Inception” can be expressed as the following triplet:
(Leonard, act, I nception). Then, the head entity, relation, and tail entity of the
triplet are represented by a vector matrix in the embedding space for learning
the question-answer information. Considering the semantic meanings of the
questions, Shin et al. (Shin et al,2019) presented a predicate constraint-based
question-answering system (PCQA). They took advantage of the predicate
constraints of knowledge graphs, which is a triplet contains a subject, pred-
icate, and an object to capture the connection between the questions and
answers. Using the triplet for question-answering integration, the processing of
the question-answering service can be simplified; therefore, the result improves.
Bauer et al. (Bauer et al,2018) focused on multi-hop questions and pro-
posed a Multi-Hop Pointer-Generator Model (MHPGM). They selected the
relation edges that are related to the questions in a knowledge graph and
injected attention to achieve multi-hop question answering. Because of the
advantages of knowledge graphs’ structure, multi-hop question answering can
extract coherent answers effectively. Saxena et al. (Saxena et al,2020) proposed
EmbedKGQA to achieve multi-hop question answering over sparse knowl-
edge graphs (such as knowledge graphs with missing edges). The main idea
of EmbedKGQ is to utilize knowledge graph embeddings to reduce knowledge
Knowledge Graphs: Opportunities and Challenges 13
graph sparsity. It first creates embeddings of all entities and then selects the
embedding of a given question. Lastly, it predicts the answer by combining
these embeddings.
Compared to the traditional question answering, the advantages of knowl-
edge graph-based question-answering systems can be summarized as follows:
Increased Efficiency: Instead of searching for answers from massive tex-
tual data, which may contain a large volume of useless data items, knowledge
graph-based question-answering systems focus only on entities with rel-
evant properties and semantics. Therefore, they reduce the search space
significantly and extract the answers effectively and efficiently.
Multi-hop Question Answering: The answers can be more complex and
sophisticated than the ones produced with traditional methods relying on
unstructured data since they can combine several facts and concepts from
the knowledge graph via multi-hop question answering.
3.3 Information Retrieval
Information retrieval enables retrieval systems to match end-user queries with
relevant documents, such as web pages (Liu et al,2019). Traditional infor-
mation retrieval systems index the documents according to the user queries
and return the matched documents to the users (Hersh,2021). Nevertheless,
index processing is complex and requires plenty of time because of the massive-
ness and diversity of documents. As a result, traditional information retrieval
faces the challenge of inaccurate search results and potentially low efficiency.
Also, since search engines have limitations with respect to text interpretation
ability, keyword-based text search usually outputs limited results. Thus, to
address these problems, many modern search engines take advantage of knowl-
edge graphs (Bounhas et al,2020;Zheng et al,2020). Knowledge graph-based
information retrieval introduces a new research direction that takes advantage
of knowledge graphs for improving the performance of search engines and the
explainability of the results.
Typically, these systems rely on the advanced representation of the docu-
ments based on entities and relationships from knowledge graphs. These formal
and machine-readable representations are then matched to the user query for
retrieving the more pertinent documents. For instance, Wise et al. (Wise et al,
2020) proposed a COVID-19 Knowledge Graph (CKG) to extract the rela-
tionships between the scientific articles about COVID-19. In particular, they
combined the topological information of documents with the semantic meaning
to construct document knowledge graphs. Wang et al. (Wang et al,2018a) pro-
posed a knowledge graph-based information retrieval technology that extracts
entities by mining entity information on web pages via an open-source relation
extraction method. Then, the entities with relationships are linked to construct
a knowledge graph.
Knowledge graphs can also support methods for query expansion, which is
able to enrich the user query by adding relevant concepts (e.g., synonymous).
14 Knowledge Graphs: Opportunities and Challenges
For example, Dalton et al. (Dalton et al,2014) presented an entity query fea-
ture expansion (EQFE) to enrich the queries based on the query knowledge
graph, including structured attributes and text. Liu et al. (Liu et al,2018)
proposed the Entity-Duet Neural Ranking Model (EDRM). EDRM integrates
the semantics extracted from knowledge graphs with the distributed represen-
tations of entities in queries and documents. Then, it ranks the search results
using interaction-based neural ranking networks.
Compared to traditional information retrieval, the knowledge graph-based
information retrieval has the following advantages:
Semantic Representation of Items: Items are represented according to
a formal and interlinked model that supports semantic similarity, reason-
ing, and query expansion. This typically allows the system to retrieve more
relevant items and makes the system more interpretable.
High Search Efficiency: Knowledge graph-based information retrieval can
use the advanced representation of the items to reduce the search space sig-
nificantly (e.g., discarding documents that use the same terms with different
meanings), resulting in improved efficiency.
Accurate Retrieval Results: In knowledge graph-based information
retrieval, the correlation between query and documents is analyzed based on
the relations between entities in the knowledge graph. This is more accurate
than finding the similarities between queries and documents.
4 Applications and Potentials
In this section, we discuss the applications and potentials of knowledge
graphs in four domains: education, scientific research, social networks, and
health/medical care. Although some researchers try to take advantage of
knowledge graphs to develop beneficial applications in other domains such
as finance (Cheng et al,2022c), the knowledge graph-based intelligent ser-
vice in these areas is relatively obscure and still needs to be explored.
Therefore, this section mainly focuses on education, scientific research, social
networks, and medical care to summarize the opportunities of knowledge
graphs. Table 2 presents several recent applications of knowledge graphs that
make contributions to these fields.
4.1 Education
Education is of great importance to the development of human society. Many
studies have focused on deploying intelligent applications to improve the qual-
ity of education (Bai et al,2021;Wang et al,2020d). Specifically, in the
age of big data, data processing becomes a challenging task because of the
complex and unstructured educational data. Thereby, intelligent educational
systems tend to apply structured data, such as knowledge graphs. Several
Knowledge Graphs: Opportunities and Challenges 15
Table 2 Fields of applications of knowledge graphs.
Fields Applications Methods Functions
Education Knowledge Graph based
Course Management
Model (Aliyu et al,2020)
Course knowledge graphs Courses management; Generation of
course allocation schedule
KnowEdu (Chen et al,
2018)
Instructional concepts extrac-
tion; Educational relation iden-
tification
Educational knowledge graph con-
struction
Knowledge Graph-based
Tool for Online Learning
(Zablith,2022)
Integration of social media con-
tents and formal learning con-
tents
Efficient online knowledge acquisition
Scientific Research Scientific Publication
Management Model (Chi
et al,2018)
Knowledge graph based aca-
demic network
Scientific publication management
Reviewer Recommenda-
tion System Yong et al
(2021)
Knowledge graph-based rule
engine establishment
Precise matching of reviewer and
paper
Social Networks DEAP-FAKED (Mayank
et al,2021)
News-Entity knowledge graphs Fake news detection
GraphRec (Fan et al,
2019)
Information aggregation of
user-user and user-item graphs
Social Recommendation
Graph Reasoning Model
(Wang et al,2018d)
Knowledge graph propogation Social relationship extraction
Health/Medical
Care
SMR (Gong et al,2021) Medical knowledge graph
embeddings
Safe medicine recommendation
DETERRENT (Cui et al,
2020)
Knowledge guided graph atten-
tion network
Health misinformation detection
KGNN (Lin et al,2020) Mining the relationships
between drugs
Drug discovery
COVID-KG(Yuan and
Deng,2021)
Multimedia knowledge graph
construction
Drug discovery
16 Knowledge Graphs: Opportunities and Challenges
knowledge graph-based applications support the educational process, focus-
ing in particular on data processing and knowledge dissemination (Yao et al,
2020).
In education, the quality of offline school teaching is of vital importance.
Therefore, several knowledge graph-based applications focus on supporting
teaching and learning. For example, considering the importance of course
allocation tasks in university, Aliyu et al. (Aliyu et al,2020) proposed a knowl-
edge graph-based course management approach to achieve automatic course
allocation. They constructed a course knowledge graph in which the entities
are courses, lecturers, course books, and authors in order to suggest rele-
vant courses to students. Chen et al.(Chen et al,2018) presented KnowEdu,
a system for educational knowledge graph construction, which automatically
builds knowledge graphs for learning and teaching in schools. First, KnowEdu
extracts the instructional concepts of the subjects and courses as the entity
features. Then, it identifies the educational relations based on the students’
assessments and activities to make the teaching effect more remarkable.
The abovementioned knowledge graph-based intelligent applications are
dedicated to improving the quality of offline school teaching. However, online
learning has become a hot trend recently. Moreover, online study is an indis-
pensable way of learning for students during the COVID-19 pandemic(Saraji
et al,2022). Struggling with confusing online content (e.g., learning content of
low quality on social media), students face major challenges in acquiring signif-
icant knowledge efficiently. Therefore, researchers have focused on improving
online learning environments by constructing education-efficient knowledge
graphs (d’Aquin,2016;Pereira et al,2017). For example, to facilitate online
learning and establish connections between formal learning and social media,
Zablith (Zablith,2022) proposed to construct a knowledge graph by integrating
social media and formal educational content, respectively. Then, the produced
knowledge graph can filter social media content, which is fruitful for formal
learning and help students with efficient online learning to some extent.
Offline school teaching and online learning are two essential parts of edu-
cation, and it is necessary to improve the quality of both to promote the
development of education. Significantly, knowledge graph-based intelligent
applications can deal with complicated educational data and make both offline
and online education more convenient and efficient.
4.2 Scientific Research
A variety of knowledge graphs focus on supporting the scientific process and
assisting researchers in exploring research knowledge and identifying rele-
vant materials (Xia et al,2016). They typically describe documents (e.g.,
research articles, patents), actors (e.g., authors, organizations), entities (e.g.,
topics, tasks, technologies), and other contextual information (e.g., projects,
funding) in an interlinked manner. For instance, Microsoft Academic Graph
(MAG) (Wang et al,2020a) is a heterogeneous knowledge graph. MAG
Knowledge Graphs: Opportunities and Challenges 17
contains the metadata of more than 248M scientific publications, includ-
ing citations, authors, institutions, journals, conferences, and fields of study.
The AMiner Graph (Zhang et al,2018) is the corpus of more than 200M
publications generated and used by the AMiner system1. The Open Aca-
demic Graph (OAG)2is a massive knowledge graph that integrates Microsoft
Academic Graph and AMiner Graph. AceKG (Wang et al,2018c) is a large-
scale knowledge graph that provides 3 billion triples of academic facts about
papers, authors, fields of study, venues, and institutes, as well as the relations
among them. The Artificial Intelligence Knowledge Graph (AI-KG) (Dess`ı
et al,2020)3describes 800K entities (e.g., tasks, methods, materials, metrics)
extracted from the 330K most cited articles in the field of AI. The Academi-
a/Industry Dynamics Knowledge Graph (AIDA KG) (Angioni et al,2021)4
describes 21M publications and 8M patents according to the research top-
ics drawn from the Computer Science Ontology (Salatino et al,2020) and 66
industrial sectors (e.g., automotive, financial, energy, electronics).
In addition to constructing academic knowledge graphs, many researchers
also take advantage of knowledge graphs to develop various applications ben-
eficial to scientific research. Chi et al. (Chi et al,2018) proposed a scientific
publication management model to help non-researchers learn methods for
sustainability from research thinking. They built a knowledge graph-based
academic network to manage scientific entities. The scientific entities, includ-
ing researchers, papers, journals, and organizations, are connected regarding
their properties. For the convenience of researchers, many scientific knowledge
graph-based recommender systems, including citation recommendation, col-
laboration recommendation, and reviewer recommendation, are put forward
(Shao et al,2021). For instance, Yong et al.(Yong et al,2021) designed a
knowledge graph-based reviewer assignment system to achieve precise match-
ing of reviewers and papers. Particularly, they matched knowledge graphs
and recommendation rules to establish a rule engine for the recommendation
process.
4.3 Social Networks
With the rapid growth of social media such as Facebook and Twitter, online
social networks have penetrated human life and bring plenty of benefits such
as social relationship establishment and convenient information acquisition
(Li et al,2020a;Hashemi and Hall,2020). Various social knowledge graphs
are modeled and applied to analyze the critical information from the social
network. These knowledge graphs are usually constituted based on the peo-
ple’s activities and their posts on social media, which are applied to numerous
applications for different functions (Xu et al,2020).
1AMiner - https://www.aminer.cn/
2Open Academic Graph - https://www.openacademic.ai/oag/
3AI-KG - https://w3id.org/aikg/
4AIDA - http://w3id.org/aida
18 Knowledge Graphs: Opportunities and Challenges
Remarkably, social media provides high chances for people to make friends
and gain personalized information. Furthermore, social media raises funda-
mental problems, such as how to recommend accurate content that interests
us and how to connect with persons interested in a common topic. To address
these issues, various studies have been proposed to match users with their
favorite content (or friends) for recommendation (Ying et al,2018). With the
increase in users’ demand, a number of researchers utilize knowledge graph-
based approaches for more precise recommendations (Gao et al,2020). A
representative example is GraphRec (a graph neural network framework for
social recommendations) proposed by Fan et al. (Fan et al,2019). They con-
sidered two kinds of social knowledge graphs: user-user and user-item graphs.
Then, they extracted information from the two knowledge graphs for the learn-
ing task. As a result, their model can provide accurate social recommendations
because it aggregates the social relationships of users and the interactions
between users and items.
In addition, people’s activities on social media reveal social relationships.
For example, we can learn about the relationships around a person through
his photos or comments on Twitter. Significantly, social relationship extrac-
tion assists companies in tracking users and enhancing the user experience.
Therefore, many works are devoted to social relationship extraction. Wang
et al. (Wang et al,2018d) propose a graph reasoning model to recognize the
social relationships of people in a picture that is posted on social media. Their
model enforces a particular function based on the social knowledge graph and
deep neural networks. In their method, they initialized the relation edges and
entity nodes with the features that are extracted from the semantic objects
in an image. Then, they employed GGNN to propagate the knowledge graph.
Therefore, they explored the relations of the people in the picture.
One of the biggest problems in this space is fake news (Zhang et al,2019a).
Online social media has become the principal platform for people to consume
news. Therefore, a considerable amount of research has been done for fake
news detection (Choi et al,2020;Meel and Vishwakarma,2020). Most recently,
Mayank et al. (Mayank et al,2021) exploited a knowledge graph-based model
called DEAP-FAKED to detect fake news on social media. Specifically, DEAP-
FAKED learns news content and identifies existing entities in the news as the
nodes of the knowledge graph. Afterward, a GNN-based technique is applied
to encode the entities and detect anomalies that may be linked with fake news.
4.4 Health/Medical Care
With medical information explosively growing, medical knowledge analysis
plays an instrumental role in different healthcare systems. Therefore, research
focuses on integrating medical information into knowledge graphs to empower
intelligent systems to understand and process medical knowledge quickly and
correctly (Li et al,2020b). Recently, a variety of biomedical knowledge graphs
have become available. Therefore, many medical care applications exploit
knowledge graphs. For instance, Zhang et al. (Zhang et al,2020a) presented a
Knowledge Graphs: Opportunities and Challenges 19
Health Knowledge Graph Builder (HKGB) to build medical knowledge graphs
with clinicians’ expertise.
Specifically, we discuss the three most common intelligent medical care
applications, including medical recommendation, health misinformation detec-
tion, and drug discovery. Firstly, with the rapid development of the medical
industry, medical choices have become more abundant. Nevertheless, in the
variety of medical choices, people often feel confused and unable to make the
right decision to get the most suitable and personalized medical treatment.
Therefore, medical recommender systems, especially biomedical knowledge
graph-based recommender systems (such as doctor recommender systems and
medicine recommender systems), have been put forward to deal with this
issue (Katzman et al,2018). Taking medicine recommendation as an example,
Gong et al. (Gong et al,2021) provided a medical knowledge graph embedding
method by constructing a heterogeneous graph whose nodes are medicines,
diseases, and patients to recommend accurate and safe medicine prescriptions
for complicated patients.
Secondly, although many healthcare platforms aim to provide accurate
medical information, health misinformation is an inevitable problem. Health
misinformation is defined as incorrect information that contradicts authen-
tic medical knowledge or biased information that covers only a part of the
facts (Wang et al,2020e). Unfortunately, a great deal of health-related infor-
mation on various healthcare platforms (e.g., medical information on social
media) is health misinformation. What’s worse, the wrong information leads to
consequential medical malpractice; therefore, it is urgent to detect health mis-
information. Utilizing authoritative medical knowledge graphs to detect and
filter misinformation can help people make correct treatment decisions and
suppress the spread of misinformation (Cui et al,2020). Representatively, Cui
et al. (Cui et al,2020) presented a model called DETERREN to detect health
misinformation. DETERREN leverages a knowledge-guided attention network
that incorporates an article-entity graph with a medical knowledge graph.
Lastly, drug discovery, such as drug repurposing and drug-drug interac-
tion prediction, has been a research trend for intelligent healthcare in recent
years. Benefiting from the rich entity information (e.g., the ingredients of a
drug) and relationship information (e.g., the interaction of drugs) in medi-
cal knowledge graphs, drug discovery based on knowledge graphs is one of
the most reliable approaches (MacLean,2021). Lin et al. (Lin et al,2020)
presented an end-to-end framework called KGNN (Knowledge Graph Neural
Network) for drug-drug interaction prediction. The main idea of KGNN is to
mine the relations between drugs and their potential neighborhoods in medical
knowledge graphs. It first exploits the topological information of each entity;
then, it aggregates all the neighborhood information from the local receptive
entities to extract both semantic relations and high-order structures. Wang
et al. (Wang et al,2020c) developed a knowledge discovery framework called
COVID-KG to generate COVID-19-related drug repurposing reports. They
first constructed multimedia knowledge graphs by extracting medicine-related
20 Knowledge Graphs: Opportunities and Challenges
entities and their relations from images and texts. Afterward, they utilized the
constructed knowledge graphs to generate drug repurposing reports.
5 Technical Challenges
Although knowledge graphs offer fantastic opportunities for various services
and applications, many challenges are yet to be addressed (Noy et al,2019).
Specifically, the limitations of existing knowledge graph technologies are the
key challenges for promoting the development of knowledge graphs (Hogan
et al,2021). Therefore, this section discusses the challenges of knowledge
graphs in terms of the limitations of five topical knowledge graph technolo-
gies, including knowledge graph embeddings, knowledge acquisition, knowledge
graph completion, knowledge fusion, and knowledge reasoning.
5.1 Knowledge Graph Embeddings
The aim of knowledge graph embeddings is to effectively represent knowledge
graphs in a low-dimensional vector space while still preserving the semantics
(Xia et al,2021;Vashishth et al,2020). Firstly, the entities and relations are
embedded into a dense dimensional space in a given knowledge graph, and a
scoring function is defined to measure the plausibility of each fact (triplet).
Then, the plausibility of the facts is maximized to obtain the entity and rela-
tion embeddings (Chaudhri et al,2022;Sun et al,2022). The representation of
knowledge graphs brings various benefits to downstream tasks. The three main
types of triplet fact-based knowledge graph embedding approaches are ten-
sor factorization-based, translation-based, and neural network-based methods
(Rossi et al,2021).
5.1.1 Tensor Factorization-based Methods
The core idea of tensor factorization-based methods is transforming the triplets
in the knowledge graph into a 3D tensor (Balaˇzevi´c et al,2019). As Fig 5
presents, the tensor X Rm×m×n, where mand nindicate the number of
entity and relation, respectively, contains nslices, and each slice corresponds
to one relation type. If the condition Xijk = 1 is met, the triplet (ei, rk, ej),
where eand rdenote entity and relation, respectively, exists in the knowledge
graph. Otherwise, if Xijk = 0, there is no such a triplet in the knowledge graph.
Then, the tensor is represented by the embedding matrices that consist of the
vectors of entities and relations.
5.1.2 Translation-based Methods
Translation-based methods exploit the scoring function, which is based on
translation invariance. Translation invariance interprets the distance between
the vectors of the two words, which is represented by the vector of their
semantic relationships (Mikolov et al,2013). Bordes et al. (Bordes et al,2013)
firstly utilized the translation invariance-based scoring functions to measure
Knowledge Graphs: Opportunities and Challenges 21
Table 3 Knowledge graph embedding methods.
Categories Techniques Evaluation Approaches Data Set Results
Tensor factorization-based
methods
RESCAL (Nickel et al,2011) Link prediction[Hits@10] FB15K 44.1%
HolE (Nickel et al,2016) Link prediction[Hits@10] FB15K 73.9%
ComplEx (Trouillon et al,2016) Link prediction[Hits@10] FB15K 84%
SimplE (Kazemi and Poole,2018) Link prediction[Hits@10] FB15K 83.8%
RotatE (Sun et al,2019a) Link prediction[Hits@10] FB15K 88.4%
QuatE (Zhang et al,2019c) Link prediction[Hits@10] FB15K 90%
Translation-based methods TransE (Bordes et al,2013) Link prediction[Hits@10] FB15K 47.1%
TransH (Wang et al,2014) Link prediction[Hits@10] FB15K 64.4%
TransR (Lin et al,2015) Link prediction[Hits@10] FB15K 68.7%
TransD (Ji et al,2015) Link prediction[Hits@10] FB15K 77.3%
TranSparse (Ji et al,2016) Link prediction[Hits@10] FB15K 79.9%
STransE (Nguyen et al,2016) Link prediction[Hits@10] FB15K 79.7%
TransA (Jia et al,2016) Link prediction[Hits@10] FB15K 80.4%
KG2E (He et al,2015) Link prediction[Hits@10] FB15K 71.5%
TransG (Xiao et al,2015) Link prediction[Hits@10] FB15K 88.2%
Neural network-based methods SME (Bordes et al,2014) Link prediction[Hits@10] FB15K 41.3 %
NTN (Socher et al,2013) Triplet classification[Accuracy] WN11 86.2%
SLM (Socher et al,2013) Triplet classification[Accuracy] WN11 76%
RMNN (Liu et al,2016) Triplet classification[Accuracy] WN11 89.9%
R-GCN (Schlichtkrull et al,2018) Link prediction[Hits@10] FB15K 84.2%
ConvKB (Nguyen et al,2017) Link prediction[Hits@10] WN18RR 52.5 %
KBGAN (Cai and Wang,2017) Link prediction[Hits@10] WN18 89.2%
(1) In this table, all the results of link prediction are filter results.
22 Knowledge Graphs: Opportunities and Challenges
Fig. 5 An illustration of tensor factorization of knowledge graphs.
the embedding results. They creatively proposed the TransE model, which
translates all the entities and relations of a knowledge graph into a continuous
and low vector space. Specifically, the vectors of the head and tail entities in a
triplet are connected by the vector of their relation. Consequently, in the vec-
tor space, the semantic meaning of every triplet is preserved. Formally, given a
triplet (head, relation, tail), the embedding vectors of the head entity, relation,
and tail entity are h,r, and t, respectively. In the vector space, the plausibility
of the triplet (h,r,t) is computed by the translation invariance-based scoring
function to ensure it follows the geometric principle: h+rt.
After TransE, a lot of related extensions, such as TransH (Wang et al,
2014) and TransR (Lin et al,2015), are continually proposed to improve the
performance of the Translation-based knowledge graph embeddings.
5.1.3 Neural Network-based Methods
Nowadays, deep learning has become a popular tool that is utilized for knowl-
edge graph embeddings, and a considerable amount of research proposes to
employ neural networks to represent the triplets of knowledge graphs (Dai et al,
2020a). In this section, we discuss three representative works, including SME,
ConvKB, and R-GCN, to briefly introduce neural network-based knowledge
graph embeddings.
SME (Bordes et al,2014) designs an energy function to conduct semantic
matching, which utilizes neural networks to measure the confidence of each
triplet (h, r, t) in knowledge graphs. The scoring function of SME is defined as
follows:
fr(h, t)=(Wh1h+Wh2r+bh)>(Wt1t+Wt2r+bt).(1)
The scoring function of SME (bilinear) is:
fr(h, t) = ((Wh1h)(Wh2r) + bh)>((Wt1t)(Wt2r) + bt).(2)
Knowledge Graphs: Opportunities and Challenges 23
Here WRd×ddenotes the weight matrix, bindicates the bias vector. h,
r, and tare the embedding vectors of head entity, relation, and tail entity,
respectively.
ConvKB (Nguyen et al,2017) utilizes a convolutional neural network
(CNN) to conduct knowledge graph embeddings. ConvKB represents each
triplet (h, r, t) as a three-row matrix A, which is input to a convolution layer
to obtain feature maps. Afterward, the feature maps are concatenated as a
vector, and then a score is calculated to estimate the confidence of the triplet.
The scoring function is as follows:
fr(h, t) = O(g(AΩ))w,(3)
where Osignifies the concatenation operator, g(·) is the ReLU activation func-
tion, A indicates the convolution operation of matrix Aby using the filters
in the set Ω, wR3dis a weight vector.
R-GCN (Schlichtkrull et al,2018) is an improvement of graph neural
networks (GNNs). R-GCN represents knowledge graphs by providing relation-
specific transformation. Its forward propagation is calculated as follows:
h(l+1)
k=σX
rRX
iNr
k
1
nk,r
W(l)
ih(l)
i+W(l)
kh(l)
k,(4)
where h(l+1)
kis the hidden state of the entity kin l-th layer, Nr
kdenotes a
neighbor collection of entity kand relation rR,nk,r is the normalization
process, W(l)
iand W(l)
kare the weight matrices.
5.1.4 Limitations of Existing Methods
The existing methods for generating knowledge graph embeddings still suf-
fer several severe limitations. Many established methods only consider surface
facts (triplets) of knowledge graphs. However, additional information, such as
entity types and relation paths, are ignored, which can further improve the
embedding accuracy. The performance of most traditional methods that do not
consider the additional information is unsatisfactory. Table 3lists the embed-
ding methods, which do not consider the additional information. In Table 3, the
performance evaluation is based on the link prediction and triplet classification
tasks. The metrics that are for evaluation results are hit rate at 10 (Hits@10)
and accuracy. As Table 3presents, only a few models have impressive results,
including the results of QuatE (90%), RMNN (89.9%), and KBGAN (89.2%).
Recently, some researchers have started to combine additional information
with a knowledge graph to improve the efficiency of embedding models. For
example, Guo et al. (Guo et al,2015) take advantage of additional entity type
information, which is the semantic category of each entity, to obtain the cor-
relation between the entities and to tackle the data sparsity issue. Therefore,
knowledge graphs are represented more accurately. Not only entity types, some
24 Knowledge Graphs: Opportunities and Challenges
other information, including relation paths (Li et al,2021), time information of
dynamic graphs (Messner et al,2022), and textual descriptions of entities (An
et al,2018), are getting the researchers’ attention in recent years. However, it
is still a daunting challenge to effectively utilize rich additional information to
improve the accuracy of knowledge graph embeddings.
General additional information can not adequately represent the semantic
meaning of the triplets. For instance, the entity types are not related to the
semantic information of triplets. Furthermore, the types of additional infor-
mation that can be incorporated into the features of the triplets are now
severely limited. Therefore, to improve the performance of existing knowledge
graph embedding methods, multivariate information (such as the hierarchi-
cal descriptions of relations and the combination of entity types and textual
descriptions) needs to be incorporated into the features of the triplets.
To the best of our knowledge, complex relation path remains an open
research problem (Peng et al,2021). For example, the inherent relations,
referring to the indirect relationships between two unconnected entities,
are not represented effectively. Although the inherent relations between the
entities can be explored based on the chain of relationships in knowledge
graphs, the inherent relations are complex and multiple. Therefore, it is not
straightforward to represent these relations effectively.
5.2 Knowledge Acquisition
Knowledge acquisition is a critical step for combining data from different
sources and generating new knowledge graphs. The knowledge is extracted
from both structured and unstructured data. Three main methods of knowl-
edge acquisition are relation extraction, entity extraction, and attribute
extraction (Fu et al,2019). Here, attribute extraction can be regarded as a spe-
cial case of entity extraction. Zhang et al. (Zhang et al,2019b) took advantage
of knowledge graph embeddings and graph convolution networks to extract
long-tail relations. Shi et al. (Shi et al,2021) proposed entity set expansion to
construct large-scale knowledge graphs.
Nevertheless, existing methods for knowledge acquisition still face the chal-
lenge of low accuracy, which could result in incomplete or noisy knowledge
graphs and hinder the downstream tasks. Therefore, the first critical issue
regards the reliability of knowledge acquisition tools and their evaluation. In
addition, a domain-specific knowledge graph schema is knowledge-oriented,
while a constructed knowledge graph schema is data-oriented for covering all
data features (Zhou et al,2022). Therefore, it is inefficient to produce domain-
specific knowledge graphs by extracting entities and properties from raw data.
Hence, it is an essential issue to efficiently achieve knowledge acquisition tasks
by generating domain-specific knowledge graphs.
Besides, most existing knowledge acquisition methods focus on construct-
ing knowledge graphs with one specific language. However, in order to make
the information in knowledge graphs richer and more comprehensive, we
need cross-lingual entity extraction. It is thus vitally important to give more
Knowledge Graphs: Opportunities and Challenges 25
attention to cross-lingual entity extraction and the generation of multilingual
knowledge graphs. For example, Bekoulis et al.(Bekoulis et al,2018) proposed
a joint neural model for cross-lingual (English and Dutch) entity and relation
extraction. Nevertheless, multilingual knowledge graph construction is still a
daunting task since non-English training data sets are limited, language trans-
lation systems are not always accurate, and the cross-lingual entity extraction
models have to be retrained for each new language.
Multi-modal knowledge graph construction is regarded as another chal-
lenging issue of knowledge acquisition. The existing knowledge graphs are
mostly represented by pure symbols, which could result in the poor capabil-
ity of machines to understand our real world (Zhu et al,2022b). Therefore,
many researchers focus on multi-modal knowledge graphs with various entities,
such as texts and images. The construction of multi-modal knowledge graphs
requires the exploration of entities with different modalities, which makes the
knowledge acquisition tasks complicated and inefficient.
5.3 Knowledge Graph Completion
Knowledge graphs are often incomplete, i.e., missing several relevant triplets
and entities (Zhang et al,2020b). For instance, in Freebase, one of the most
well-known knowledge graphs, more than half of person entities do not have
information about their birthplaces and parents. Generally, semi-automated
and human leveraging mechanisms, which can be applied to ensure the qual-
ity of knowledge graphs, are essential tools for the evaluation of knowledge
graph completion. Specifically, human supervision is currently considered the
gold standard evaluation in knowledge graph completion (Ballandies and
Pournaras,2021).
Knowledge graph completion aims to expand existing knowledge graphs by
adding new triplets using techniques for link prediction (Wang et al,2020b;
Akrami et al,2020) and entity prediction (Ji et al,2021). These approaches
typically train a machine learning model on the knowledge graph to assess
the plausibility of new candidate triplets. Then, they add the candidate
triplets with high plausibility to the graph. For example, for an incomplete
triplet (Tom, friendOf, ?), it is possible to assess the range of tails and
return the more plausible ones to enrich the knowledge graph. These models
successfully utilized knowledge graphs in many different domains, including
digital libraries (Yao et al,2017), biomedical (Harnoune et al,2021), social
media (Abu-Salih,2021), and scientific research (Nayyeri et al,2021). Some
new methods are able to process fuzzy knowledge graphs in which each triple
is associated with a confidence value (Chen et al,2019).
However, most current knowledge graph completion methods only focus on
extracting triplets from a closed-world data source. That means the generated
triplets are new, but the entities or relations in the triplets need to already
exist in the knowledge graph. For example, for the incomplete triplet (Tom,
friendOf, ?), predicting the triplet (Tom, friendOf, Jerry) is only possible if
the entity Jerry is already in the knowledge graph. Because of this limitation,
26 Knowledge Graphs: Opportunities and Challenges
these methods cannot add new entities and relations to the knowledge graph.
To tackle this issue, we are starting to see the emergence of open-world tech-
niques for knowledge graph completion that extracts potential objects from
outside of the existing knowledge bases. For instance, the ConMask model
(Shi and Weninger,2018) has been proposed to predict the unseen entities in
knowledge graphs. However, methods for open-world knowledge graph com-
pletion still suffer from low accuracy. The main reason is that the data source
is usually more complex and noisy. In addition, the similarity of the predicted
new entities to the existing entities can mislead the results. In other words, two
similar entities are regarded as connected entities, while they may not have a
direct relationship.
Knowledge graph completion methods assume knowledge graphs are static
and fail to capture the dynamic evolution of knowledge graphs. To obtain accu-
rate facts over time, temporal knowledge graph completion, which considers
the temporal information reflecting the validity of knowledge, has emerged.
Compared to static knowledge graph completion, temporal knowledge graph
completion methods integrate timestamps into the learning process. Hence,
they explore the time-sensitive facts and improve the link prediction accuracy
significantly. Although temporal knowledge graph completion methods have
shown brilliant performance, they still face serious challenges. Because these
models consider time information would be less efficient (Shao et al,2022), the
key challenge of temporal knowledge graph completion is how to effectively
incorporate timestamps of facts into the learning models and properly capture
the temporal dynamics of facts.
5.4 Knowledge Fusion
Knowledge fusion aims to combine and integrate knowledge from different data
sources. It is often a necessary step for the generation of knowledge graphs
(Nguyen et al,2020;Smirnov and Levashova,2019). The primary method of
knowledge fusion is entity alignment or ontology alignment (Ren et al,2021),
which aims to match the same entity from multiple knowledge graphs (Zhao
et al,2020). Achieving efficient and accurate knowledge graph fusion is a
challenging task because of the complexity, variety, and large volume of data
available today.
While a lot of work has been done in this direction, there are still several
intriguing research directions that deserve to be investigated in the future.
One of them regards cross-language knowledge fusion (Mao et al,2020), which
allows the integration of information from different languages. This is often
used to support cross-lingual recommender systems (Javed et al,2021). For
example, Xu et al. (Xu et al,2019) adopted a graph-matching neural net-
work to achieve cross-language entity alignment. However, the result of the
cross-language knowledge fusion is still unsatisfactory because the accuracy of
the matching entities from different languages is relatively low. Therefore, it
remains a daunting challenge to explore cross-language knowledge fusion.
Knowledge Graphs: Opportunities and Challenges 27
Another primary challenge regards entity disambiguation (Nguyen et al,
2020). As the polysemy problem of natural language, the same entity may
have various expressions in different knowledge graphs. Hence, entity disam-
biguation is required before conducting entity alignment. Existing entity dis-
ambiguation methods mainly focus on discriminating and matching ambiguous
entities based on extracting knowledge from texts containing rich contextual
information (Zhu and Iglesias,2018). However, these methods can not pre-
cisely measure the semantic similarity of entities when the texts are short and
have limited contextual information. Only a few works have focused on solv-
ing this issue. For example, Zhu and Iglesias (Zhu and Iglesias,2018) have
proposed SCSNED for entity disambiguation. SCSNED measures semantic
similarity based on both informative words of entities in knowledge graphs
and contextual information in short texts. Although SCSNED alleviates the
issue of limited contextual information to some extent, more effort is needed
to improve the performance of entity disambiguation.
In addition, many knowledge fusion methods only focus on matching
entities with the same modality and ignore multi-modal scenes in which knowl-
edge is presented in different forms. Specifically, entity alignment considering
only single-modality knowledge graph scenario has insignificant performance
because it can not fully reflect the relationships of entities in the real world
(Cheng et al,2022a). Recently, to solve this issue, some studies have proposed
multi-modal knowledge fusion, which matches the same entities having differ-
ent modalities and generates a multi-modal knowledge graph. For example,
HMEA (Guo et al,2021) aligns entities with multiple forms by mapping multi-
modal representations into hyperbolic space. Although many researchers have
worked on multi-modal knowledge fusion, it is still a critical task. Multi-modal
knowledge fusion mainly aims to find equivalent entities by integrating their
multi-modal features (Cheng et al,2022a). Nevertheless, how to efficiently
incorporate the features having multiple modalities is still a tricky issue facing
current methods.
5.5 Knowledge Reasoning
The goal of knowledge reasoning is to infer new knowledge, such as the implicit
relations between two entities (Liu et al,2021;Wang et al,2019c), based on
existing data. For a given knowledge graph, wherein there are two unconnected
entities hand t, denoted as h, t G, here Gmeans the knowledge graph,
knowledge reasoning can find out the potential relation rbetween these enti-
ties and form a new triplet (h, r, t). The knowledge reasoning methods are
mainly categorized into logic rule-based (De Meester et al,2021), distributed
representation-based (Chen et al,2020b), and neural network-based methods
(Xiong et al,2017). Logic rule-based knowledge reasoning aims to discover
knowledge according to the random walk and logic rules, while distributed
representation-based knowledge reasoning embeds entities and relations into a
vector space to obtain distributed representation (Chen et al,2020b). Neural
28 Knowledge Graphs: Opportunities and Challenges
network-based knowledge reasoning method utilizes neural networks to infer
new triplets given the body of knowledge in the graph (Xian et al,2019).
There are two tasks in knowledge reasoning: single-hop prediction and
multi-hop reasoning (Ren et al,2022). Single-hop prediction predicts one ele-
ment of a triplet for the given two elements, while multi-hop reasoning predicts
one or more elements in a multi-hop logical query. In other words, in the
multi-hop reasoning scenario, finding the answer to a typical question and
forming new triplets requires the prediction and imputation of multiple edges
and nodes. Multi-hop reasoning achieves a more precise formation of triplets
when compared with the single-hop prediction. Therefore, multi-hop reasoning
has attracted more attention and become a critical need for the develop-
ment of knowledge graphs in recent years. Although many works have been
done, multi-hop reasoning over knowledge graphs remains largely unexplored.
Notably, multi-hop reasoning on massive knowledge graphs is one of the chal-
lenging tasks (Zhu et al,2022a). For instance, most recent studies focus on
multi-hop reasoning over knowledge graphs, which have only 63K entities and
592K relations. The existing models can’t learn the training set effectively for
a massive knowledge graph that has more than millions of entities. Moreover,
multi-hop reasoning needs to traverse multiple relations and intermediate enti-
ties in the knowledge graph, which could lead to exponential computation cost
(Zhang et al,2021). Therefore, it is still a daunting task to explore multi-hop
knowledge reasoning.
Besides, the verification of inferred new knowledge is also a critical issue.
Knowledge reasoning enriches existing knowledge graphs and brings benefits to
the downstream tasks (Wan et al,2021). However, the inferred new knowledge
is sometimes uncertain, and the veracity of new triplets needs to be veri-
fied. Furthermore, the conflicts between new and existing knowledge should be
detected. To address these problems, some research has proposed multi-source
knowledge reasoning (Zhao et al,2020) that detects erroneous knowledge and
conflicting knowledge. Overall, more attention should be paid to multi-source
knowledge reasoning and erroneous knowledge reduction.
6 Conclusion
Knowledge graphs have played an instrumental role in creating many intelli-
gent services and applications for various fields. In this survey, we provided
an overview of knowledge graphs in terms of opportunities and challenges.
We first introduced the definitions and existing research directions regarding
knowledge graphs to provide an introductory analysis of knowledge graphs.
Afterward, we discussed AI systems that take advantage of knowledge graphs.
Then, we presented some representative knowledge graph applications in sev-
eral fields. Furthermore, we analyzed the limitations of current knowledge
graph technologies, which lead to severe technical challenges. We expect this
survey to spark new ideas and insightful perspectives for future research and
development activities involving knowledge graphs.
Knowledge Graphs: Opportunities and Challenges 29
Declarations
Conflict of interest. The authors declare that they have no compet-
ing financial interests or personal relationships that could have appeared to
influence the work reported in this paper.
References
Abu-Salih B (2021) Domain-specific knowledge graphs: A survey. Journal of
Network and Computer Applications 185:103,076
Akrami F, Saeef MS, Zhang Q, et al (2020) Realistic re-evaluation of knowledge
graph completion methods: An experimental study. In: Proceedings of the
2020 ACM SIGMOD International Conference on Management of Data, pp
1995–2010
Aliyu I, Kana A, Aliyu S (2020) Development of knowledge graph for university
courses management. International Journal of Education and Management
Engineering 10(2):1
An B, Chen B, Han X, et al (2018) Accurate text-enhanced knowledge graph
representation learning. In: Proceedings of the 2018 Conference of the North
American Chapter of the Association for Computational Linguistics: Human
Language Technologies, Volume 1 (Long Papers), pp 745–755
Angioni S, Salatino A, Osborne F, et al (2021) Aida: a knowledge graph about
research dynamics in academia and industry. Quantitative Science Studies
pp 1–43
Auer S, Bizer C, Kobilarov G, et al (2007) Dbpedia: A nucleus for a web of
open data. In: The semantic web. Springer, p 722–735
Bai X, Wang M, Lee I, et al (2019) Scientific paper recommendation: A survey.
Ieee Access 7:9324–9339
Bai X, Zhang F, Li J, et al (2021) Educational big data: Prediction,
applications and challenges. Big Data Research 26(100270)
Baken N (2020) Linked data for smart homes: Comparing rdf and labeled
property graphs. In: LDAC2020—8th Linked Data in Architecture and
Construction Workshop, pp 23–36
Balaˇzevi´c I, Allen C, Hospedales TM (2019) Tucker: Tensor factorization for
knowledge graph completion. arXiv preprint arXiv:190109590
Ballandies MC, Pournaras E (2021) Mobile link prediction: Automated cre-
ation and crowdsourced validation of knowledge graphs. Microprocessors
30 Knowledge Graphs: Opportunities and Challenges
and Microsystems 87:104,335
Bauer L, Wang Y, Bansal M (2018) Commonsense for generative multi-hop
question answering tasks. arXiv preprint arXiv:180906309
Bekoulis G, Deleu J, Demeester T, et al (2018) Joint entity recognition and
relation extraction as a multi-head selection problem. Expert Systems with
Applications 114:34–45
Bollacker K, Evans C, Paritosh P, et al (2008) Freebase: a collaboratively
created graph database for structuring human knowledge. In: Proceedings of
the 2008 ACM SIGMOD international conference on Management of data,
pp 1247–1250
Bordes A, Weston J, Collobert R, et al (2011) Learning structured embed-
dings of knowledge bases. In: Twenty-Fifth AAAI Conference on Artificial
Intelligence
Bordes A, Usunier N, Garcia-Duran A, et al (2013) Translating embeddings for
modeling multi-relational data. Advances in neural information processing
systems 26
Bordes A, Glorot X, Weston J, et al (2014) A semantic matching energy func-
tion for learning with multi-relational data. Machine Learning 94(2):233–259
Bounhas I, Soudani N, Slimani Y (2020) Building a morpho-semantic knowl-
edge graph for arabic information retrieval. Information Processing &
Management 57(6):102,124
Cai L, Wang WY (2017) Kbgan: Adversarial learning for knowledge graph
embeddings. arXiv preprint arXiv:171104071
Chaudhri V, Baru C, Chittar N, et al (2022) Knowledge graphs: Introduction,
history and, perspectives. AI Magazine 43(1):17–29
Chen P, Lu Y, Zheng VW, et al (2018) Knowedu: A system to construct
knowledge graph for education. Ieee Access 6:31,553–31,563
Chen R, Chen T, Hui X, et al (2020a) Knowledge graph transfer network for
few-shot recognition. In: Proceedings of the AAAI Conference on Artificial
Intelligence, pp 10,575–10,582
Chen X, Chen M, Shi W, et al (2019) Embedding uncertain knowledge graphs.
In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 3363–
3370
Chen X, Jia S, Xiang Y (2020b) A review: Knowledge reasoning over knowledge
graph. Expert Systems with Applications 141:112,948
Knowledge Graphs: Opportunities and Challenges 31
Chen YC, Hui L, Thaipisutikul T, et al (2020c) A collaborative filtering recom-
mendation system with dynamic time decay. The Journal of Supercomputing
pp 1–19
Cheng B, Zhu J, Guo M (2022a) Multijaf: Multi-modal joint entity alignment
framework for multi-modal knowledge graph. Neurocomputing
Cheng D, Yang F, Xiang S, et al (2022b) Financial time series forecasting with
multi-modality graph neural network. Pattern Recognition 121:108,218
Cheng D, Yang F, Xiang S, et al (2022c) Financial time series forecasting with
multi-modality graph neural network. Pattern Recognition 121:108,218
Chi Y, Qin Y, Song R, et al (2018) Knowledge graph in smart education: A case
study of entrepreneurship scientific publication management. Sustainability
10(4):995
Choi D, Chun S, Oh H, et al (2020) Rumor propagation is amplified by echo
chambers in social media. Scientific reports 10(1):1–10
Cui L, Seo H, Tabar M, et al (2020) Deterrent: Knowledge guided graph atten-
tion network for detecting healthcare misinformation. In: Proceedings of the
26th ACM SIGKDD international conference on knowledge discovery & data
mining, pp 492–502
Dai Y, Wang S, Chen X, et al (2020a) Generative adversarial networks based
on wasserstein distance for knowledge graph embeddings. Knowledge-Based
Systems 190:105,165
Dai Y, Wang S, Xiong NN, et al (2020b) A survey on knowledge graph
embedding: Approaches, applications and benchmarks. Electronics 9(5):750
Dalton J, Dietz L, Allan J (2014) Entity query feature expansion using knowl-
edge base links. In: Proceedings of the 37th international ACM SIGIR
conference on Research & development in information retrieval, pp 365–374
Das A, Mandal J, Danial Z, et al (2022) An improvement of bengali factoid
question answering system using unsupervised statistical methods. adhan¯a
47(1):1–14
De Meester B, Heyvaert P, Arndt D, et al (2021) Rdf graph validation using
rule-based reasoning. Semantic Web (Preprint):1–26
Dess`ı D, Osborne F, Recupero DR, et al (2020) AI-KG: an automatically gen-
erated knowledge graph of artificial intelligence. In: ISWC 2020, vol 12507.
Springer, pp 127–143
32 Knowledge Graphs: Opportunities and Challenges
Dimitrakis E, Sgontzos K, Tzitzikas Y (2020) A survey on question answering
systems over linked data and documents. Journal of Intelligent Information
Systems 55(2):233–259
d’Aquin M (2016) On the use of linked open data in education: Current and
future practices. In: Open data for education. Springer, p 3–15
Ehrlinger L, W (2016) Towards a definition of knowledge graphs.
SEMANTiCS (Posters, Demos, SuCCESS) 48(1-4):2
Fan W, Ma Y, Li Q, et al (2019) Graph neural networks for social recommen-
dation. In: The world wide web conference, pp 417–426
arber M, Bartscherer F, Menne C, et al (2018) Linked data quality of dbpedia,
freebase, opencyc, wikidata, and yago. Semantic Web 9(1):77–129
Farf´an F, Hristidis V, Ranganathan A, et al (2009) Xontorank: Ontology-aware
search of electronic medical records. In: Proceedings of the 25th International
Conference on Data Engineering, ICDE 2009, March 29 2009 - April 2 2009,
Shanghai, China. IEEE Computer Society, pp 820–831
Fu TJ, Li PH, Ma WY (2019) Graphrel: Modeling text as relational graphs
for joint entity and relation extraction. In: Proceedings of the 57th Annual
Meeting of the Association for Computational Linguistics, pp 1409–1418
Gao Y, Li YF, Lin Y, et al (2020) Deep learning on knowledge graph for
recommender system: A survey. arXiv preprint arXiv:200400387
omez E, Zhang CS, Boratto L, et al (2022) Enabling cross-continent provider
fairness in educational recommender systems. Future Generation Computer
Systems 127:435–447
Gong F, Wang M, Wang H, et al (2021) Smr: Medical knowledge graph
embedding for safe medicine recommendation. Big Data Research 23:100,174
Guo H, Tang J, Zeng W, et al (2021) Multi-modal entity alignment in
hyperbolic space. Neurocomputing 461:598–607
Guo Q, Zhuang F, Qin C, et al (2020) A survey on knowledge graph-
based recommender systems. IEEE Transactions on Knowledge and Data
Engineering
Guo S, Wang Q, Wang B, et al (2015) Semantically smooth knowledge graph
embedding. In: Proceedings of the 53rd Annual Meeting of the Association
for Computational Linguistics and the 7th International Joint Conference
on Natural Language Processing (Volume 1: Long Papers), pp 84–94
Knowledge Graphs: Opportunities and Challenges 33
Harnoune A, Rhanoui M, Mikram M, et al (2021) Bert based clinical knowl-
edge extraction for biomedical knowledge graph construction and analysis.
Computer Methods and Programs in Biomedicine Update 1:100,042
Hashemi M, Hall M (2020) Multi-label classification and knowledge extrac-
tion from oncology-related content on online social networks. Artificial
Intelligence Review 53(8):5957–5994
He S, Liu K, Ji G, et al (2015) Learning to represent knowledge graphs with
gaussian embedding. In: Proceedings of the 24th ACM international on
conference on information and knowledge management, pp 623–632
Hersh W (2021) Information retrieval. In: Biomedical Informatics. Springer, p
755–794
Hogan A, Blomqvist E, Cochez M, et al (2021) Knowledge graphs. ACM
Computing Surveys (CSUR) 54(4):1–37
Huang X, Zhang J, Li D, et al (2019) Knowledge graph embedding based
question answering. In: Proceedings of the Twelfth ACM International
Conference on Web Search and Data Mining, pp 105–113
Javed U, Shaukat K, Hameed IA, et al (2021) A review of content-based and
context-based recommendation systems. International Journal of Emerging
Technologies in Learning (iJET) 16(3):274–306
Ji G, He S, Xu L, et al (2015) Knowledge graph embedding via dynamic map-
ping matrix. In: Proceedings of the 53rd Annual Meeting of the Association
for Computational Linguistics and the 7th International Joint Conference
on Natural Language Processing (Volume 1: Long Papers), pp 687–696
Ji G, Liu K, He S, et al (2016) Knowledge graph completion with adap-
tive sparse transfer matrix. In: Thirtieth AAAI conference on artificial
intelligence
Ji S, Pan S, Cambria E, et al (2021) A survey on knowledge graphs: Represen-
tation, acquisition, and applications. IEEE Transactions on Neural Networks
and Learning Systems
Jia Y, Wang Y, Lin H, et al (2016) Locally adaptive translation for knowledge
graph embedding. In: Thirtieth AAAI conference on artificial intelligence
Katzman JL, Shaham U, Cloninger A, et al (2018) Deepsurv: personalized
treatment recommender system using a cox proportional hazards deep neural
network. BMC medical research methodology 18(1):1–12
34 Knowledge Graphs: Opportunities and Challenges
Kazemi SM, Poole D (2018) Simple embedding for link prediction in knowledge
graphs. Advances in neural information processing systems 31
Ko H, Witherell P, Lu Y, et al (2021) Machine learning and knowledge
graph based design rule construction for additive manufacturing. Additive
Manufacturing 37:101,620
Kong Y, Liu X, Zhao Z, et al (2022) Bolt defect classification algorithm based
on knowledge graph and feature fusion. Energy Reports 8:856–863
Li J, Cai T, Deng K, et al (2020a) Community-diversified influence maximiza-
tion in social networks. Information Systems 92:101,522
Li L, Wang P, Yan J, et al (2020b) Real-world data medical knowledge graph:
construction and applications. Artificial intelligence in medicine 103:101,817
Li Z, Liu H, Zhang Z, et al (2021) Learning knowledge graph embedding with
heterogeneous relation attention networks. IEEE Transactions on Neural
Networks and Learning Systems
Liang B, Su H, Gui L, et al (2022) Aspect-based sentiment analysis via affec-
tive knowledge enhanced graph convolutional networks. Knowledge-Based
Systems 235:107,643
Lin X, Quan Z, Wang ZJ, et al (2020) Kgnn: Knowledge graph neural network
for drug-drug interaction prediction. In: IJCAI, pp 2739–2745
Lin Y, Liu Z, Sun M, et al (2015) Learning entity and relation embeddings for
knowledge graph completion. In: Twenty-ninth AAAI conference on artificial
intelligence
Liu J, Kong X, Zhou X, et al (2019) Data mining and information retrieval
in the 21st century: A bibliographic review. Computer science review
34:100,193
Liu J, Ren J, Zheng W, et al (2020) Web of scholars: A scholar knowledge
graph. In: Proceedings of the 43rd International ACM SIGIR Conference on
Research and Development in Information Retrieval, pp 2153–2156
Liu J, Xia F, Wang L, et al (2021) Shifu2: A network representation learning
based model for advisor-advisee relationship mining. IEEE Transactions on
Knowledge and Data Engineering 33(4):1763–1777
Liu Q, Jiang H, Evdokimov A, et al (2016) Probabilistic reasoning via deep
learning: Neural association models. arXiv preprint arXiv:160307704
Liu Z, Xiong C, Sun M, et al (2018) Entity-duet neural ranking: Understanding
the role of knowledge graph semantics in neural information retrieval. arXiv
Knowledge Graphs: Opportunities and Challenges 35
preprint arXiv:180507591
MacLean F (2021) Knowledge graphs and their applications in drug discovery.
Expert opinion on drug discovery 16(9):1057–1069
Mao X, Wang W, Xu H, et al (2020) Mraea: an efficient and robust entity
alignment approach for cross-lingual knowledge graph. In: Proceedings of the
13th International Conference on Web Search and Data Mining, pp 420–428
Mayank M, Sharma S, Sharma R (2021) Deap-faked: Knowledge graph based
approach for fake news detection. arXiv preprint arXiv:210710648
Meel P, Vishwakarma DK (2020) Fake news, rumor, information pollution in
social media and web: A contemporary survey of state-of-the-arts, challenges
and opportunities. Expert Systems with Applications 153:112,986
Messner J, Abboud R, Ceylan II (2022) Temporal knowledge graph completion
using box embeddings. In: Proceedings of the AAAI Conference on Artificial
Intelligence, pp 7779–7787
Mikolov T, Chen K, Corrado G, et al (2013) Efficient estimation of word
representations in vector space. arXiv preprint arXiv:13013781
Minervini P, Boˇsnjak M, Rockt¨aschel T, et al (2020) Differentiable reasoning
on large knowledge bases and natural language. In: Proceedings of the AAAI
conference on artificial intelligence, pp 5182–5190
Mohamed SK, Nounu A, Nov´cek V (2021) Biological applications of knowl-
edge graph embedding models. Briefings in bioinformatics 22(2):1679–1693
Monti D, Rizzo G, Morisio M (2021) A systematic literature review of multi-
criteria recommender systems. Artificial Intelligence Review 54:427–468
Naseriparsa M, Islam MS, Liu C, et al (2018) No-but-semantic-match: com-
puting semantically matched xml keyword search results. World Wide Web
21(5):1223–1257
Naseriparsa M, Islam MS, Liu C, et al (2019a) Xsnippets: Exploring semi-
structured data via snippets. Data Knowl Eng 124
Naseriparsa M, Liu C, Islam MS, et al (2019b) Xplorerank: exploring XML
data via you may also like queries. World Wide Web 22(4):1727–1750
Nayyeri M, Cil GM, Vahdati S, et al (2021) Trans4e: Link prediction on
scholarly knowledge graphs. Neurocomputing 461:530–542
Nguyen DQ, Sirts K, Qu L, et al (2016) Stranse: a novel embedding
model of entities and relationships in knowledge bases. arXiv preprint
36 Knowledge Graphs: Opportunities and Challenges
arXiv:160608140
Nguyen DQ, Nguyen TD, Nguyen DQ, et al (2017) A novel embedding model
for knowledge base completion based on convolutional neural network. In: In
Proceedings of the 2018 Conference of the North American Chapter of the
Association for Computational Linguistics: Human Language Technologies,
p 327–333
Nguyen HL, Vu DT, Jung JJ (2020) Knowledge graph fusion for smart systems:
A survey. Information Fusion 61:56–70
Nickel M, Tresp V, Kriegel HP (2011) A three-way model for collective learning
on multi-relational data. In: Icml
Nickel M, Rosasco L, Poggio T (2016) Holographic embeddings of knowledge
graphs. In: Proceedings of the AAAI Conference on Artificial Intelligence
Noy N, Gao Y, Jain A, et al (2019) Industry-scale knowledge graphs: Lessons
and challenges: Five diverse technology companies show how it’s done.
Queue 17(2):48–75
Palumbo E, Rizzo G, Troncy R, et al (2018) Knowledge graph embeddings with
node2vec for item recommendation. In: European Semantic Web Conference,
Springer, pp 117–120
Palumbo E, Monti D, Rizzo G, et al (2020) entity2rec: Property-specific knowl-
edge graph embeddings for item recommendation. Expert Systems with
Applications 151:113,235
Pedersen T, Patwardhan S, Michelizzi J, et al (2004) Wordnet:: Similarity-
measuring the relatedness of concepts. In: AAAI, pp 25–29
Peng C, Vu DT, Jung JJ (2021) Knowledge graph-based metaphor represen-
tation for literature understanding. Digital Scholarship in the Humanities
Pereira CK, Siqueira SWM, Nunes BP, et al (2017) Linked data in education:
A survey and a synthesis of actual research and future challenges. IEEE
Transactions on Learning Technologies 11(3):400–412
Qiu Y, Wang Y, Jin X, et al (2020) Stepwise reasoning for multi-relation ques-
tion answering over knowledge graph with weak supervision. In: Proceedings
of the 13th International Conference on Web Search and Data Mining, pp
474–482
Quijano-S´anchez L, Cantador I, Cort´es-Cediel ME, et al (2020) Recommender
systems for smart cities. Information systems 92:101,545
Knowledge Graphs: Opportunities and Challenges 37
Rebele T, Suchanek F, Hoffart J, et al (2016) Yago: A multilingual knowledge
base from wikipedia, wordnet, and geonames. In: International semantic web
conference, Springer, pp 177–185
Ren H, Dai H, Dai B, et al (2022) Smore: Knowledge graph completion and
multi-hop reasoning in massive knowledge graphs. In: Proceedings of the
28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining,
pp 1472–1482
Ren J, Xia F, Chen X, et al (2021) Matching algorithms: Fundamentals,
applications and challenges. IEEE Transactions on Emerging Topics in
Computational Intelligence 5(3):332–350
Rodriguez-Muro M, Rezk M (2015) Efficient sparql-to-sql with r2rml map-
pings. Journal of Web Semantics 33:141–169
Rossi A, Barbosa D, Firmani D, et al (2021) Knowledge graph embedding for
link prediction: A comparative analysis. ACM Transactions on Knowledge
Discovery from Data (TKDD) 15(2):1–49
Salatino AA, Thanapalasingam T, Mannocci A, et al (2020) The computer
science ontology: A comprehensive automatically-generated taxonomy of
research areas. Data Intell 2(3)
Saraji MK, Mardani A, oppen M, et al (2022) An extended hesitant fuzzy set
using swara-multimoora approach to adapt online education for the control
of the pandemic spread of covid-19 in higher education institutions. Artificial
Intelligence Review 55(1):181–206
Saxena A, Tripathi A, Talukdar P (2020) Improving multi-hop question
answering over knowledge graphs using knowledge base embeddings. In: Pro-
ceedings of the 58th annual meeting of the association for computational
linguistics, pp 4498–4507
Schlichtkrull M, Kipf TN, Bloem P, et al (2018) Modeling relational data
with graph convolutional networks. In: European semantic web conference,
Springer, pp 593–607
Shao B, Li X, Bian G (2021) A survey of research hotspots and frontier trends
of recommendation systems from the perspective of knowledge graph. Expert
Systems with Applications 165:113,764
Shao P, Zhang D, Yang G, et al (2022) Tucker decomposition-based temporal
knowledge graph completion. Knowledge-Based Systems 238:107,841
Shi B, Weninger T (2018) Open-world knowledge graph completion. In: Thirty-
Second AAAI Conference on Artificial Intelligence
38 Knowledge Graphs: Opportunities and Challenges
Shi C, Ding J, Cao X, et al (2021) Entity set expansion in knowledge graph:
a heterogeneous information network perspective. Frontiers of Computer
Science 15(1):1–12
Shin S, Jin X, Jung J, et al (2019) Predicate constraints based question
answering over knowledge graph. Information Processing & Management
56(3):445–462
Shokeen J, Rana C (2020) A study on features of social recommender systems.
Artificial Intelligence Review 53(2):965–988
Shu H, Huang J (2021) User-preference based knowledge graph feature
and structure learning for recommendation. In: 2021 IEEE International
Conference on Multimedia and Expo (ICME), IEEE, pp 1–6
Singh K, Lytra I, Radhakrishna AS, et al (2020) No one is perfect: Analysing
the performance of question answering components over the dbpedia knowl-
edge graph. Journal of Web Semantics 65:100,594
Smirnov A, Levashova T (2019) Knowledge fusion patterns: A survey. Infor-
mation Fusion 52:31–40
Socher R, Chen D, Manning CD, et al (2013) Reasoning with neural tensor
networks for knowledge base completion. In: Advances in neural information
processing systems, pp 926–934
Sun J, Xu J, Zheng K, et al (2017) Interactive spatial keyword querying with
semantics. In: Proceedings of the 2017 ACM on Conference on Information
and Knowledge Management, CIKM 2017, Singapore, November 06 - 10,
2017. ACM, pp 1727–1736
Sun K, Yu S, Peng C, et al (2022) Relational structure-aware knowledge graph
representation in complex space. Mathematics 10(11):1930
Sun R, Cao X, Zhao Y, et al (2020) Multi-modal knowledge graphs for recom-
mender systems. In: Proceedings of the 29th ACM International Conference
on Information & Knowledge Management, pp 1405–1414
Sun Z, Deng ZH, Nie JY, et al (2019a) Rotate: Knowledge graph embedding
by relational rotation in complex space. arXiv preprint arXiv:190210197
Sun Z, Guo Q, Yang J, et al (2019b) Research commentary on recommen-
dations with side information: A survey and research directions. Electronic
Commerce Research and Applications 37:100,879
Trouillon T, Welbl J, Riedel S, et al (2016) Complex embeddings for simple
link prediction. In: International conference on machine learning, PMLR, pp
2071–2080
Knowledge Graphs: Opportunities and Challenges 39
Ugander J, Karrer B, Backstrom L, et al (2011) The anatomy of the facebook
social graph. arXiv preprint arXiv:11114503
Vashishth S, Sanyal S, Nitin V, et al (2020) Interacte: Improving convolution-
based knowledge graph embeddings by increasing feature interactions. In:
Proceedings of the AAAI Conference on Artificial Intelligence, pp 3009–3016
Vrandeˇci´c D, Kr¨otzsch M (2014) Wikidata: a free collaborative knowledgebase.
Communications of the ACM 57(10):78–85
Wan G, Pan S, Gong C, et al (2021) Reasoning like human: Hierarchical rein-
forcement learning for knowledge graph reasoning. In: Proceedings of the
Twenty-Ninth International Conference on International Joint Conferences
on Artificial Intelligence, pp 1926–1932
Wan L, Xia F, Kong X, et al (2020) Deep matrix factorization for trust-aware
recommendation in social networks. IEEE Transactions on Network Science
and Engineering 8(1):511–528
Wang C, Yu H, Wan F (2018a) Information retrieval technology based on
knowledge graph. In: 2018 3rd International Conference on Advances in
Materials, Mechatronics and Civil Engineering (ICAMMCE 2018), Atlantis
Press, pp 291–296
Wang H, Zhang F, Wang J, et al (2018b) Ripplenet: Propagating user pref-
erences on the knowledge graph for recommender systems. In: Proceedings
of the 27th ACM International Conference on Information and Knowledge
Management, pp 417–426
Wang H, Zhang F, Zhao M, et al (2019a) Multi-task feature learning for
knowledge graph enhanced recommendation. In: The World Wide Web
Conference, pp 2000–2010
Wang K, Shen Z, Huang C, et al (2020a) Microsoft academic graph: When
experts are not enough. Quantitative Science Studies 1(1):396–413
Wang L, Ren J, Xu B, et al (2020b) Model: Motif-based deep feature learning
for link prediction. IEEE Transactions on Computational Social Systems
7(2):503–516
Wang Q, Li M, Wang X, et al (2020c) Covid-19 literature knowledge
graph construction and drug repurposing report generation. arXiv preprint
arXiv:200700576
Wang R, Yan Y, Wang J, et al (2018c) Acekg: A large-scale knowledge graph
for academic data mining. In: Proceedings of the 27th ACM International
Conference on Information and Knowledge Management. Association for
40 Knowledge Graphs: Opportunities and Challenges
Computing Machinery, New York, NY, USA, CIKM ’18, p 1487–1490
Wang W, Liu J, Yang Z, et al (2019b) Sustainable collaborator recommen-
dation based on conference closure. IEEE Transactions on Computational
Social Systems 6(2):311–322
Wang W, Liu J, Tang T, et al (2020d) Attributed collaboration network
embedding for academic relationship mining. ACM Transactions on the Web
(TWEB) 15(1):1–20
Wang X, Wang D, Xu C, et al (2019c) Explainable reasoning over knowledge
graphs for recommendation. In: Proceedings of the AAAI Conference on
Artificial Intelligence, pp 5329–5336
Wang Y, Dong L, Li Y, et al (2021) Multitask feature learning approach
for knowledge graph enhanced recommendations with ripplenet. Plos one
16(5):e0251,162
Wang Z, Zhang J, Feng J, et al (2014) Knowledge graph embedding by trans-
lating on hyperplanes. In: Proceedings of the AAAI Conference on Artificial
Intelligence
Wang Z, Chen T, Ren J, et al (2018d) Deep reasoning with knowledge graph
for social relationship understanding. arXiv preprint arXiv:180700504
Wang Z, Yin Z, Argyris YA (2020e) Detecting medical misinformation on
social media using multimodal deep learning. IEEE journal of biomedical
and health informatics 25(6):2193–2203
Wise C, Ioannidis VN, Calvo MR, et al (2020) Covid-19 knowledge graph:
accelerating information retrieval and discovery for scientific literature.
arXiv preprint arXiv:200712731
Wu Y, Yang S, Yan X (2013) Ontology-based subgraph querying. In: 29th
IEEE International Conference on Data Engineering, ICDE 2013, Brisbane,
Australia, April 8-12, 2013. IEEE Computer Society, pp 697–708
Xia F, Asabere NY, Liu H, et al (2014a) Socially aware conference participant
recommendation with personality traits. IEEE Systems Journal 11(4):2255–
2266
Xia F, Liu H, Asabere NY, et al (2014b) Multi-category item recommendation
using neighborhood associations in trust networks. In: Proceedings of the
23rd International Conference on World Wide Web, pp 403–404
Xia F, Liu H, Lee I, et al (2016) Scientific article recommendation: Exploiting
common author relations and historical preferences. IEEE Transactions on
Knowledge Graphs: Opportunities and Challenges 41
Big Data 2(2):101–112
Xia F, Sun K, Yu S, et al (2021) Graph learning: A survey. IEEE Transactions
on Artificial Intelligence 2(2):109–127
Xian Y, Fu Z, Muthukrishnan S, et al (2019) Reinforcement knowledge graph
reasoning for explainable recommendation. In: Proceedings of the 42nd inter-
national ACM SIGIR conference on research and development in information
retrieval, pp 285–294
Xiao H, Huang M, Hao Y, et al (2015) Transg: A generative mixture model
for knowledge graph embedding. arXiv preprint arXiv:150905488
Xiong W, Hoang T, Wang WY (2017) Deeppath: A reinforcement learning
method for knowledge graph reasoning. arXiv preprint arXiv:170706690
Xu J, Yu S, Sun K, et al (2020) Multivariate relations aggregation learning
in social networks. In: Proceedings of the ACM/IEEE Joint Conference on
Digital Libraries in 2020, pp 77–86
Xu K, Wang L, Yu M, et al (2019) Cross-lingual knowledge graph alignment
via graph matching neural network. arXiv preprint arXiv:190511605
Yao L, Zhang Y, Wei B, et al (2017) Incorporating knowledge graph embed-
dings into topic modeling. In: Thirty-First AAAI Conference on Artificial
Intelligence
Yao L, Mao C, Luo Y (2019) Kg-bert: Bert for knowledge graph completion.
arXiv preprint arXiv:190903193
Yao S, Wang R, Sun S, et al (2020) Joint embedding learning of edu-
cational knowledge graphs. Artificial Intelligence Supported Educational
Technologies pp 209–224
Ying R, He R, Chen K, et al (2018) Graph convolutional neural networks for
web-scale recommender systems. In: Proceedings of the 24th ACM SIGKDD
international conference on knowledge discovery & data mining, pp 974–983
Yong Y, Yao Z, Zhao Y (2021) A framework for reviewer recommendation
based on knowledge graph and rules matching. In: 2021 IEEE Interna-
tional Conference on Information Communication and Software Engineering
(ICICSE), pp 199–203
Yu H, Li H, Mao D, et al (2020) A relationship extraction method for domain
knowledge graph construction. World Wide Web 23(2):735–753
Yuan H, Deng W (2021) Doctor recommendation on healthcare consultation
platforms: an integrated framework of knowledge graph and deep learning.
42 Knowledge Graphs: Opportunities and Challenges
Internet Research
Zablith F (2022) Constructing social media links to formal learning: A knowl-
edge graph approach. Educational technology research and development pp
1–26
Zhang H, Fang Q, Qian S, et al (2019a) Multi-modal knowledge-aware event
memory network for social media rumor detection. In: Proceedings of the
27th ACM International Conference on Multimedia, pp 1942–1951
Zhang N, Deng S, Sun Z, et al (2019b) Long-tail relation extraction via knowl-
edge graph embeddings and graph convolution networks. arXiv preprint
arXiv:190301306
Zhang S, Tay Y, Yao L, et al (2019c) Quaternion knowledge graph embeddings.
Advances in neural information processing systems 32
Zhang Y, Zhang F, Yao P, et al (2018) Name disambiguation in aminer: Clus-
tering, maintenance, and human in the loop. In: Proceedings of the 24th
ACM SIGKDD International Conference on Knowledge Discovery & Data
Mining, pp 1002–1011
Zhang Y, Sheng M, Zhou R, et al (2020a) Hkgb: An inclusive, extensible,
intelligent, semi-auto-constructed knowledge graph framework for health-
care with clinicians’ expertise incorporated. Information Processing &
Management 57(6):102,324
Zhang Z, Cai J, Zhang Y, et al (2020b) Learning hierarchy-aware knowl-
edge graph embeddings for link prediction. In: Proceedings of the AAAI
Conference on Artificial Intelligence, pp 3065–3072
Zhang Z, Wang J, Chen J, et al (2021) Cone: Cone embeddings for multi-
hop reasoning over knowledge graphs. Advances in Neural Information
Processing Systems 34:19,172–19,183
Zhao X, Jia Y, Li A, et al (2020) Multi-source knowledge fusion: a survey.
World Wide Web 23(4):2567–2592
Zheng D, Song X, Ma C, et al (2020) Dgl-ke: Training knowledge graph
embeddings at scale. In: Proceedings of the 43rd International ACM SIGIR
Conference on Research and Development in Information Retrieval, pp
739–748
Zheng Y, Wang DX (2022) A survey of recommender systems with multi-
objective optimization. Neurocomputing 474:141–153
Knowledge Graphs: Opportunities and Challenges 43
Zhou D, Zhou B, Zheng Z, et al (2022) Schere: Schema reshaping for
enhancing knowledge graph construction. In: Proceedings of the 31st ACM
International Conference on Information & Knowledge Management, pp
5074–5078
Zhu A, Ouyang D, Liang S, et al (2022a) Step by step: A hierarchical frame-
work for multi-hop knowledge graph reasoning with reinforcement learning.
Knowledge-Based Systems 248:108,843
Zhu G, Iglesias CA (2018) Exploiting semantic similarity for named entity
disambiguation in knowledge graphs. Expert Systems with Applications
101:8–24
Zhu X, Li Z, Wang X, et al (2022b) Multi-modal knowledge graph construction
and application: A survey. arXiv preprint arXiv:220205786
Zou X (2020) A survey on application of knowledge graph. Journal of Physics:
Conference Series 1487:012,016
... Additionally, the TuckER (Balažević 2019) model leverages tensor decomposition to map triplets into tensor space , enhancing its ability to model complex relationships. The HINGE (Rosso et al., 2020) model, on the other hand, captures logical relations in knowledge graphs through logical rules, thereby improving knowledge reasoning capabilities (Peng et al., 2023). Although these single-view models have shown promising results in processing triplets, they often struggle to scale to more complex hyper-relational scenarios . ...
Article
Full-text available
This study proposes a dual-view hyper-relational knowledge graph embedding model aimed at addressing the challenges of embedding complex relationships in knowledge graphs. Traditional methods primarily handle simple triplet relations and struggle with the complexity of hyper-relations. By integrating instance view and ontology view, our model, DVHE, captures hierarchical structural information between entities and is applied to link prediction tasks. Experimental results show that DVHE significantly outperforms existing single-view and dual-view models across multiple benchmark datasets, particularly in handling complex hyper-relations and hierarchical information. Ablation studies further validate the effectiveness of the model’s components, providing new insights for the development of knowledge graph embeddings.
Article
Full-text available
Relations in knowledge graphs have rich relational structures and various binary relational patterns. Various relation modelling strategies are proposed for embedding knowledge graphs, but they fail to fully capture both features of relations, rich relational structures and various binary relational patterns. To address the problem of insufficient embedding due to the complexity of the relations, we propose a novel knowledge graph representation model in complex space, namely MARS, to exploit complex relations to embed knowledge graphs. MARS takes the mechanisms of complex numbers and message-passing and then embeds triplets into relation-specific complex hyperplanes. Thus, MARS can well preserve various relation patterns, as well as structural information in knowledge graphs. In addition, we find that the scores generated from the score function approximate a Gaussian distribution. The scores in the tail cannot effectively represent triplets. To address this particular issue and improve the precision of embeddings, we use the standard deviation to limit the dispersion of the score distribution, resulting in more accurate embeddings of triplets. Comprehensive experiments on multiple benchmarks demonstrate that our model significantly outperforms existing state-of-the-art models for link prediction and triple classification.
Article
Full-text available
Knowledge graphs (KGs) have emerged as a compelling abstraction for organizing the world's structured knowledge and for integrating information extracted from multiple data sources. They are also beginning to play a central role in representing information extracted by AI systems, and for improving the predictions of AI systems by giving them knowledge expressed in KGs as input. The goals of this article are to (a) introduce KGs and discuss important areas of application that have gained recent prominence; (b) situate KGs in the context of the prior work in AI; and (c) present a few contrasting perspectives that help in better understanding KGs in relation to related technologies.
Article
Recent years have witnessed the resurgence of knowledge engineering which is featured by the fast growth of knowledge graphs. However, most of existing knowledge graphs are represented with pure symbols, which hurts the machine's capability to understand the real world. The multi-modalization of knowledge graphs is an inevitable key step towards the realization of human-level machine intelligence. The results of this endeavor are Multi-modal Knowledge Graphs (MMKGs). In this survey on MMKGs constructed by texts and images, we first give definitions of MMKGs, followed with the preliminaries on multi-modal tasks and techniques. We then systematically review the challenges, progresses and opportunities on the construction and application of MMKGs respectively, with detailed analyses of the strengths and weaknesses of different solutions. We finalize this survey with open research problems relevant to MMKGs.
Article
Entity Alignment (EA) is a crucial task in knowledge fusion, which aims to link entities with the same real-world identity from different Knowledge Graphs (KGs). Existing methods have achieved satisfactory performance, however, they mainly focus on single modal KG, which is difficult to be effectively applied to multi-modal scenes. In this paper, we propose a Multi-modal Joint entity Alignment Framework (MultiJAF), which can effectively utilize the knowledge of various modalities. Concretely, we first learn the embeddings of different modalities, i.e., structure, attribute and image modalities. Next, we adopt an attention-based multi-modal fusion network to integrate these embeddings and use obtained joint embeddings to compute a joint embedding-based similarity matrix SJ. Moreover, we design a Numerical Process Module (NPM) to infer a similarity matrix SN according to the numerical information of entities. In the end, we utilize a simple late fusion method to ensemble two similarity matrices for the final alignment. In addition, to reduce the cost of labeling data, we propose a novel NPM-based unsupervised multi-modal EA method. Experimental results on two real-world datasets demonstrate the effectiveness of our proposed MultiJAF.
Article
Recently, knowledge graph reasoning has sparked great interest in research community, which aims at inferring missing information in triples and provides critical support to various tasks (e.g., question answering and recommendation). To date, multi-hop reasoning is a dominant approach which infers the target answer by walking along the path connecting entities and relations, ensuring both accuracy and interpretability. However, in most knowledge graphs, there are multiple relations related to an identical entity, and multiple tail entities for an identical pair of head entity and relation. Due to this one-to-many dilemma, enlarged action space and ignoring logical relationship between entity and relation increase the difficulty of learning. In order to deal with such an issue, this work presents a novel paradigm for knowledge graph reasoning by decomposing it to a two-level hierarchical decision process. We apply the hierarchical reinforcement learning framework which dismantles the task into a high-level process for relation detector and a low-level process for entity reasoning, respectively. In this way, the action space is effectively controlled where the policies can be optimized. The interactions between entity and relation decision enhance the rationality of reasoning. Moreover, we introduce a dynamic prospect mechanism for low-level policy where the information can guide us to a refined and improved action space, assisted by embedding based method. Our proposed model is evaluated on four benchmark datasets and the results validate its superiority over state-of-the-art baselines, showing the interpretability of reasoning process simultaneously.
Article
In this paper, we propose to build a morpho-semantic knowledge graph from Arabic vocalized corpora. Our work focuses on classical Arabic as it has not been deeply investigated in related works. We use a tool suite which allows analyzing and disambiguating Arabic texts, taking into account short diacritics to reduce ambiguities. At the morphological level, we combine Ghwanmeh stemmer and MADAMIRA which are adapted to extract a multi-level lexicon from Arabic vocalized corpora. At the semantic level, we infer semantic dependencies between tokens by exploiting contextual knowledge extracted by a concordancer. Both morphological and semantic links are represented through compressed graphs, which are accessed through lazy methods. These graphs are mined using a measure inspired from BM25 to compute one-to-many similarity. Indeed, we propose to evaluate the morpho-semantic Knowledge Graph in the context of Arabic Information Retrieval (IR). Several scenarios of document indexing and query expansion are assessed. That is, we vary indexing units for Arabic IR based on different levels of morphological knowledge, a challenging issue which is not yet resolved in previous research. We also experiment several combinations of morpho-semantic query expansion. This permits to validate our resource and to study its impact on IR based on state-of-the art evaluation metrics.
Article
While education increasingly relies on social media technologies to provide richer learning experiences, the rigid and course-centric design of curricula still imposes a challenge for students to construct meaningful connections between social media and formal learning. Building on the knowledge graphs’ potential to establish semantic links among data entities, this paper investigates to what extent knowledge graph-based tools help students with integrating and accessing transdisciplinary social media content in formal courses, and contribute to constructivism in online learning environments? This study proposes a framework that includes a set of tools built on a novel knowledge graph designed to help educators in exposing detailed coverage of their formal courses through explicit concepts, which can serve as building blocks for students to integrate and access transdisciplinary social media content in formal learning settings. The framework is piloted in a business school where 180 students used these tools in an information systems (IS) course. The preliminary results indicate the majority (around 68%) of materials shared and accessed by students through this framework was connected to other disciplines beyond IS, reflecting the possible creation and exploration of transdisciplinary links between social media content and formal courses. Thirty-three students were interviewed to evaluate their opinion on the tools with respect to social constructivism in online learning environments. The interviews provide initial insights on the tools’ potential to promote constructivism by supporting collaborative, learner-centered, high-quality, authentic, facilitated, and interactive learning principles. The study helps students and educators better integrate and access emerging social media content in formal courses.