Conference PaperPDF Available

Language networks and semantic networks

Authors:

Abstract and Figures

How does the human brain work? How can we make sentences and make sentences? These are the questions dealt with already many scientists and several companies developing artificial intelligence. This article presents a study on language networks. At the beginning is the research of the works that have already dealt with this issue. In the next part, the author deals with the application of language networks and semantic networks.
Content may be subject to copyright.
Language networks and Semantic networks
Martin Žáček1, a), Zdeňka Telnarová1, b)
1 University of Ostrava, Faculty of Science, Department of Informatics and Computers, 30. dubna 22, Ostrava,
Czech Republic.
a) Corresponding author: Martin.Zacek@osu.cz
b) Zdenka.Telnarova@osu.cz
Abstract. How does the human brain work? How can we make sentences and make sentences? These are the questions
dealt with already many scientists and several companies developing artificial intelligence. This article presents a study
on language networks. At the beginning is the research of the works that have already dealt with this issue. In the next
part, the author deals with the application of language networks and semantic networks.
INTRODUCTION
Artificial intelligence modeling is already standard on at least two levels of abstraction, at conceptualization and
implementation level. On the first one, i.e. at a higher level of abstraction, conceptualization takes place, the output
of which is a conceptual model in language. Although this language has already strict formal rules, it also has
language resources that can be understood by the layman as well, thus contributing to the creation of a conceptual
model of the defined reference system. The second level uses the language as a rule to implement the model, which
is a modification of the first-order logic language and requires deeper mastering of language resources. [1]
In linguistics, the so-called Ullmann's triangle, which illustrates the relationships between reality, abstract
concepts and their linguistic representation, is well known (Figure 1). [1]
FIGURE 1. The Ullmann's triangle.
concept in mind
(conceptualization)
symbol
(language)
abstraction
representation
concrete things
(reality)
Dashed arrow named references between language and reality shows that a conceptualization always supports
the relationship between linguistic expression and the real thing. The language expression of the "college" in the
Czech language (in Czech koleje”) may respond by a conceptualization of reality "railway," in another
conceptualization "accommodation buildings for students." [1]
LANGUAGE NETWORKS
If network structure is a potential key for understanding universal statistical trends, then the first step is clearly
to define more precisely what kinds of networks are involved. It turns out that there are several possibilities (Fig. 1).
[2] First of all, we can look at the network structure of the language elements themselves, and this at different levels:
semantics and pragmatics, syntax, morphology, phonetics and phonology. Second, we can look at the language
community and the social structures defined by their members. Social networks can help in understanding how fast
new conventions propagate or what language variation will be sustained [2, 3].
Moreover, the network organization of individual interactions has been shown to influence the emergence of a
self-consistent language [2, 4]
How to build language networks.
Starting from a given text (a) we can define different types of relationships between words, including precedence
relations and syntactic relations. In (b) we show them using blue and black arrows, respectively. In figures (c) and
(d) the corresponding co-occurrence and syntactic networks are shown. Paths on the network (c) can be understood
as the potential universe of sentences that can be constructed with the given lexicon. An example of such a path is
the sentence indicated in red. [2]
Nodes have been colored according to the total (in- and out-) word degree, highlighting key nodes during
navigation (The higher the degree the lighter its color). In (d) we build the corresponding syntactic network, taking
as a descriptive Framework dependency syntax (50), assuming as a criterion that arcs begin in complements and end
in the nucleus of the phrase; taking the verb as the nucleus of well-formed sentences. The previous sentence appears
now dissected into two different paths converging towards try.“ An example of the global pattern found in a larger
network is shown in (e) which is the occurrence network of a fragment of Moby Dick. In this graph hubs, we have
the, a, of, to. [2]
SEMANTIC NETWORKS
Semantic networks (Figure 3) [5, 6, 7] consist of individual words that lexicalize concepts and then map basic
semantic relationships (for example such as is special types of isa-relationships) that must be a binary relation (this
is the main condition semantic networks). They can be potentially built automatically from corpus data [8, 9, 10,
11]. The topology of these networks reveals highly efficient organizations, where mushrooms are polysemantic
words that have a great effect on the overall structure. It has been suggested that hubs organize a semantic web in a
categorical representation and explain the ubiquity of polysemy in different languages [8]. In this context, while
claiming that polysemy may be some type of historical accident (which should be avoided), the analysis of these
sites instead suggests that they are a necessary component of all languages. Additionally, as discussed in [9], the
topology without semantic network radiation creates some limitations on how these websites (and the ones
previously) can be implemented into neural hardware. The high clustering found in these sites favors search by
association, whereas short paths separating two arbitrary items glance [2, 10]
Semantic networks can define different ways. Figure 2 shows a simple network of semantic relationships
between lexicalized concepts. Nodes are the terms and links of semantic relations are between concepts. Links are
colored to highlight the different nature of relationships. Yellow arcs define isa-relationships (Flower Rose
indicates that Flower is hypernym Rose). Two concepts are linked by a blue arc when there is a partial relationship
between them. Binary opposition relations are bi-directional and violet-colored. Isa-relationship defines a tree-
structured network, and other relationships create shortcuts that lead the network to display a small world pattern,
making it easier and easier to navigate the network. [12, 13]
FIGURE 2. The language networks. [2]
FIGURE 3. The semantic networks. [2]
ACKNOWLEDGMENTS
The research described here has been financially supported by University of Ostrava grant SGS06/PŘF/18. Any
opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do
not reflect the views of the sponsors.
CONCLUSION
The aim of the article is to find the meaning of using language networks in artificial intelligence. As we can see
in theory, language networks are mainly used in biology and medicine.
This article introduces and compares the language network with semantic networks. In conclusion, we would
especially like to thank the authors of Solé, Corominas-Murtra, Valverde, and Steels, who have created a beautiful
article: Language structures: Their structure, function, and evolution. This article has become a great inspiration, and
for this article I draw the theory from this article.
The authors will use this language network in the Czech language analysis. The Czech language does not have
clear rules compared to the English language, so we can create arbitrary sentences without fixed rules.
REFERENCES
1. A. Lukasová, M. Žáček, M. Vajgl, M. and Z. Telnarová. Formal logic and semantic web (in Czech), (Plzeň:
Polypress s.r.o., 2015). p. 262. ISBN 978-80-261-0408-7.
2. R.V. Solé, B. Corominas-Murtra, S. Valverde and L. Steels, Language networks: Their structure, function, and
evolution. (Complexity, 2010), 15, pp. 20-26.
3. W. Labov, Principles of Linguistic change. Volume II: Social Factors. (Blackwell, Oxford. 2000).
4. B. Corominas and R. V. Solé. Network topology and self-consistency in language games. In Journal of
Theoretical Biology. 2006, 241(2), pp. 438-441.
5. M. Žáček, R. Miarka, O. Sýkora, Visualization of semantic data, edited by R. Silhavy et al. (Springer, Cham,
2015), pp. 277-285. doi:10.1007/978-3-319-18476-0_28
6. M. Žáček and D. Homola, “Analysis of the English morphology by semantic networks”, In: International
Conference of Computational Methods in Sciences and Engineering 2017, ICCMSE 2017, AIP Conference
Proceedings 1906, edited by Simos T.E. et al. (American Institute of Physics, Melville, NY, 2017) p. 080006.
doi: 10.1063/1.5012351.
7. A. Lukasová, M. Vajgl and M. Žáček, Knowledge Represented Using RDF Semantic Network In The
Concept of Semantic Web, In: International Conference of Numerical Analysis and Applied Mathematics
2015, ICNAAM 2015, AIP Conference Proceedings 1738, edited by Simos T.E. et al. (American Institute of
Physics, Melville, NY, 2015) p. 120012. doi: 10.1063/1.4951895.
8. M. Sigman and G.A. Cecchi, G.A. Global organization of the Wordnet lexicon. (Natl.Acad. Sci. USA, 2002),
99(3), pp. 1742-1747.
9. M. Steyvers, and J.B. Tenenbaum, The largescale structure of semantic networks: statistical analyses and a
model of semantic growth. (Cogn. Sci., 2005), 29(1), pp. 41-78.
10. A. Motter, A. et al. Topology of the conceptual network of language. (Phys. Rev., 2002), E65, p. 065102.
11. A.J. Holanda, I. Torres Pisa, O. Kinouchi, A. Souto Martinez, and E. Seron Ruiz, Thesaurus as a complex
network. (Physica, 2004), A 344, pp. 530-536.
12. G. A. Miller, The Science of words. Sci. Am. Library. (Freeman and Co. NewYork, 1996).
13.
... The associative memory of an artificial cognitive agent in this case will represent a very strongly connected semantic network [Žáček & Telnarová, 2019], the activation of nodes in which will represent an «understanding of the current situation». The set of all nodes of the semantic network activated at a time is an act of cognition, a thought of a cognitive agent. ...
Chapter
The article presents a review of the phenomenon of understanding the meaning of the natural language and, more broadly, the meaning of the situation in which the cognitive agent is located, taking into account the context. A specific definition of understanding is given, which is at the intersection of neurophysiology, information theory and cybernetics. The scheme of abstract architecture of a cognitive agent (of arbitrary nature) is given, concerning which it is stated that an agent with such architecture can understand in the sense described in the work. It also provides a critique of J. Searle’s mental experiment «The Chinese Room» from the point of view of the construction of artificial cognitive agents implemented within a hybrid paradigm of artificial intelligence. The novelty of the presented work is based on the application of the author’s methodological approach to the construction of artificial cognitive agents, while in the framework of this approach is considered not just the perception of external stimuli from the environment, but the philosophical problem of «understanding» artificial cognitive agent of its sensory inputs. The relevance of the work follows from the renewed interest of the scientific community in the theme of Strong Artificial Intelligence (or AGI). The author’s contribution to the considered theme consists in complex consideration from different points of view of the theme of understanding perceived by artificial cognitive agents with the formation of prerequisites for the development of new models and the theory of understanding within the framework of artificial intelligence, which in the future will help to build a holistic theory of the nature of human mind. The article will be interesting for specialists working in the field of artificial intellectual systems and cognitive agents construction, as well as for scientists from other scientific fields — first of all, philosophy, neurophysiology, psychology etc. KeywordsPhilosophy of mindPhilosophy of artificial intelligenceChinese roomSemanticsPerceptionUnderstandingLearningMachine learningArtificial intelligenceStrong artificial intelligence
... Ассоциативная память искусственного когнитивного агента в этом случае будет представлять очень сильносвязанную семантическую сеть [Žáček & Telnarová, 2019], активация узлов в которых будет представлять собой «понимание текущей ситуации». Множество всех активированных в момент времени узлов семантической сети является актом когниции, мыслью когнитивного агента. ...
Preprint
Full-text available
The article presents a review of the phenomenon of understanding the meaning of the natural language and, more broadly, the meaning of the situation in which the cognitive agent is located, taking into account the context. A specific definition of understanding is given, which is at the intersection of neurophysiology, information theory and cybernetics. The scheme of abstract architecture of a cognitive agent (of arbitrary nature) is given, concerning which it is stated that an agent with such architecture can understand in the sense described in the work. It also provides a critique of J. Searle's mental experiment «The Chinese Room» from the point of view of the construction of artificial cognitive agents implemented within a hybrid paradigm of artificial intelligence. The novelty of the presented work is based on the application of the author's methodological approach to the construction of artificial cognitive agents, while in the framework of this approach is considered not just the perception of external stimuli from the environment, but the philosophical problem of «understanding» artificial cognitive agent of its sensory inputs. The relevance of the work follows from the renewed interest of the scientific community in the theme of Strong Artificial Intelligence (or AGI). The author's contribution to the considered theme consists in complex consideration from different points of view of the theme of understanding perceived by artificial cognitive agents with the formation of prerequisites for the development of new models and the theory of understanding within the framework of artificial intelligence, which in the future will help to build a holistic theory of the nature of human mind. The article will be interesting for specialists working in the field of artificial intellectual systems and cognitive agents construction, as well as for scientists from other scientific fields — first of all, philosophy, neurophysiology, psychology etc.
Chapter
The article describes the author’s approach to solving the problem of symbol grounding, which can be used in the development of artificial cognitive agents of the general level. When implementing this approach, such agents can receive the function of understanding the sense and context of the situations in which they find themselves. The article gives a brief description of the problem of understanding the meaning and sense. In addition, the author’s vision is given of how the symbol grounding should occur when the artificial cognitive agent uses sensory information flows of various modality. Symbol grounding is carried out by building an associative-heterarchical network of concepts, with the help of which the hybrid architecture of an artificial cognitive agent is expanded. The novelty of the article is based on the author’s approach to solving the problem, which is represented by several important principles—these are multisensory integration, the use of an associative-heterarchical network of concepts and a hybrid paradigm of artificial intelligence. The relevance of the work is based on the fact that today the problem of constructing artificial cognitive agents of a general level is becoming more and more important for solving, including within the framework of national strategies for the development of artificial intelligence in various countries of the world. The article is of a theoretical nature and will be of interest to specialists in the field of artificial intelligence, as well as to all those who want to stay within the framework of modern trends in the field of artificial intelligence. KeywordsMeaningSenseFrege’s triangleSymbol groundingSemanticsUnderstandingArtificial intelligenceMultisensory integrationAssociative-heterarchical networkHybrid cognitive architecture
Conference Paper
Full-text available
The article is devoted to study the morphology of natural language, in this case English language. The research is of the language is from the perspective of knowledge representation, when we look at the word as a concept in the Concept languages. The research is in the relationship of the individual words and their classification in the sentence. For the analysis there are used several methods (syntax, lexical categories, morphology). This article focuses mainly on the word, as the foundation of every natural language (English).
Conference Paper
Full-text available
The RDF(S) model has been declared as the basic model to capture knowledge of the semantic web. It provides a common and flexible way to decompose composed knowledge to elementary statements, which can be represented by RDF triples or by RDF graph vectors. From the logical point of view, elements of knowledge can be expressed using at most binary predicates, which can be converted to RDF-triples or graph vectors. However, it is not able to capture implicit knowledge representable by logical formulas. This contribution shows how existing approaches (semantic networks and clausal form logic) can be combined together with RDF to obtain RDF-compatible system with ability to represent implicit knowledge and inference over knowledge base.
Conference Paper
Full-text available
The main goal of the Semantic Web is to direct the current syntactic web on the path to the Semantic Web. The vision of the Semantic Web is to interpret information on the web to be readable and machine-interpretable. Therefore , the article focuses on the creation of an instrument for visualizing semantic data on the basis of the identified advantages and disadvantages of existing applications for visualization, which are currently available.
Article
Full-text available
A thesaurus is one, out of many, possible representations of term (or word) connectivity. The terms of a thesaurus are seen as the nodes and their relationship as the links of a directed graph. The directionality of the links retains all the thesaurus information and allows the measurement of several quantities. This has lead to a new term classification according to the characteristics of the nodes, for example, nodes with no links in, no links out, etc. Using an electronic available thesaurus we have obtained the incoming and outgoing link distributions. While the incoming link distribution follows a stretched exponential function, the lower bound for the outgoing link distribution has the same envelope of the scientific paper citation distribution proposed by Albuquerque and Tsallis. However, a better fit is obtained by simpler function which is the solution of Ricatti's differential equation. We conjecture that this differential equation is the continuous limit of a stochastic growth model of the thesaurus network. We also propose a new manner to arrange a thesaurus using the ``inversion method''.
Article
Full-text available
We define two words in a language to be connected if they express similar concepts. The network of connections among the many thousands of words that make up a language is important not only for the study of the structure and evolution of languages, but also for cognitive science. We study this issue quantitatively, by mapping out the conceptual network of the English language, with the connections being defined by the entries in a Thesaurus dictionary. We find that this network presents a small-world structure, with an amazingly small average shortest path, and appears to exhibit an asymptotic scale-free feature with algebraic connectivity distribution.
Article
Human language is the key evolutionary innovation that makes humans different from other species. And yet, the fabric of language is tangled and all levels of description (from semantics to syntax) involve multiple layers of complexity. Recent work indicates that the global traits displayed by such levels can be analyzed in terms of networks of connected words. Here, we review the state of the art on language webs and their poten-tial relevance to cognitive science. The emergence of syntax through language acquisition is used as a case study to illustrate how the approach can shed light into relevant questions concerning language organization and its evolution.
Article
We present statistical analyses of the large-scale structure of 3 types of semantic networks: word associations, WordNet, and Roget's Thesaurus. We show that they have a small-world structure, characterized by sparse connectivity, short average path lengths between words, and strong local clustering. In addition, the distributions of the number of connections follow power laws that indicate a scale-free pattern of connectivity, with most nodes having relatively few connections joined together through a small number of hubs with many connections. These regularities have also been found in certain other complex natural networks, such as the World Wide Web, but they are not consistent with many conventional models of semantic organization, based on inheritance hierarchies, arbitrarily structured networks, or high-dimensional vector spaces. We propose that these structures reflect the mechanisms by which semantic networks grow. We describe a simple model for semantic growth, in which each new word or concept is connected to an existing network by differentiating the connectivity pattern of an existing node. This model generates appropriate small-world statistics and power-law connectivity distributions, and it also suggests one possible mechanistic basis for the effects of learning history variables (age of acquisition, usage frequency) on behavioral performance in semantic processing tasks.
Article
The lexicon consists of a set of word meanings and their semantic relationships. A systematic representation of the English lexicon based in psycholinguistic considerations has been put together in the database Wordnet in a long-term collaborative effort. We present here a quantitative study of the graph structure of Wordnet to understand the global organization of the lexicon. Semantic links follow power-law, scale-invariant behaviors typical of self-organizing networks. Polysemy (the ambiguity of an individual word) is one of the links in the semantic network, relating the different meanings of a common word. Polysemous links have a profound impact in the organization of the semantic graph, conforming it as a small world network, with clusters of high traffic (hubs) representing abstract concepts such as line, head, or circle. Our results show that: (i) Wordnet has global properties common to many self-organized systems, and (ii) polysemy organizes the semantic graph in a compact and categorical representation, in a way that may explain the ubiquity of polysemy across languages.
Article
(Steels 2003, Steels 2000, Hashimoto 1997) to abstract models of signal-object associations (Komarova & Niyogi 2004). In the later case, strong assumptions are typically made in order to reduce the potential complexity of the game dynamics, in such a way that analytic results can be extracted. A relevant example,in this context is a family of language,games involving a well-defined payoff defined in terms of the structure of the lexical matrix where all agents interact with each other (Nowak, Plotkin & Krakauer, 1999; Komarova & Niyogi, 2004; Komarova & Nowak, 2001). This type of games leads eventually the system to an optimization of a code shared by a number of interacting agents. In a general framework, this results belong to Information and Game theory (Plotkin & Nowak, 2001), but they have been related with both evolution of animal,communication,and human,languages,(Nowak & Krakauer, 1999). All these models consider a scenario of agent interactions where all-to-all exchanges occur. More precisely, the fitness function measuring the payoff associated to proper communication is computed,by matching,the performance,of each player with all the others. Real networks involving social interactions do not need to follow such a rule. Actually, it seems clear from available data that social networks,are typically sparse and have small world structure (Watts & Strogatz, 1998). Small path lengths between agents and large clustering are the two essential characteristics of these graphs. Here clustering refers to the presence of order at the local level defined in terms of many,triangles. Network topology,largely influences the way information or epidemics spread through the community, and both local and global properties can constrain the way,comunication,develops,and,evolves. It might,actually pervade the emergence of complex language traits, such as syntax (Ferrer-Cancho et al., 2005; Sol´e, 2005). Here we consider the impact of local structure, such as the presence of triangles in the graph of agent-agent interactions, in language games. As will be shown below, an important