Chapter

Lexical Networks Constructed to Correspond to Students’ Short Written Responses: A Quantum Semantic Approach

Authors:
To read the full-text of this research, you can request a copy directly from the authors.

Abstract

A simple method to construct lexical networks (lexicons) of how students use scientific terms in written texts is introduced. The method is based on a recently introduced quantum semantics generalization of a word-pair co-occurrence. Quantum semantics allows entangled co-occurrence, thus allowing to model the effect of subjective bias on weighting the importance of word co-occurrence. Using such a generalized word-pair co-occurrence counting, we construct students’ lexicons of scientific (life-science) terms they use in their written responses to questions concerning food chains in life-science contexts. The method allows us to construct ensembles of lexicons that probabilistically simulate the variability of individual lexicons. The re-analyses of the written reports show that while sets of top-ranking terms contain nearly the same terms irrespective of details of the method used to count co-occurrences, the relative rankings of some key-terms may be different in quantum semantic analysis.

No full-text available

Request Full-text Paper PDF

To read the full-text of this research,
you can request a copy directly from the authors.

... Moreover, applying the aligning techniques to answers can remove various relevant words, which can affect the sentence embeddings. [36,145]. ...
Article
Full-text available
In natural language processing, short-text semantic similarity (STSS) is a very prominent field. It has a significant impact on a broad range of applications, such as question–answering systems, information retrieval, entity recognition, text analytics, sentiment classification, and so on. Despite their widespread use, many traditional machine learning techniques are incapable of identifying the semantics of short text. Traditional methods are based on ontologies, knowledge graphs, and corpus-based methods. The performance of these methods is influenced by the manually defined rules. Applying such measures is still difficult, since it poses various semantic challenges. In the existing literature, the most recent advances in short-text semantic similarity (STSS) research are not included. This study presents the systematic literature review (SLR) with the aim to (i) explain short sentence barriers in semantic similarity, (ii) identify the most appropriate standard deep learning techniques for the semantics of a short text, (iii) classify the language models that produce high-level contextual semantic information, (iv) determine appropriate datasets that are only intended for short text, and (v) highlight research challenges and proposed future improvements. To the best of our knowledge, we have provided an in-depth, comprehensive, and systematic review of short text semantic similarity trends, which will assist the researchers to reuse and enhance the semantic information.
Article
Full-text available
This article proposes an approach to compare semantic networks using concept-centered sub-networks. A concept-centered sub-network is defined as an induced network whose vertex set consists of the given concept (ego) and all its adjacent concepts (alters) and whose link set consists of all the links between the ego and alters (including alter-alter links). By looking at the vertex and link overlap indices of concept-centered networks we infer semantic similarity of the underlying concepts. We cross-evaluate the semantic similarity by close-reading textual contexts from which networks are derived. We illustrate the approach on written and interview texts from an ethnographic study of flood management practice in England.
Article
Full-text available
The purpose of this study is to increase the understanding about undergraduate life science students’ conceptions concerning the role of photosynthesizing plants in the ecosystem, utilizing a network analysis method. Science learning requires the integration and linking of abstract and often counterintuitive concepts successfully into multifaceted networks. The quality of these networks, together with their abilities to communicate via the language of science, influences students’ success in academic, verbal problem-solving tasks. This study contributes to investigating students’ understanding, utilizing a modern network analysis method in exploring first-year university life science students’ written answers. In this study, a total of 150 first-year life science students answered two open-ended tasks related to the role of photosynthesizing plants in the ecosystem. A network analysis tool was used in exploring the occurrence of different-level science concepts and the interrelatedness between these concepts in students’ verbal outputs. The results showed that the richness of concept networks and students’ use of macro-concepts were remarkably varied between the tasks. Higher communicability measures were connected to the more abundant existence of macro-concepts in the task concerning the role of plants from the food-chain perspective. In the answers for the task concerning the role of plants regarding the atmosphere, the students operated mainly with single facts, and there were only minor interconnections made between the central concepts. On the basis of these results, the need for more all-encompassing biology teaching concerning complex environmental and socio-economic problems became evident. Thus, methodological and pedagogical contributions are discussed.
Article
Full-text available
The paper presents quantum model of subjective text perception based on binary cognitive distinctions corresponding to words of natural language. The result of perception is quantum cognitive state represented by vector in the qubit Hilbert space. Complex-valued structure of the quantum state space extends the standard vector-based approach to semantics, allowing to account for subjective dimension of human perception in which the result is constrained, but not fully predetermined by input information. In the case of two distinctions, the perception model generates a two-qubit state, entanglement of which quantifies semantic connection between the corresponding words. This two-distinction perception case is realized in the algorithm for detection and measurement of semantic connectivity between pairs of words. The algorithm is experimentally tested with positive results. The developed approach to cognitive modeling unifies neurophysiological, linguistic, and psychological descriptions in a mathematical and conceptual structure of quantum theory, extending horizons of machine intelligence.
Article
Full-text available
The paper describes an algorithm for semantic representation of behavioral contexts relative to a dichotomic decision alternative. The contexts are represented as quantum qubit states in two-dimensional Hilbert space visualized as points on the Bloch sphere. The azimuthal coordinate of this sphere functions as a one-dimensional semantic space in which the contexts are accommodated according to their subjective relevance to the considered uncertainty. The contexts are processed in triples defined by knowledge of a subject about a binary situational factor. The obtained triads of context representations function as stable cognitive structure at the same time allowing a subject to model probabilistically-variative behavior. The developed algorithm illustrates an approach for quantitative subjectively-semantic modeling of behavior based on conceptual and mathematical apparatus of quantum theory.
Article
Full-text available
Resources such as FrameNet, which provide sets of semantic frame definitions and annotated textual data that maps into the evoked frames, are important for several NLP tasks. However, they are expensive to build and, consequently, are unavailable for many languages and domains. Thus, approaches able to induce semantic frames in an unsupervised manner are highly valuable. In this paper we approach that task from a network perspective as a community detection problem that targets the identification of groups of verb instances that evoke the same semantic frame and verb arguments that play the same semantic role. To do so, we apply a graph-clustering algorithm to a graph with contextualized representations of verb instances or arguments as nodes connected by edges if the distance between them is below a threshold that defines the granularity of the induced frames. By applying this approach to the benchmark dataset defined in the context of SemEval 2019, we outperformed all of the previous approaches to the task, achieving the current state-of-the-art performance.
Article
Full-text available
Knowledge is a network of interconnected concepts. Yet, precisely how the topological structure of knowledge constrains its acquisition remains unknown, hampering the development of learning enhancement strategies. Here, we study the topological structure of semantic networks reflecting mathematical concepts and their relations in college-level linear algebra texts. We hypothesize that these networks will exhibit structural order, reflecting the logical sequence of topics that ensures accessibility. We find that the networks exhibit strong core–periphery architecture, where a dense core of concepts presented early is complemented with a sparse periphery presented evenly throughout the exposition; the latter is composed of many small modules each reflecting more narrow domains. Using tools from applied topology, we find that the expositional evolution of the semantic networks produces and subsequently fills knowledge gaps, and that the density of these gaps tracks negatively with community ratings of each textbook. Broadly, our study lays the groundwork for future efforts developing optimal design principles for textbook exposition and teaching in a classroom setting.
Article
Full-text available
Learning the wave-particle dualism of electrons and photons plays a central role in understanding quantum physics. Teaching it requires that the teacher is fluent in using abstract and uncommon terms. We inspect the lexical structures of pre-service teachers' declarative knowledge about the wave-particle dualism of electrons and photons in the context of double-slit interference. The declarative knowledge is analyzed in the form of a lexical network of terms. We focus on lexical structures because, in teaching and learning, knowledge is communicated mostly through lexical structures, i.e., by speaking and writing. Using the lexical networks, we construct the lexicons used by pre-service teachers to express their knowledge of electrons and photons in the context of double-slit interference. The lexicons consist of eight different key terms, each representing a set of closely-related or synonymous terms. The lexicons by 14 pre-service teachers reveal remarkable variation and differences, and are strongly context-dependent. We also analyzed lexicons corresponding to two didactically-oriented research articles on the same topic and found that they also differ. Lexicons paralleling both texts are found among the pre-service teachers' lexicons. However, only some of the pre-service teachers use such rich vocabulary as would indicate multi-faceted understanding of quantum entities.
Article
Full-text available
Language is hierarchically organized: words are built into phrases, sentences and paragraphs to represent complex ideas. A similar hierarchical structure is observed across many other biological, electronic and transportation networks supporting complex functions. Here, we ask whether the organization of language in written text displays fractal hierarchical architecture. Specifically, we test two hypotheses: (i) that the structure of the exposition in scientific research articles displays the Rentian scaling principle, which marks hierarchical fractal-like structure and (ii) that the exponent of the scaling law changes as the article is revised to maximize information transmission. Using 32 scientific manuscripts—each containing between 3 and 26 iterations of revision—we construct semantic networks in which nodes represented unique words in each manuscript, and in which edges connect nodes if two words appeared within the same five-word window. We show that these semantic networks modelling the content of scientific articles display clear Rentian scaling, and that the Rent exponent varies over the publication life cycle, from the first draft to the final revision. Furthermore, we observe that manuscripts fell into three clusters in terms of how the scaling exponents changed across drafts: exponents rising over time, falling over time and remaining relatively stable over time. This change in exponent reflects the evolution in semantic network structure over the manuscript revision process, highlighting a balance between network complexity, which increases the exponent, and network efficiency, which decreases the exponent. Lastly, the final value of the Rent exponent is negatively correlated with the number of authors. Taken together, our results suggest that semantic networks reflecting the structure of exposition in scientific research articles display striking hierarchical architecture that arbitrates trade-offs between competing constraints on network organization, and that this arbitration is navigated differently depending on the social environment characteristic of the collaboration.
Article
Full-text available
The odds ratio is one of the most widely used measures of association for 2 × 2 tables. A generalized Yule coefficient transforms the odds ratio into a correlation-like scale with a range from -1 to 1. Yule's Y, Yule's Q, Digby's H, and a new coefficient are special cases of a generalized Yule coefficient. The new coefficient is shown to be similar in value to the phi coefficient. A confidence interval and sample size formula for a generalized Yule coefficient are proposed. The proposed confidence interval is shown to perform much better than the Wald intervals that are implemented in statistical packages.
Article
Understanding how information navigates through nodes of a complex network has become an increasingly pressing problem across scientific disciplines. Several approaches have been proposed on the basis of shortest paths or diffusive navigation. However, no existing approaches have tackled the challenges of efficient communication in networks without full knowledge of their global topology under external noise. Here, we develop a first principles approach and mathematical formalism to determine the informational cost of navigating a network under different levels of external noise. Using this approach we discover the existence of a trade-off between the ways in which networks route information through shortest paths, their entropies and stability, which define three classes of real-world networks. This approach reveals that environmental pressure has shaped the ways in which information is transferred in bacterial metabolic networks and allowed us to determine the levels of noise at which a protein–protein interaction network seems to work in normal conditions in a cell.
Article
Just as language reflects one’s thoughts, the text of science textbooks reflects the structure of scientific knowledge and thought. Therefore, students’ learning of scientific language leads to their acquisition of the structure of scientific knowledge and thought. The purposes of this study were to extract scientific semantic network from science textbooks as the ideal model of using scientific language, and to examine how well science teachers are implementing scientific semantic networks in science classes by comparing and analysing their spoken language. Through a recursive method, science textbooks were searched for keywords and their relationships, and consequently a scientific semantic network was presented. As a result of the comparison with science teachers’ spoken language, it was found that the teachers were not able to implement the scientific semantic network exactly and efficiently in class, and the focus of the contents of their instruction was also different from the textbook. Science textbook was focused on qualitative aspects of the concept such as meaning, element, characteristic etc, on the other hand science teachers’ language was focused on quantitative aspect of the concept such as calculation, graph etc.
Article
Purpose The paper aims to provide a semiotic interpretation of the role played by entanglement in quantum-based models aimed to information retrieval and suggests possible improvements. Actual models are capable of retrieving documents relevant to a query composed of a keyword and its acceptation expressed by a given context. The paper also considers some analogies between this technique and quantum-based approaches in other disciplines to discuss the consequence of this quantum turn, as epistemology and philosophy of language are concerned. Design/methodology/approach We use quantum geometry to design a formal model for textual semiotics. In particular, the authors refer to Greimas’s work on semantics and information theory, to Eco’s writings on semantic memory and to Lotman’s work on a cybernetic notion of culture. Findings Quantum approaches imply a particular point of view on meaning. Meaning is not a real, positive quality of a given word. It is a net of relations constructed in the text, whose value is progressively determined during the reading process. Furthermore, reading is not a neutral operation: to read is to determine meaning. If it is said that, from a general semiotic point of view, meaning is stored in quantum semantic memories and is read/written by semantic machines, then the operation of “reading/writing” is analogous to the operation of measuring in quantum theory: in other terms, meaning is a value, and this implies an instance (not necessarily human) according to which values are valuable. Research limitations/implications The authors are not proposing a complete quantum semantics. At the present, quantum information retrieval can detect the presence of semantic relations. The authors suggest a way to characterize them, leaving open the problem on how to formalize the document as a vector in four-state semantic space. Practical implications A quantum turn shows deep semiotic implications on the approach to language, which shows an immanent semantic organization not reducible to syntax and morphology. This organization is probabilistic and indeterministic and explains to what extent text fixes the meaning of its lexical units. Social implications In the authors’ perspective, signification is not the exclusivity of a human subject. Criticizing Turing test, the great semiotic and cybernetic scholar Jurij Lotman wrote that if we identify “intelligent” and “human”, we raise the failings of an actual form of intelligence to the rank of an essential characteristic. On this line, meaning is considered as a feature of social, artificial and biological systems. Originality/value The adoption of quantum formalism seems in line with cybernetic framework, involving a probabilistic, non-cartesian point of view on meaning aimed to critically discuss the human–machine relation. Furthermore, Quantum theory (QT) implies a phenomenological point of view on the conditions of possibility of meaning.
Article
The language used to construct knowledge, beliefs, and worldviews in school science is distinct from the social language that students use in their everyday ordinary life. This difference is a major source of reading difficulty for many students, especially struggling readers and English-language learners. This article identifies some of the linguistic challenges involved in reading middle-school science texts and suggests several teaching strategies to help students cope with these challenges. It is argued that explicit attention to the unique language of school science should be an integral part of science literacy pedagogy.
Article
It is argued that language plays an active role in the development of scientific thought. A research project is outlined which will investigate this hypothesis and, in addition, focus on such questions as the role of mathematics in science and the status of the genetic code. Nothing is more usual than for philosophers to encroach on the province of grammarians, and to engage in disputes of words, while they imagine they are handling controversies of the deepest importance and concern.—David Hume.
Article
A fundamental problem in the study of complex networks is to provide quantitative measures of correlation and information flow between different parts of a system. To this end, several notions of communicability have been introduced and applied to a wide variety of real-world networks in recent years. Several such communicability functions are reviewed in this paper. It is emphasized that communication and correlation in networks can take place through many more routes than the shortest paths, a fact that may not have been sufficiently appreciated in previously proposed correlation measures. In contrast to these, the communicability measures reviewed in this paper are defined by taking into account all possible routes between two nodes, assigning smaller weights to longer ones. This point of view naturally leads to the definition of communicability in terms of matrix functions, such as the exponential, resolvent, and hyperbolic functions, in which the matrix argument is either the adjacency matrix or the graph Laplacian associated with the network. Considerable insight on communicability can be gained by modeling a network as a system of oscillators and deriving physical interpretations, both classical and quantum-mechanical, of various communicability functions. Applications of communicability measures to the analysis of complex systems are illustrated on a variety of biological, physical and social networks. The last part of the paper is devoted to a review of the notion of locality in complex networks and to computational aspects that by exploiting sparsity can greatly reduce the computational efforts for the calculation of communicability functions for large networks.
  • M Newman
Talking science: language. Learning and values. Language and educational processes
  • J Lemke
Lemke, J.: Talking science: language. Learning and values. Language and educational processes. Ablex Publishing Corporation, Norwood, NJ (1990)
New directions in language and science education research
  • L D Yore
  • B Hand
  • S R Goldman
  • G M Hilderbrand
  • J F Osborne
  • D F Treagust
  • C S Wallace
Applied Multivariate Statistical Analysis. Pearson
  • R A Johnson
  • D W Wichern