Kai R. Larsen

Kai R. Larsen
University of Colorado Boulder | CUB · Division of Management & Entrepreneurship

Ph.D. Information Science and Behavioral Analytics

About

102
Publications
150,517
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
3,737
Citations
Introduction
Kai R. Larsen is an Associate Professor of Information Systems at Leeds School of Business, University of Colorado Boulder. He is a courtesy faculty member in the department of Information Science and a Fellow of the Institute of Behavioral Science, and a Research Advisor to the Gallup organization. Kai is most known for providing a practical solution to Edward Thorndike's (1904) Jingle Fallacy and for his contributions to the Semantic Theory of Survey Response (STSR), which evaluates leadership, adoption, and other attitude surveys at a deep, semantic level.
Additional affiliations
October 2015 - present
University of Colorado
Position
  • Associate Professor (courtesy appointment)
June 2000 - present
University of Colorado Boulder
Position
  • Professor (Associate)

Publications

Publications (102)
Article
Full-text available
The scholarly information-seeking process for behavioral research consists of three phases: search, access, and processing of past research. Existing IT artifacts, such as Google Scholar, have in part addressed the search and access phases, but fall short of facilitating the processing phase, creating a knowledge inaccessibility problem. We propose...
Article
Full-text available
The problem of detecting whether two behavioral constructs reference the same real world phenomenon has existed for over 100 years; we term discordant naming of constructs the Construct Identity Fallacy (CIF). We designed and evaluated the Construct Identity Detector (CID), the first tool with large-scale construct identity detection properties and...
Preprint
Full-text available
The goal of a review article is to present the current state of knowledge in a research area. Two important initial steps in writing a review article are boundary identification (identifying a body of potentially relevant past research) and corpus construction (selecting research manuscripts to include in the review). Using the Technology Acceptanc...
Preprint
Provides an accessible introduction to machine learning for business. The examples are built around the DataRobot Automated Machine Learning platform, but focus is on the principles of machine learning.
Conference Paper
Full-text available
We introduce the Information Systems Ontology (ISO), a new ontology for the Information Systems (IS) discipline designed to enable automated knowledge synthesis and meta-analysis of research findings in IS. We constructed ISO in a methodical manner, following known best practices for ontology construction. We also conducted a series of ontology ref...
Article
Full-text available
The 21st century has introduced the 4th Industrial Revolution, which describes an industrial paradigm shift that alters social, economic, and political environments simultaneously. Innovative technologies such as blockchain, artificial intelligence, and advanced mobile networks power this digital revolution. These technologies provide a unique comp...
Article
Full-text available
Academic disciplines are often organized according to the behaviors they examine. While most research on a behavior tends to exist within one discipline, some behaviors are examined by multiple disciplines. Better understanding of behaviors and their relationships should enable knowledge transfer across disciplines and theories, thereby dramaticall...
Article
Full-text available
The methodological tools available for psychological and organizational assessment are rapidly advancing through natural language processing (NLP). Computerized analyses of texts are increasingly available as extensions of traditional psychometric approaches. The present Research Topic is recognizing the contributions but also the challenges in pub...
Chapter
This chapter reviews the person-situation dimension in behavior prediction through the semantic theory of survey responses (STSR). This theory proposes that the most likely source of variation in correlations between scores on Likert-scale items is overlap in meaning. We review and explain a growing number of empirical studies that support this: Up...
Chapter
Full-text available
Research in design science has always acknowledged the need for evaluating its knowledge outcomes, with particular emphasis on assessing the efficacy and utility of the artifacts produced. However, the need to demonstrate the validity of the research process and outcomes has not received as much attention. This research examines scientific approach...
Article
The rapid and wide dissemination of up-to-date, localized information is a central issue during disasters. Being attributed to the original 140-character length, Twitter provides its users with quick-posting and easy-forwarding features that facilitate the timely dissemination of warnings and alerts. However, a concern arises with respect to the te...
Article
Full-text available
This study uses latent semantic analysis (LSA) to explore how prevalent measures of motivation are interpreted across very diverse job types. Building on the Semantic Theory of Survey Response (STSR), we calculate "semantic compliance" as the degree to which an individual's responses follow a semantically predictable pattern. This allows us to exam...
Conference Paper
Full-text available
Research in design science has always acknowledged the need for evaluating its knowledge outcomes, with particular emphasis on assessing the efficacy and utility of the artifacts produced. However, the need to demonstrate the validity of the research process and outcomes has not received as much attention. This research examines scientific approach...
Article
Full-text available
Trust and distrust are crucial aspects of human interaction that determine the nature of many organizational and business contexts. Because of socialization-borne familiarity that people feel about others, trust and distrust can influence people even when they do not know each other. Allowing that some aspects of the social knowledge that is acquir...
Article
Full-text available
Likert scale surveys are frequently used in cross-cultural studies on leadership. Recent publications using digital text algorithms raise doubt about the source of variation in statistics from such studies to the extent that they are semantically driven. The Semantic Theory of Survey Response (STSR) predicts that in the case of semantically determi...
Conference Paper
Full-text available
An important element of rigor in the information systems (IS) discipline are research validities. Broadly, validity deals with the quality of scientific research and dependability of scientific findings. Research validities provide procedural templates to collect and analyze evidence and justify the arguments and conclusions of a research study. Th...
Article
Full-text available
In this article, we provide a review of research-curation and knowledge-management efforts that may be leveraged to advance research and education in psychological science. After reviewing the approaches and content of other efforts, we focus on the metaBUS project’s platform, the most comprehensive effort to date. The metaBUS platform uses standar...
Conference Paper
Full-text available
Validity is among the most foundational and widely used concepts in science. Much has been written on the subject, yet, we continue to lack established definitions of research validities. This paper presents preliminary results for developing a general ontology of research validity. In this paper, we assembled the largest data set of validities and...
Article
Likert-scale surveys are frequently used in cross-cultural studies on leadership. Recent publications using digital text algorithms raise doubt about the source of variation in statistics from such studies to the extent that they are semantically driven. The Semantic Theory of Survey Response (STSR) predicts that in the case of semantically determi...
Article
Full-text available
Over the last century, the social and behavioral sciences have accumulated a vast storehouse of knowledge with the potential to transform society and all its constituents. Unfortunately, this knowledge has accumulated in a form (e.g., journal articles) that makes it extremely difficult to search, categorize, analyze and integrate across studies due...
Conference Paper
Full-text available
Validity and reliability are among the most widely used concepts in science. Broadly, both deal with the quality of scientific research and dependability of scientific findings. Many volumes have been written on the subject, with countless mentions and uses of the terms in scientific papers. Yet, confusion reigns supreme and we continue to lack est...
Article
Full-text available
This is a methodological presentation of the relationship between semantics and survey statistics in human resource development (HRD) research. This study starts with an introduction to the semantic theory of survey response (STSR) and proceeds by offering a guided approach to conducting such analyses. The reader is presented with two types of sema...
Article
Full-text available
Research on sensemaking in organisations and on linguistic relativity suggests that speakers of the same language may use this language in different ways to construct social realities at work. We apply a semantic theory of survey response (STSR) to explore such differences in quantitative survey research. Using text analysis algorithms, we have stu...
Data
Complete dataset as used in the article. All variables necessary to replicate our results with two exceptions: a) The MLQ items are copyright protected and only included with item numbers and semantic values, b) the replications with 2–500 dimensions are too extensive to be replicated here. (XLSX)
Article
Full-text available
The goal of a review article is to present the current state of knowledge in a research area. Two important initial steps in writing a review article are boundary identification (identifying a body of potentially relevant past research) and corpus construction (selecting research manuscripts to include in the review). We present a theory-as-discour...
Preprint
Full-text available
This chapter outlines the types of Automated Machine Learning, available tools and platforms, provides criteria for evaluating AutoML tools, and provides 30 machine learning principles and their conversion to AutoML. Chapter from: Kai R. Larsen and Daniel Becker, Automated Machine Learning for Business. Oxford University Press, 2019
Preprint
Full-text available
There are contexts in which one or more features, while entirely legitimate for modeling, are illegitimate for model evaluation. More specifically, a functional model could be built and put into production using such features, but these features would have occurred at or even after the data in the validation set, introducing a target leakage proble...
Preprint
Full-text available
Many experts consider target leakage one of the most insidious problems of automated machine learning. In this book, the term "target leakage" (aka. data leakage) will be defined in a broader scope than usual, as this provides an opportunity to discuss related issues of importance. Our definition of target leakage considers a target leak to have oc...
Preprint
Full-text available
Background: Academic disciplines are often organized according to the behaviors they examine. For example, behavioral medicine addresses health-related behaviors, such as smoking, drug abuse, and exercise, whereas a more specialized discipline such as information systems focuses on technology use. While most research on a behavior tends to exist wi...
Article
In this paper, we use Latent Semantic Analysis to explore the design battles in smartphones. Using newspaper coverage from 1992-2012, we build a semantic model of the media coverage to identify article clusters. Cluster membership gives us visibility into trends in coverage over the course of the study. We find that five distinct periods can be ide...
Article
Full-text available
The semantic theory of survey responses (STSR) proposes that the prime source of statistical covariance in survey data is the degree of semantic similarity (overlap of meaning) among the items of the survey. Because semantic structures are possible to estimate using digital text algorithms, it is possible to predict the response structures of Liker...
Article
The traditional understanding of data from Likert scales is that the quantifications involved result from measures of attitude strength. Applying a recently proposed semantic theory of survey response, we claim that survey responses tap two different sources: a mixture of attitudes plus the semantic structure of the survey. Exploring the degree to...
Article
Full-text available
In this guide, we introduce researchers in the behavioral sciences in general and MIS in particular to text analysis as done with latent semantic analysis (LSA). The guide contains hands-on annotated code samples in R that walk the reader through a typical process of acquiring relevant texts, creating a semantic space out of them, and then projecti...
Article
Full-text available
Word co-occurrences in text carry lexical information that can be harvested by data-mining tools such as latent semantic analysis (LSA). This research perspectives article (RPA) demonstrates the potency of using such embedded information by demonstrating that the Technology Acceptance Model (TAM) can be reconstructed significantly by analyzing unre...
Article
Full-text available
Literature reviews (LRs) are recognized for their increasing impact in the information systems literature. Methodologists have drawn attention to the question of how we can leverage the value of LRs to preserve and generate knowledge. The panelists who participated in the discussion of “Standalone Literature Reviews in IS Research: What Can Be Lear...
Conference Paper
Full-text available
Assessing the similarity of proposed theoretical constructs to each other and those previously known and studied is imperative in theoretical research. In this paper we turn to theories of similarity judgement from cognitive psychology for the understanding of the process of establishing similarity between one or more constructs. Then, guided by th...
Article
Full-text available
A central goal of behavioral medicine is the creation of evidence-based interventions for promoting behavior change. Scientific knowledge about behavior change could be more effectively accumulated using “ontologies.” In information science, an ontology is a systematic method for articulating a “controlled vocabulary” of agreed-upon terms and their...
Conference Paper
Full-text available
Poster
Full-text available
Abstract: The traditional understanding of data from Likert scales is that the quantifications involved are resulting from measures of attitude strength. Building on our recently proposed a semantic theory of survey response (STSR), we claim that survey responses tap two different sources; a mixture of attitudes plus the semantic structure of the s...
Poster
Full-text available
Abstract: The semantic theory of survey response (STSR) proposes that the prime source of statistical co-variance in survey data is the degree of semantic similarity (overlap of meaning) among the items of the survey. The present study applies STSR in an experimental way by mimicking real survey responses through the use semantic information. A sam...
Conference Paper
Full-text available
The accumulated literature base in the behavioral sciences represents a great source of knowledge on human behaviors, and yet the same literature has grown beyond human comprehension. We address this information overload problem by proposing a novel IT artifact-TheoryOn. Based on the design science paradigm, we identify five design requirements. We...
Conference Paper
Research on sensemaking in organisations and on linguistic relativity suggests that speakers of the same language may use this language in different ways to construct social realities at work. We apply a semantic theory of survey response (STSR) to explore such differences in quantitative survey research. Using text analysis algorithms, we have stu...
Article
Full-text available
People may confuse leadership with heroism due to the semantic overlap between their descriptions. This may explain some facets of fascination with leadership and obstructions to differentiated viewpoints of leadership as a group phenomenon. Building on the semantic theory of survey response (STSR), we are able to show how prevalent measures of cha...
Article
Full-text available
Is survey data a source of new information, or could surveys just be begging their questions? The authors of this opinion piece suspect that survey data in leadership research do not reflect attitudes to workplace phenomena. Instead, they may just be assessments of the similarity of the language in the applied items. In a recent article in the jour...
Article
The accumulated literature base in the behavioral sciences represents the IS discipline’s greatest source of knowledge, and yet the same literature has grown beyond human comprehension. An experiment is conducted showing the inability of experts to retrieve relevant constructs using full-text search. To address this inability to access the body of...
Article
Full-text available
Some disciplines in the social sciences rely heavily on collecting survey responses to detect empirical relationships among variables. We explored whether these relationships were a priori predictable from the semantic properties of the survey items, using language processing algorithms which are now available as new research methods. Language proc...
Technical Report
Full-text available
Through an evaluation of hospital websites, we show that 98% of the top 148 U.S. hospitals share visitor health-diagnostic meta-data with close to 200 private web tracking companies such as Google, Facebook, and ScorecardResearch, including data on current and future patients. The data sharing was pervasive, varied a great deal by hospital, and was...
Conference Paper
Full-text available
Theory identity is a fundamental problem for researchers seeking to determine theory quality, create theory ontologies and taxonomies, or perform focused theory-specific reviews and meta-analyses. We demonstrate a novel machine-learning approach to theory identification based on citation data and article features. The multi-disciplinary ecosystem o...
Conference Paper
Full-text available
Introduction to Theories in IS Research Minitrack.
Conference Paper
Full-text available
We propose an automatic construct-level citation extraction system (ACCE) to refine citations from the paper level to the construct level. This paper follows the design science paradigm (Hevner et al. 2004; March and Smith 1995; Nunamaker et al. 1991). The remaining sections are organized as follows. We first analyze the tasks involved in extractin...
Article
Full-text available
Advancement in science requires clarity of constructs.Like other fields in behavioral science, addiction research is being held back by researchers' use of different terms to mean similar things (synonymy) and the same term to mean different things (polysemy). Journals can help researchers to stay focused on novel and significant research questions...
Conference Paper
Purpose: We applied internomological network (INN) analysis, a novel approach that classifies constructs based on their underlying meaning, to constructs from the National Cancer Institute (NCI)’s Grid-enabled measures (GEM) database. Seven expert raters sorted these constructs using Michie’s Theoretical Domains Framework (TDF). Our objectives were...
Conference Paper
Full-text available
This research presents a meta-theoretic analysis of a nomological net for the purpose of identification of potential pathways for theory integration and multi-level theory development. Success in these two areas holds the potential to reduce theory clutter in IS and related social sciences. As a proof-of-concept, we identify theory domains that sha...
Conference Paper
Full-text available
As the specific parts of a theory are refined over time the core concepts and associations of a theory provide the identity of a theory’s domain. This research applies a meta-theoretical analysis to the problem of theory identity and the core-periphery distinction. The theoreticoempirical network for quantitative publications over a 30 year span of...
Article
Full-text available
When a construct such as patients' "transition to self-management" of chronic illness is studied by researchers across multiple disciplines, the meaning of key terms can become confused. This results from inherent problems in language where a term can have multiple meanings (polysemy) and different words can mean the same thing (synonymy). The aim...
Conference Paper
For the first time natural language processing approaches are applied on a large scale to psychometric methods. Psychometric methods have been applied in hundreds of thousands of published studies. This study examines automated approach to discovering behavioral knowledge that are encoded as constructs in social and behavioral science disciplines....
Article
Full-text available
As behavioral research has expanded in Information Systems and other scientific fields, researchers are recognizing that construct proliferation increases the difficulty in identifying the nomological networks of constructs pertaining to any given research question. An Inter-Nomological Network uses semantic analysis to systematically identify, cat...
Article
Full-text available
As specific parts of a theory are refined over time, the aggregated set of variables and associations of multiple theory instances provide the identity of a theory domain. This research applies a meta-theoretical analysis to the problem of theory identity and the core-periphery distinction. The theoretico-empirical network for quantitative publicat...