Zhiliang Ying’s research while affiliated with Columbia University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (152)


Jointly modeling responses and omitted items by a competing risk model: A survival analysis approach
  • Article
  • Full-text available

January 2025

·

20 Reads

British Journal of Mathematical and Statistical Psychology

·

·

·

[...]

·

Item response theory models are commonly adopted in educational assessment and psychological measurement. Such models need to be modified to accommodate practical situations when statistical sampling assumptions are violated. Omission is a common phenomenon in educational testing. In modern computer-based testing, we have not only examinees' responses but also their response times. This paper utilizes response time and develops a joint model of responses and response times. The new approach is analogous to those developed in survival analysis for dealing with right-censored data. In particular, a key ingredient is the introduction of the omission time (OT), which corresponds to the censoring time in survival analysis. By competing risk formulation, the proposed method provides an alternative narrative to how an item becomes answered versus omitted, depending on the competing relationship of response time and OT, so that the likelihood function can be constructed properly. The maximum likelihood estimator can be computed via the expectation-maximization algorithm. Simulation studies were conducted to evaluate the performance of the proposed method and its robustness against various mis-specifications. The method was applied to a dataset from the PISA 2015 Science Test.

Download

Q-matrix in Example 1.
Results on goodness of fit for PISA data.
The average BIAS and RMSE of item parameters under the proposed testlet DINA model.
BIAS and RMSE of π under Case 2.
Diagnostic Classification Models for Testlets: Methods and Theory

March 2024

·

75 Reads

Psychometrika

Diagnostic classification models (DCMs) have seen wide applications in educational and psychological measurement, especially in formative assessment. DCMs in the presence of testlets have been studied in recent literature. A key ingredient in the statistical modeling and analysis of testlet-based DCMs is the superposition of two latent structures, the attribute profile and the testlet effect. This paper extends the standard testlet DINA (T-DINA) model to accommodate the potential correlation between the two latent structures. Model identifiability is studied and a set of sufficient conditions are proposed. As a byproduct, the identifiability of the standard T-DINA is also established. The proposed model is applied to a dataset from the 2015 Programme for International Student Assessment. Comparisons are made with DINA and T-DINA, showing that there is substantial improvement in terms of the goodness of fit. Simulations are conducted to assess the performance of the new method under various settings.


External Correlates of Adult Digital Problem-Solving Process

January 2024

·

65 Reads

·

3 Citations

Zeitschrift für Psychologie

Computerized assessments and interactive simulation tasks are increasingly popular and afford the collection of process data, i.e., an examinee’s sequence of actions (e.g., clickstreams, keystrokes) that arises from interactions with each task. Action sequence data contain rich information on the problem-solving process but are in a nonstandard, variable-length discrete sequence format. Two methods that directly extract features from the raw action sequences, namely multidimensional scaling and sequence-to-sequence autoencoders, produce multidimensional numerical features that summarize original sequence information. This study explores the utility of action sequence features in understanding how problem-solving behavior relates to cognitive proficiencies and demographic characteristics. This is empirically illustrated with the process data from the 2012 PIAAC PSTRE digital assessment. Regularized regression results showed that action sequence features are more predictive of examinees’ demographic and cognitive characteristics compared to final outcomes. Partial least squares analysis further aided the identification of behavioral patterns systematically associated with demographic/cognitive characteristics.


Semiparametric Modeling and Analysis for Longitudinal Network Data

August 2023

·

46 Reads

We introduce a semiparametric latent space model for analyzing longitudinal network data. The model consists of a static latent space component and a time-varying node-specific baseline component. We develop a semiparametric efficient score equation for the latent space parameter by adjusting for the baseline nuisance component. Estimation is accomplished through a one-step update estimator and an appropriately penalized maximum likelihood estimator. We derive oracle error bounds for the two estimators and address identifiability concerns from a quotient manifold perspective. Our approach is demonstrated using the New York Citi Bike Dataset.


Statistical Applications to Cognitive Diagnostic Testing

November 2022

·

54 Reads

·

4 Citations

Annual Review of Statistics and Its Application

Diagnostic classification tests are designed to assess examinees’ discrete mastery status on a set of skills or attributes. Such tests have gained increasing attention in educational and psychological measurement. We review diagnostic classification models and their applications to testing and learning, discuss their statistical and machine learning connections and related challenges, and introduce some contemporary and future extensions. Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 10 is March 2023. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.


Subtask analysis of process data through a predictive model

November 2022

·

36 Reads

·

10 Citations

British Journal of Mathematical and Statistical Psychology

Response process data collected from human-computer interactive items contain detailed information about respondents' behavioural patterns and cognitive processes. Such data are valuable sources for analysing respondents' problem-solving strategies. However, the irregular data format and the complex structure make standard statistical tools difficult to apply. This article develops a computationally efficient method for exploratory analysis of such process data. The new approach segments a lengthy individual process into a sequence of short subprocesses to achieve complexity reduction, easy clustering and meaningful interpretation. Each subprocess is considered a subtask. The segmentation is based on sequential action predictability using a parsimonious predictive model combined with the Shannon entropy. Simulation studies are conducted to assess the performance of the new method. We use a case study of PIAAC 2012 to demonstrate how exploratory analysis for process data can be carried out with the new approach.


A Step-Wise Multiple Testing for Linear Regression Models with Application to the Study of Resting Energy Expenditure

September 2022

·

30 Reads

Statistics in Biosciences

Motivated by the mechanistic model of the resting energy expenditure, we present a new multiple hypothesis testing approach to evaluate organ/tissue-specific resting metabolic rates. The approach is based on generalized marginal regression estimates for a subset of coefficients along with a stepwise multiple testing procedure with a minimization–maximization of the normalized estimates (maximization over all its components and minimization over all possible choices of the subset). The approach offers a valid way to address challenges in multiple hypothesis testing on regression coefficients in linear regression analysis especially when covariates are highly correlated. Importantly, the approach yields estimates that are conditionally unbiased. In addition, the approach controls a family-wise error rate in the strong sense. The approach was used to analyze a real study on resting energy expenditure in 131 healthy adults, which yielded an interesting and surprising result of age-related decrease in resting metabolic rate of kidneys. Simulation studies were also presented with various strengths of multi-collinearity induced by pre-specified correlation in covariates.


Accurate Assessment via Process Data

August 2022

·

90 Reads

·

19 Citations

Psychometrika

Accurate assessment of a student's ability is the key task of a test. Assessments based on final responses are the standard. As the infrastructure advances, substantially more information is observed. One of such instances is the process data that is collected by computer-based interactive items and contain a student's detailed interactive processes. In this paper, we show both theoretically and with simulated and empirical data that appropriately including such information in the assessment will substantially improve relevant assessment precision.


Modeling Not-Reached Items in Timed Tests: A Response Time Censoring Approach

October 2021

·

276 Reads

·

6 Citations

Psychometrika

Time limits are imposed on many computer-based assessments, and it is common to observe examinees who run out of time, resulting in missingness due to not-reached items. The present study proposes an approach to account for the missing mechanisms of not-reached items via response time censoring. The censoring mechanism is directly incorporated into the observed likelihood of item responses and response times. A marginal maximum likelihood estimator is proposed, and its asymptotic properties are established. The proposed method was evaluated and compared to several alternative approaches that ignore the censoring through simulation studies. An empirical study based on the PISA 2018 Science Test was further conducted.


Item Response Theory -- A Statistical Framework for Educational and Psychological Measurement

August 2021

·

170 Reads

·

2 Citations

Item response theory (IRT) has become one of the most popular statistical models for psychometrics, a field of study concerned with the theory and techniques of psychological measurement. The IRT models are latent factor models tailored to the analysis, interpretation, and prediction of individuals' behaviors in answering a set of measurement items that typically involve categorical response data. Many important questions of measurement are directly or indirectly answered through the use of IRT models, including scoring individuals' test performances, validating a test scale, linking two tests, among others. This paper provides a review of item response theory, including its statistical framework and psychometric applications. We establish connections between item response theory and related topics in statistics, including empirical Bayes, nonparametric methods, matrix completion, regularized estimation, and sequential analysis. Possible future directions of IRT are discussed from the perspective of statistical learning.


Citations (70)


... Analysing behavioural data while solving problems is a crucial research topic in educational computer-based assessments. Process data have been analysed to explore student problem-solving competencies, strategies, patterns, and related background variables (Zhang et al., 2024). Recent research in problem solving has focused on identifying the links between process data and problem-solving practices and inferring latent constructs based on process data (Goldhammer et al., 2021;Greiff et al., 2016). ...

Reference:

Uncovering adults' problem‐solving patterns from process data with hidden Markov model and network analysis
External Correlates of Adult Digital Problem-Solving Process

Zeitschrift für Psychologie

... This personalized approach ultimately leads to more effective language learning and teaching outcomes. The process of CDA involves defining cognitive attributes, constructing items, creating a Q-matrix (which links test items to the attributes they measure), and employing cognitive diagnostic models (CDMs) to analyze data (Zhang et al., 2023). This approach allows for a detailed examination of language skills, which proves particularly valuable in large-scale language assessments where precise diagnostic feedback supports effective language teaching and learning initiatives. ...

Statistical Applications to Cognitive Diagnostic Testing
  • Citing Article
  • November 2022

Annual Review of Statistics and Its Application

... Based on the emission matrix, Figure 5 shows For the selected item, U01b, respondents need to finish the party invitation task. As found by Wang et al. (2023), viewing email, dragging the email and creating a new folder are the key actions for this item. Figure 6 shows that MAIL_DROP, MAIL_MOVED, and KEY-PRESS are the main actions in correct and incorrect groups; one of the main differences was that respondents in the incorrect group have a higher probability of MAIL_DROP and MAIL_MOVED produced by the hidden response states compared with correct groups. ...

Subtask analysis of process data through a predictive model
  • Citing Article
  • November 2022

British Journal of Mathematical and Statistical Psychology

... The primary purpose of measurement and evaluation practices is to validly and reliably demonstrate individuals' competencies related to the latent trait targeted for measurement (Zhang et al. 2023). To this end, drawing conclusions is commonly based on the scores obtained from the items. ...

Accurate Assessment via Process Data
  • Citing Article
  • August 2022

Psychometrika

... Similarly, the lognormal models for RTs and OTs can be substituted by more sophisticated distributions for times, e.g., mixture distributions that consider rapid responses or rapid omissions (e.g., Lu et al., 2023;Ulitzsch et al., 2024). The joint model can also be extended to cases where responses and RTs are conditionally dependent(e.g., Bolsinova et al., 2017;Guo et al., 2022;Wang & Hanson, 2005). Second, missingness in LSAs can be due to omissions as well as early quitting and NRIs (Guo et al., 2022;Lu & Wang, 2020;Ulitzsch et al., 2020b). ...

Modeling Not-Reached Items in Timed Tests: A Response Time Censoring Approach

Psychometrika

... Principal component analysis (PCA) is a widely used multivariate analysis approach, originally proposed about 100 years ago [1,2], that has found increasing applications since the widespread availability of digital computers to reduce the dimensionality of highdimensional datasets. This reduction is enabled by linear transformation from the original variables to new collective variables, so that a small number of "principal components" dominate the features of the dataset. ...

Item Response Theory -- A Statistical Framework for Educational and Psychological Measurement
  • Citing Preprint
  • August 2021

... Computer-based assessments, as highlighted by international assessments such as the Programme for International Student Assessment (PISA) and Trends in International Mathematics and Science Study (TIMSS) [1,4,5], enable the recording of behavior sequences and response processes. These sequences are rich in implicit information, including strategies, thinking processes, and metacognition [6], and marked by complexity, high noise, and irregularity [7,8], making them ill-suited for conventional analysis techniques. Converting unstructured behavior sequences into structured numerical data through feature extraction is critical for the accuracy and reliability of model analysis. ...

ProcData: An R Package for Process Data Analysis
  • Citing Article
  • August 2021

Psychometrika

... Multivariate categorical data are routinely collected in various social and behavioral sciences, such as psychological tests (Chen et al., 2019), educational assessments (Shang et al., 2021), and political surveys (Chen et al., 2021b). In these applications, it is often of great interest to use latent variables to model the unobserved constructs such as personalities, abilities, political ideologies, etc. Popular latent variable models for multivariate categorical data include the item response theory models (IRT; Embretson and Reise, 2013) and latent class models (LCM; Hagenaars and McCutcheon, 2002), which employ continuous and discrete latent variables, respectively. ...

Unfolding-model-based visualization: theory, method and applications

... However, some researchers did find relatively stable bifactor solutions across time (Greene & Eaton, 2017). Fang et al. (2021) proved that the standard bifactor model with orthogonal factors is identified if it contains at least three group factors with at least three indicators per factor. However, models with just two group factors, with fewer indicators per factor, or with correlated group factors are common in practice. ...

Identifiability of Bifactor Models

Statistica Sinica

... In this paper, we focus on analyzing student behavior in the context of school conflicts, using the data from Paluck et al. [2016]. Despite the development of numerous statistical models to analyze network-linked data in recent years [Zhang et al., 2016, Li et al., 2019, Su et al., 2019, Zhang et al., 2020, Sit and Ying, 2021, Mao et al., 2021, Mukherjee et al., 2021, Le and Li, 2022, Hayes et al., 2022, Fang et al., 2023, He et al., 2023, Lunde et al., 2023, Zhu et al., 2017, Wu and Leng, 2023, Armillotta and Fokianos, 2023, Chang and Paul, 2024, the noisy nature of the network structures in this study necessitates non-trivial generalizations of the existing literature to effectively analyze the school conflict data. This challenge motivates the development of our new model. ...

Event history analysis of dynamic networks
  • Citing Article
  • September 2020

Biometrika