Chia-Yi Chiu’s research while affiliated with Columbia University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (38)


To Use or Not to Use: Method Selections in Cognitive Diagnosis
  • Article

March 2025

·

9 Reads

Measurement Interdisciplinary Research and Perspectives

Hyunjee Oh

·

Chia-Yi Chiu

A Two-Step Q-Matrix Estimation Method

October 2024

·

33 Reads

·

1 Citation

Applied Psychological Measurement

·

Chia-Yi Chiu

·

Olasumbo Oluwalana

·

[...]

·

Jiaxi Wang

Cognitive Diagnosis Models in educational measurement are restricted latent class models that describe ability in a knowledge domain as a composite of latent skills an examinee may have mastered or failed. Different combinations of skills define distinct latent proficiency classes to which examinees are assigned based on test performance. Items of cognitively diagnostic assessments are characterized by skill profiles specifying which skills are required for a correct item response. The item-skill profiles of a test form its Q-matrix. The validity of cognitive diagnosis depends crucially on the correct specification of the Q-matrix. Typically, Q-matrices are determined by curricular experts. However, expert judgment is fallible. Data-driven estimation methods have been developed with the promise of greater accuracy in identifying the Q-matrix of a test. Yet, many of the extant methods encounter computational feasibility issues either in the form of excessive amounts of CPU times or inadmissible estimates. In this article, a two-step algorithm for estimating the Q-matrix is proposed that can be used with any cognitive diagnosis model. Simulations showed that the new method outperformed extant estimation algorithms and was computationally more efficient. It was also applied to Tatsuoka’s famous fraction-subtraction data. The paper concludes with a discussion of theoretical and practical implications of the findings.


Nonparametric CD-CAT for multiple-choice items: Item selection method and Q-optimality
  • Article
  • Full-text available

May 2024

·

19 Reads

British Journal of Mathematical and Statistical Psychology

Computerized adaptive testing for cognitive diagnosis (CD‐CAT) achieves remarkable estimation efficiency and accuracy by adaptively selecting and then administering items tailored to each examinee. The process of item selection stands as a pivotal component of a CD‐CAT algorithm, with various methods having been developed for binary responses. However, multiple‐choice (MC) items, an important item type that allows for the extraction of richer diagnostic information from incorrect answers, have been underemphasized. Currently, the Jensen–Shannon divergence (JSD) index introduced by Yigit et al. ( Applied Psychological Measurement , 2019, 43, 388) is the only item selection method exclusively designed for MC items. However, the JSD index requires a large sample to calibrate item parameters, which may be infeasible when there is only a small or no calibration sample. To bridge this gap, the study first proposes a nonparametric item selection method for MC items (MC‐NPS) by implementing novel discrimination power that measures an item's ability to effectively distinguish among different attribute profiles. A Q‐optimal procedure for MC items is also developed to improve the classification during the initial phase of a CD‐CAT algorithm. The effectiveness and efficiency of the two proposed algorithms were confirmed by simulation studies.

Download


Figure 1. Posterior probabilities RMSE as a function of the model (i.e., R-DINA, DINA),
Figure 2. Estimated (i.e., í µí¼) and true (i.e., PCP) classification accuracy for each of the 460800
Recovery of Posterior Probabilities
True Classification Accuracy
Estimated Classification Accuracy Bias

+3

The Restricted DINA Model: A Comprehensive Cognitive Diagnostic Model for Classroom-Level Assessments

March 2023

·

316 Reads

·

6 Citations

Journal of Educational and Behavioral Statistics

The nonparametric classification (NPC) method has been proven to be a suitable procedure for cognitive diagnostic assessments at a classroom level. However, its nonparametric nature impedes the obtention of a model likelihood, hindering the exploration of crucial psychometric aspects, such as model fit or reliability. Reporting the reliability and validity of scores is imperative in any applied context. The present study proposes the restricted deterministic input, noisy “and” gate (R-DINA) model, a parametric cognitive diagnosis model based on the NPC method that provides the same attribute profile classifications as the nonparametric method while allowing to derive a model likelihood and, subsequently, to compute fit and reliability indices. The suitability of the new proposal is examined by means of an exhaustive simulation study and a real data illustration. The results show that the R-DINA model properly recovers the posterior probabilities of attribute mastery, thus becoming a suitable alternative for comprehensive small-scale diagnostic assessments.


Proper and Useful Distractors in Multiple-Choice Diagnostic Classification Models

March 2023

·

19 Reads

·

1 Citation

The multiple-choice (MC) item format has been implemented in educational assessments that are used across diverse content domains. MC items comprise two components: the stem that provides the context with a motivating narrative, and the collection of response options consisting of the correct answer, called the “key,” and several incorrect alternatives, the “distractors.” The MC-DINA model was the first diagnostic classification model for MC items that used distractors explicitly as potential sources of diagnostic information. However, the MC-DINA model requires that the q-vectors of the distractors are nested within each other and that of the key, which poses a serious constraint on item development. Consequently, later adaptations of the MC item format to cognitive diagnosis dropped the nestedness condition. The relaxation of the nestedness-condition, however, comes at a price: distractors may become redundant (i.e., they do not contribute to any further diagnostic differentiation between examinees), and they may induce undesirable diagnostic ambiguity (i.e., they are equally likely to be chosen by an examinee, but their q-vectors point at different diagnostic classifications). In this article, two criteria, useful and proper, are proposed to identify redundant and diagnostically ambiguous distractors.KeywordsCognitive diagnosisNonparametric cognitive diagnosisPolytomous itemsMC-DINAMC-NPC


Nonparametric Classification Method for Multiple-Choice Items in Cognitive Diagnosis

November 2022

·

28 Reads

·

6 Citations

Journal of Educational and Behavioral Statistics

The multiple-choice (MC) item format has been widely used in educational assessments across diverse content domains. MC items purportedly allow for collecting richer diagnostic information. The effectiveness and economy of administering MC items may have further contributed to their popularity not just in educational assessment. The MC item format has also been adapted to the cognitive diagnosis (CD) framework. Early approaches simply dichotomized the responses and analyzed them with a CD model for binary responses. Obviously, this strategy cannot exploit the additional diagnostic information provided by MC items. De la Torre’s MC Deterministic Inputs, Noisy “And” Gate (MC-DINA) model was the first for the explicit analysis of items having MC response format. However, as a drawback, the attribute vectors of the distractors are restricted to be nested within the key and each other. The method presented in this article for the CD of DINA items having MC response format does not require such constraints. Another contribution of the proposed method concerns its implementation using a nonparametric classification algorithm, which predestines it for use especially in small-sample settings like classrooms, where CD is most needed for monitoring instruction and student learning. In contrast, default parametric CD estimation routines that rely on EM- or MCMC-based algorithms cannot guarantee stable and reliable estimates—despite their effectiveness and efficiency when samples are large—due to computational feasibility issues caused by insufficient sample sizes. Results of simulation studies and a real-world application are also reported.


Commentary on “Extending the Basic Local Independence Model to Polytomous Data” by Stefanutti, de Chiusole, Anselmi, and Spoto

June 2022

·

20 Reads

Psychometrika

The Polytomous Local Independence Model (PoLIM) by Stefanutti, de Chiusole, Anselmi, and Spoto, is an extension of the Basic Local Independence Model (BLIM) to accommodate polytomous items. BLIM, a model for analyzing responses to binary items, is based on Knowledge Space Theory, a framework developed by cognitive scientists and mathematical psychologists for modeling human knowledge acquisition and representation. The purpose of this commentary is to show that PoLIM is simply a paraphrase of a DINA model in cognitive diagnosis for polytomous items. Specifically, BLIM is shown to be equivalent to the DINA model when the BLIM-items are conceived as binary single-attribute items, each with a distinct attribute; thus, PoLIM is equivalent to the DINA for polytomous single-attribute items, each with a distinct attribute.


Advances in CD-CAT: The General Nonparametric Item Selection Method

August 2021

·

64 Reads

·

14 Citations

Psychometrika

Computerized adaptive testing (CAT) is characterized by its high estimation efficiency and accuracy, in contrast to the traditional paper-and-pencil format. CAT specifically for cognitive diagnosis (CD-CAT) carries the same advantages and has been seen as a tool for advancing the use of cognitive diagnosis (CD) assessment for educational practice. A powerful item selection method is the key to the success of a CD-CAT program, and to date, various parametric item selection methods have been proposed and well-researched. However, these parametric methods all require large samples, to secure high-precision calibration of the items in the item bank. Thus, at present, implementation of parametric methods in small-scale educational settings, such as classroom, remains challenging. In response to this issue, Chang, Chiu, and Tsai (Appl Psychol Meas 43:543–561, 2019) proposed the nonparametric item selection (NPS) method that does not require parameter calibration and outperforms the parametric methods for settings with only small or no calibration samples. Nevertheless, the NPS method is not without limitations; extra assumptions are required to guarantee a consistent estimator of the attribute profiles when data conform to complex models. To remedy this shortcoming, the general nonparametric item selection (GNPS) method that incorporates the newly developed general NPC (GNPC) method (Chiu et al. in Psychometrika 83:355–375, 2018) as the classification vehicle is proposed in this study. The inclusion of the GNPC method in the GNPS method relaxes the assumptions imposed on the NPS method. As a result, the GNPS method can be used with any model or multiple models without abandoning the advantage of being a small-sample technique. The legitimacy of using the GNPS method in the CD-CAT system is supported by Theorem 1 proposed in the study. The efficiency and effectiveness of the GNPS method are confirmed by the simulation study that shows the outperformance of the GNPS method over the compared parametric methods when the calibration samples are small.


A Unified Theory of the Completeness of Q-Matrices for the DINA Model

April 2021

·

41 Reads

·

5 Citations

Journal of Classification

Diagnostic classification models in educational measurement describe ability in a knowledge domain as a composite of specific binary skills called “cognitive attributes,” each of which an examinee may or may not have mastered. Attribute Hierarchy Models (AHMs) account for the possibility that attributes are dependent by imposing a hierarchical structure such that mastery of one or more attributes is a prerequisite of mastering one or more other attributes. Thus, the number of meaningfully defined attribute combinations is reduced, so that constructing a complete Q-matrix may be challenging. (The Q-matrix of a cognitively diagnostic test documents which attributes are required for solving which item; the Q-matrix is said to be complete if it guarantees the identifiability of all realizable proficiency classes among examinees.) For structured Q-matrices (i.e., the item attribute profiles are restricted to reflect the hierarchy postulated to underlie the attributes), the conditions of completeness have been established. However, sometimes, a structured Q-matrix cannot be assembled because the items of the test in question have attribute profiles that do not conform to the prerequisite structure imposed by the postulated attribute hierarchy. A Q-matrix composed of such items is called “unstructured.” In this article, the completeness conditions of unstructured Q-matrices for the DINA model are presented. Specifically, there exists an entire range of Q-matrices that are all complete for DINA-AHMs. Thus, the theoretical results presented here can be combined with extant insights about Q-completeness for models without attribute hierarchies into a unified framework on the completeness of Q-matrices for the DINA model.


Citations (27)


... Additionally, it is important to note that in practical situations, the Q-matrix may not always be readily available. Various estimation techniques have been proposed in the literature (Liu et al., 2012;Chen et al., 2015Chen et al., , 2018Xu and Shang, 2018;Li et al., 2022;Gu and Xu, 2023;Ma et al., 2023b;Köhn et al., 2024). This leads to a potential future direction of developing theories and computational methods for CDMs estimation with an unknown Q-matrix within the nonparametric framework. ...

Reference:

Consistency Theory of General Nonparametric Classification Methods in Cognitive Diagnosis
A Two-Step Q-Matrix Estimation Method
  • Citing Article
  • October 2024

Applied Psychological Measurement

... Cognitive diagnostic assessment (CDA) is a cutting-edge technique in educational testing that leverages specialized statistical models, known as cognitive diagnosis models (CDMs), to uncover the specific cognitive attributes that influence how students respond to assessment items (Shi et al., 2024). These models are used to provide detailed insights into the specific attributes or skills that examinees possess or lack, offering a more granular understanding of their cognitive processes compared to traditional summative assessments (Nájera et al., 2023). The significance of CDAs lies in their ability to provide valuable diagnostic feedback that can guide educational interventions, tailoring instruction to meet the specific needs of students (Maas et al., 2024). ...

The Restricted DINA Model: A Comprehensive Cognitive Diagnostic Model for Classroom-Level Assessments

Journal of Educational and Behavioral Statistics

... Second, undesirable diagnostic ambiguity can arise from distractors that are equally likely to be chosen by an examinee, but have distinct attribute profiles pointing at different diagnostic classifications. In response to these issues, Köhn et al. (2023) proposed two criteria, 'useful' and 'proper', for detecting such problematic cases. An item is called 'useful' if and only if the q-vector of its key is not nested within any of the q-vectors of the distractors. ...

Proper and Useful Distractors in Multiple-Choice Diagnostic Classification Models
  • Citing Chapter
  • March 2023

... Not only does its performance decline, but it may also encounter convergence issues, thereby limiting its applicability in micro-educational settings. In this regard, the nonparametric classification method for MC items (MC-NPC; Wang et al., 2023) was developed as a nonparametric counterpart that estimates examinees' attribute profiles without the need for model fitting and, thus, is more suitable for small samples. ...

Nonparametric Classification Method for Multiple-Choice Items in Cognitive Diagnosis
  • Citing Article
  • November 2022

Journal of Educational and Behavioral Statistics

... Two popular examples of nonparametric cognitive diagnosis methods include the nonparametric classification (NPC) method (Chiu and Douglas, 2013) and its generalization, the general NPC (GNPC) method (Chiu et al., 2018). The GNPC method, in particular, has received increasing attention in recent years due to its effectiveness in handling complex CDMs and its good 2 performance for sample sizes (Chiu and Chang, 2021;Wang et al., 2023;Chandía et al., 2023;Ma et al., 2023a). The algorithms of the NPC and GNPC methods are straightforward to implement and require minimal computational resources, making them highly appealing for practical applications. ...

Advances in CD-CAT: The General Nonparametric Item Selection Method
  • Citing Article
  • August 2021

Psychometrika

... From this perspective, the estimation often takes place through the maximization of the marginal likelihood, relying on methods such as the expectation-maximization algorithm (de la Torre, 2011;von Davier, 2008;DiBello et al., 2007). However, the maximum likelihood-based approach often necessitates sufficiently large assessments to guarantee the reliability of the item parameter estimation, and it may either produce inaccurate estimates with small sample sizes or suffer from high computational costs (Chiu and Köhn, 2019a;Chiu et al., 2018). Moreover, the parametric CDMs involve certain parametric assumptions about the item response functions, which may raise concerns about the validity of the assumed model and the underlying process (Chiu and Douglas, 2013). ...

Consistency Theory for the General Nonparametric Classification Method
  • Citing Article
  • February 2019

Psychometrika

... Lastly, applying information theory-based selection criteria to Cognitive Diagnosis CAT (CD-CAT) systems and comparing these criteria with other measurement indices are deemed significant steps for achieving higher precision in cognitive diagnosis processes. Implementing such innovative methods more effectively in adaptive testing could contribute to improving measurement accuracy, efficiency, and reliability in this field (Anselmi et al., 2023;Chang et al., 2019;Chao & Chen, 2023;Chen & Liu, 2023;Davis et al., 2023;Davison et al., 2023;Garcia & Thomas, 2023;Gönülateş, 2019;Gu et al., 2019;He et al., 2020;Hsu et al., 2019;Jones & Brown, 2023;Kaplan et al., 2020;Kárász et al., 2023;Lim & Choe, 2023;Lin et al., 2019;Luo et al., 2019;Smith et al., 2023;Taylor, 2023;White & Black, 2023;Williams et al., 2023;Wyse, 2023;Yuan et al., 2023;Yiğit et al., 2019). ...

Nonparametric CAT for CD in Educational Settings With Small Samples
  • Citing Article
  • December 2018

Applied Psychological Measurement

... All these previous works did not directly take into account the hierarchical graphical structure of the attribute hierarchy, therefore their results can not provide explicit and sharp identifiability conditions for an HLAM. On the other hand, in the cognitive diagnostic modeling literature, researchers (Köhn and Chiu, 2019;Cai et al., 2018) recently studied the "completeness" of the Q-matrix, a relevant concept to be revisited in Section 3, under attribute hierarchy. But these results can not ensure identifying uniquely the model parameters that determine the probabilistic HLAM. ...

Attribute Hierarchy Models in Cognitive Diagnosis: Identifiability of the Latent Attribute Space and Conditions for Completeness of the Q-Matrix
  • Citing Article
  • November 2018

Journal of Classification

... Regarding the identifiability of the newly proposed RANK-G-DINA model, it is necessary to design Q-matrix based on the identifiability condition of generalized DCMs, because this model is constructed based on these DCMs. Based on previous research, the Q-matrix for generalized DCM models is complete when two specific conditions are met: (1) the Q-matrix contains all K single-attribute statements, and (2) the rank of the Q-matrix is less than the number of attributions, and all non-nested attribute pattern pairs and * correspond to distinct effect values (for detailed examples and further information, see Köhn & Chiu, 2018). In this paper, it is necessary to limit the number of attributes measured per statement to a maximum of two while maintaining the completeness of the Q-matrix. ...

How to Build a Complete Q-Matrix for a Cognitively Diagnostic Test
  • Citing Article
  • July 2018

Journal of Classification

... Templin and Bradshaw (2014) interpreted the class probabilities and concluded there was evidence of a linear attribute hierarchy. The results in Table 3 may instead provide evidence of a more complex attribute hierarchy (e.g., see Köhn & Chiu, 2018). That is, using 5% as a threshold to infer which classes are nonzero implies that Attribute 2 requires Attribute 1 is mastered, Attributes 4 and 5 require Attribute 2, and Attribute 3 requires Attributes 4 and 5 are mastered. ...

Identifiability of the Latent Attribute Space and Conditions of Q-Matrix Completeness for Attribute Hierarchy Models
  • Citing Chapter
  • April 2018