IDEAS home Printed from https://ideas.repec.org/a/spr/psycho/v88y2023i1d10.1007_s11336-022-09884-4.html
   My bibliography  Save this article

Scalable Bayesian Approach for the Dina Q-Matrix Estimation Combining Stochastic Optimization and Variational Inference

Author

Listed:
  • Motonori Oka

    (The University of Tokyo)

  • Kensuke Okada

    (The University of Tokyo)

Abstract

Diagnostic classification models offer statistical tools to inspect the fined-grained attribute of respondents’ strengths and weaknesses. However, the diagnosis accuracy deteriorates when misspecification occurs in the predefined item–attribute relationship, which is encoded into a Q-matrix. To prevent such misspecification, methodologists have recently developed several Bayesian Q-matrix estimation methods for greater estimation flexibility. However, these methods become infeasible in the case of large-scale assessments with a large number of attributes and items. In this study, we focused on the deterministic inputs, noisy “and” gate (DINA) model and proposed a new framework for the Q-matrix estimation to find the Q-matrix with the maximum marginal likelihood. Based on this framework, we developed a scalable estimation algorithm for the DINA Q-matrix by constructing an iteration algorithm that utilizes stochastic optimization and variational inference. The simulation and empirical studies reveal that the proposed method achieves high-speed computation, good accuracy, and robustness to potential misspecifications, such as initial value choices and hyperparameter settings. Thus, the proposed method can be a useful tool for estimating a Q-matrix in large-scale settings.

Suggested Citation

  • Motonori Oka & Kensuke Okada, 2023. "Scalable Bayesian Approach for the Dina Q-Matrix Estimation Combining Stochastic Optimization and Variational Inference," Psychometrika, Springer;The Psychometric Society, vol. 88(1), pages 302-331, March.
  • Handle: RePEc:spr:psycho:v:88:y:2023:i:1:d:10.1007_s11336-022-09884-4
    DOI: 10.1007/s11336-022-09884-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11336-022-09884-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11336-022-09884-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Minjeong Jeon & Frank Rijmen & Sophia Rabe-Hesketh, 2017. "A Variational Maximization–Maximization Algorithm for Generalized Linear Mixed Models with Crossed Random Effects," Psychometrika, Springer;The Psychometric Society, vol. 82(3), pages 693-716, September.
    2. David M. Blei & Alp Kucukelbir & Jon D. McAuliffe, 2017. "Variational Inference: A Review for Statisticians," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 859-877, April.
    3. Zhang, Siliang & Chen, Yunxiao, 2022. "Computation for latent variable model estimation: a unified stochastic proximal framework," LSE Research Online Documents on Economics 114489, London School of Economics and Political Science, LSE Library.
    4. Kazuhiro Yamaguchi & Kensuke Okada, 2020. "Variational Bayes Inference for the DINA Model," Journal of Educational and Behavioral Statistics, , vol. 45(5), pages 569-597, October.
    5. Jonathan Templin & Laine Bradshaw, 2014. "Hierarchical Diagnostic Classification Models: A Family of Models for Estimating and Testing Attribute Hierarchies," Psychometrika, Springer;The Psychometric Society, vol. 79(2), pages 317-339, April.
    6. Frank Rijmen & Minjeong Jeon, 2013. "Erratum to: Fitting an item response theory model with random item effects across groups by a variational approximation," Annals of Operations Research, Springer, vol. 206(1), pages 663-663, July.
    7. Steven Andrew Culpepper, 2019. "Estimating the Cognitive Diagnosis $$\varvec{Q}$$ Q Matrix with Expert Knowledge: Application to the Fraction-Subtraction Dataset," Psychometrika, Springer;The Psychometric Society, vol. 84(2), pages 333-357, June.
    8. Chia-Yi Chiu & Jeffrey Douglas & Xiaodong Li, 2009. "Cluster Analysis for Cognitive Diagnosis: Theory and Applications," Psychometrika, Springer;The Psychometric Society, vol. 74(4), pages 633-665, December.
    9. Yunxiao Chen & Jingchen Liu & Gongjun Xu & Zhiliang Ying, 2015. "Statistical Analysis of Q -Matrix Based Diagnostic Classification Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 850-866, June.
    10. Gongjun Xu & Zhuoran Shang, 2018. "Identifying Latent Structures in Restricted Latent Class Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1284-1295, July.
    11. Jimmy de la Torre, 2011. "The Generalized DINA Model Framework," Psychometrika, Springer;The Psychometric Society, vol. 76(2), pages 179-199, April.
    12. K. Humphreys & D. Titterington, 2003. "Variational approximations for categorical causal modeling with latent variables," Psychometrika, Springer;The Psychometric Society, vol. 68(3), pages 391-412, September.
    13. Jimmy Torre & Jeffrey Douglas, 2004. "Higher-order latent trait models for cognitive diagnosis," Psychometrika, Springer;The Psychometric Society, vol. 69(3), pages 333-353, September.
    14. Li Cai, 2010. "Metropolis-Hastings Robbins-Monro Algorithm for Confirmatory Item Factor Analysis," Journal of Educational and Behavioral Statistics, , vol. 35(3), pages 307-335, June.
    15. Chen, Yunxiao & Liu, Jingchen & Xu, Gongjun & Ying, Zhiliang, 2015. "Statistical analysis of Q-matrix based diagnostic classification models," LSE Research Online Documents on Economics 103183, London School of Economics and Political Science, LSE Library.
    16. Li Cai, 2010. "High-dimensional Exploratory Item Factor Analysis by A Metropolis–Hastings Robbins–Monro Algorithm," Psychometrika, Springer;The Psychometric Society, vol. 75(1), pages 33-57, March.
    17. Chen-Wei Liu & Björn Andersson & Anders Skrondal, 2020. "A Constrained Metropolis–Hastings Robbins–Monro Algorithm for Q Matrix Estimation in DINA Models," Psychometrika, Springer;The Psychometric Society, vol. 85(2), pages 322-357, June.
    18. Curtis Tatsuoka, 2002. "Data analytic methods for latent partially ordered classification models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 51(3), pages 337-350, July.
    19. Yinyin Chen & Steven Culpepper & Feng Liang, 2020. "A Sparse Latent Class Model for Cognitive Diagnosis," Psychometrika, Springer;The Psychometric Society, vol. 85(1), pages 121-153, March.
    20. Guanhua Fang & Jingchen Liu & Zhiliang Ying, 2019. "On the Identifiability of Diagnostic Classification Models," Psychometrika, Springer;The Psychometric Society, vol. 84(1), pages 19-40, March.
    21. Frank Rijmen & Minjeong Jeon, 2013. "Fitting an item response theory model with random item effects across groups by a variational approximation method," Annals of Operations Research, Springer, vol. 206(1), pages 647-662, July.
    22. Yuqi Gu & Gongjun Xu, 2019. "The Sufficient and Necessary Condition for the Identifiability and Estimability of the DINA Model," Psychometrika, Springer;The Psychometric Society, vol. 84(2), pages 468-483, June.
    23. Steven Andrew Culpepper, 2019. "An Exploratory Diagnostic Model for Ordinal Responses with Binary Attributes: Identifiability and Estimation," Psychometrika, Springer;The Psychometric Society, vol. 84(4), pages 921-940, December.
    24. Jimmy Torre, 2011. "Erratum to: The Generalized DINA Model Framework," Psychometrika, Springer;The Psychometric Society, vol. 76(3), pages 510-510, July.
    25. Siliang Zhang & Yunxiao Chen, 2022. "Computation for Latent Variable Model Estimation: A Unified Stochastic Proximal Framework," Psychometrika, Springer;The Psychometric Society, vol. 87(4), pages 1473-1502, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yinghan Chen & Ying Liu & Steven Andrew Culpepper & Yuguo Chen, 2021. "Inferring the Number of Attributes for the Exploratory DINA Model," Psychometrika, Springer;The Psychometric Society, vol. 86(1), pages 30-64, March.
    2. Chen-Wei Liu & Björn Andersson & Anders Skrondal, 2020. "A Constrained Metropolis–Hastings Robbins–Monro Algorithm for Q Matrix Estimation in DINA Models," Psychometrika, Springer;The Psychometric Society, vol. 85(2), pages 322-357, June.
    3. Steven Andrew Culpepper, 2023. "A Note on Weaker Conditions for Identifying Restricted Latent Class Models for Binary Responses," Psychometrika, Springer;The Psychometric Society, vol. 88(1), pages 158-174, March.
    4. James Joseph Balamuta & Steven Andrew Culpepper, 2022. "Exploratory Restricted Latent Class Models with Monotonicity Requirements under PÒLYA–GAMMA Data Augmentation," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 903-945, September.
    5. Yuqi Gu, 2023. "Generic Identifiability of the DINA Model and Blessing of Latent Dependence," Psychometrika, Springer;The Psychometric Society, vol. 88(1), pages 117-131, March.
    6. Kazuhiro Yamaguchi & Kensuke Okada, 2020. "Variational Bayes Inference for the DINA Model," Journal of Educational and Behavioral Statistics, , vol. 45(5), pages 569-597, October.
    7. Hans Friedrich Köhn & Chia-Yi Chiu, 2021. "A Unified Theory of the Completeness of Q-Matrices for the DINA Model," Journal of Classification, Springer;The Classification Society, vol. 38(3), pages 500-518, October.
    8. Jing Ouyang & Gongjun Xu, 2022. "Identifiability of Latent Class Models with Covariates," Psychometrika, Springer;The Psychometric Society, vol. 87(4), pages 1343-1360, December.
    9. Chun Wang & Jing Lu, 2021. "Learning Attribute Hierarchies From Data: Two Exploratory Approaches," Journal of Educational and Behavioral Statistics, , vol. 46(1), pages 58-84, February.
    10. Kazuhiro Yamaguchi & Jonathan Templin, 2022. "Direct Estimation of Diagnostic Classification Model Attribute Mastery Profiles via a Collapsed Gibbs Sampling Algorithm," Psychometrika, Springer;The Psychometric Society, vol. 87(4), pages 1390-1421, December.
    11. Yinyin Chen & Steven Culpepper & Feng Liang, 2020. "A Sparse Latent Class Model for Cognitive Diagnosis," Psychometrika, Springer;The Psychometric Society, vol. 85(1), pages 121-153, March.
    12. Kazuhiro Yamaguchi & Jonathan Templin, 2022. "A Gibbs Sampling Algorithm with Monotonicity Constraints for Diagnostic Classification Models," Journal of Classification, Springer;The Classification Society, vol. 39(1), pages 24-54, March.
    13. Chenchen Ma & Jing Ouyang & Gongjun Xu, 2023. "Learning Latent and Hierarchical Structures in Cognitive Diagnosis Models," Psychometrika, Springer;The Psychometric Society, vol. 88(1), pages 175-207, March.
    14. Chenchen Ma & Jimmy Torre & Gongjun Xu, 2023. "Bridging Parametric and Nonparametric Methods in Cognitive Diagnosis," Psychometrika, Springer;The Psychometric Society, vol. 88(1), pages 51-75, March.
    15. Kazuhiro Yamaguchi, 2023. "Bayesian Analysis Methods for Two-Level Diagnosis Classification Models," Journal of Educational and Behavioral Statistics, , vol. 48(6), pages 773-809, December.
    16. Yinghan Chen & Shiyu Wang, 2023. "Bayesian Estimation of Attribute Hierarchy for Cognitive Diagnosis Models," Journal of Educational and Behavioral Statistics, , vol. 48(6), pages 810-841, December.
    17. Jimmy de la Torre & Xue-Lan Qiu & Kevin Carl Santos, 2022. "An Empirical Q-Matrix Validation Method for the Polytomous G-DINA Model," Psychometrika, Springer;The Psychometric Society, vol. 87(2), pages 693-724, June.
    18. Steven Andrew Culpepper, 2019. "An Exploratory Diagnostic Model for Ordinal Responses with Binary Attributes: Identifiability and Estimation," Psychometrika, Springer;The Psychometric Society, vol. 84(4), pages 921-940, December.
    19. Meng-Ta Chung & Shui-Lien Chen, 2021. "A Deterministic Learning Algorithm Estimating the Q-Matrix for Cognitive Diagnosis Models," Mathematics, MDPI, vol. 9(23), pages 1-11, November.
    20. Peida Zhan & Wen-Chung Wang & Xiaomin Li, 2020. "A Partial Mastery, Higher-Order Latent Structural Model for Polytomous Attributes in Cognitive Diagnostic Assessments," Journal of Classification, Springer;The Classification Society, vol. 37(2), pages 328-351, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:psycho:v:88:y:2023:i:1:d:10.1007_s11336-022-09884-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.