IDEAS home Printed from https://ideas.repec.org/p/ehl/lserod/101122.html
   My bibliography  Save this paper

Structured latent factor analysis for large-scale data: identifiability, estimability, and their implications

Author

Listed:
  • Chen, Yunxiao
  • Li, Xiaoou
  • Zhang, Siliang

Abstract

Latent factor models are widely used to measure unobserved latent traits in so- cial and behavioral sciences, including psychology, education, and marketing. When used in a conrmatory manner, design information is incorporated as zero constraints on corresponding parameters, yielding structured (conrmatory) latent factor models. In this paper, we study how such design information aects the identiability and the estimation of a structured latent factor model. Insights are gained through both asymptotic and non-asymptotic analyses. Our asymptotic results are established under a regime where both the number of manifest variables and the sample size diverge, mo- tivated by applications to large-scale data. Under this regime, we dene the structural identiability of the latent factors and establish necessary and sucient conditions that ensure structural identiability. In addition, we propose an estimator which is shown to be consistent and rate optimal when structural identiability holds. Finally, a non-asymptotic error bound is derived for this estimator, through which the eect of design information is further quantied. Our results shed lights on the design of 1 large-scale measurement in education and psychology and have important implications on measurement validity and reliability.

Suggested Citation

  • Chen, Yunxiao & Li, Xiaoou & Zhang, Siliang, 2019. "Structured latent factor analysis for large-scale data: identifiability, estimability, and their implications," LSE Research Online Documents on Economics 101122, London School of Economics and Political Science, LSE Library.
  • Handle: RePEc:ehl:lserod:101122
    as

    Download full text from publisher

    File URL: http://eprints.lse.ac.uk/101122/
    File Function: Open access version.
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. David Grayson & Herbert Marsh, 1994. "Identification with deficient rank loading matrices in confirmatory factor analysis: Multitrait-multimethod models," Psychometrika, Springer;The Psychometric Society, vol. 59(1), pages 121-134, March.
    2. Xuan Bi & Annie Qu & Junhui Wang & Xiaotong Shen, 2017. "A Group-Specific Recommender System," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1344-1353, July.
    3. Yunzhang Zhu & Xiaotong Shen & Changqing Ye, 2016. "Personalized Prediction and Sparsity Pursuit in Latent Factor Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(513), pages 241-252, March.
    4. Jay Bartroff & Matthew Finkelman & Tze Lai, 2008. "Modern Sequential Analysis and Its Applications to Computerized Adaptive Testing," Psychometrika, Springer;The Psychometric Society, vol. 73(3), pages 473-486, September.
    5. Ghosh, Malay, 1995. "Inconsistent maximum likelihood estimators for the Rasch model," Statistics & Probability Letters, Elsevier, vol. 23(2), pages 165-170, May.
    6. Paul Holland, 1990. "On the sampling theory roundations of item response theory models," Psychometrika, Springer;The Psychometric Society, vol. 55(4), pages 577-601, December.
    7. Yunxiao Chen & Xiaoou Li & Siliang Zhang, 2019. "Joint Maximum Likelihood Estimation for High-Dimensional Exploratory Item Factor Analysis," Psychometrika, Springer;The Psychometric Society, vol. 84(1), pages 124-146, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhenghao Zeng & Yuqi Gu & Gongjun Xu, 2023. "A Tensor-EM Method for Large-Scale Latent Class Analysis with Binary Responses," Psychometrika, Springer;The Psychometric Society, vol. 88(2), pages 580-612, June.
    2. Yunxiao Chen, 2020. "A Continuous-Time Dynamic Choice Measurement Model for Problem-Solving Process Data," Psychometrika, Springer;The Psychometric Society, vol. 85(4), pages 1052-1075, December.
    3. Jinsong Chen, 2020. "A Partially Confirmatory Approach to the Multidimensional Item Response Theory with the Bayesian Lasso," Psychometrika, Springer;The Psychometric Society, vol. 85(3), pages 738-774, September.
    4. Yoav Bergner & Peter Halpin & Jill-Jênn Vie, 2022. "Multidimensional Item Response Theory in the Style of Collaborative Filtering," Psychometrika, Springer;The Psychometric Society, vol. 87(1), pages 266-288, March.
    5. Yang Liu, 2020. "A Riemannian Optimization Algorithm for Joint Maximum Likelihood Estimation of High-Dimensional Exploratory Item Factor Analysis," Psychometrika, Springer;The Psychometric Society, vol. 85(2), pages 439-468, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chen, Yunxiao & Li, Xiaoou, 2022. "Determining the number of factors in high-dimensional generalized latent factor models," LSE Research Online Documents on Economics 111574, London School of Economics and Political Science, LSE Library.
    2. Yoav Bergner & Peter Halpin & Jill-Jênn Vie, 2022. "Multidimensional Item Response Theory in the Style of Collaborative Filtering," Psychometrika, Springer;The Psychometric Society, vol. 87(1), pages 266-288, March.
    3. Siliang Zhang & Yunxiao Chen, 2022. "Computation for Latent Variable Model Estimation: A Unified Stochastic Proximal Framework," Psychometrika, Springer;The Psychometric Society, vol. 87(4), pages 1473-1502, December.
    4. Zhang, Siliang & Chen, Yunxiao, 2022. "Computation for latent variable model estimation: a unified stochastic proximal framework," LSE Research Online Documents on Economics 114489, London School of Economics and Political Science, LSE Library.
    5. Victor F. Araman & René A. Caldentey, 2022. "Diffusion Approximations for a Class of Sequential Experimentation Problems," Management Science, INFORMS, vol. 68(8), pages 5958-5979, August.
    6. Robitzsch, Alexander, 2020. "About Still Nonignorable Consequences of (Partially) Ignoring Missing Item Responses in Large-scale Assessment," OSF Preprints hmy45, Center for Open Science.
    7. Stanislav Kolenikov, 2009. "Confirmatory factor analysis using confa," Stata Journal, StataCorp LP, vol. 9(3), pages 329-373, September.
    8. Dong, Ruipeng & Li, Daoji & Zheng, Zemin, 2021. "Parallel integrative learning for large-scale multi-response regression with incomplete outcomes," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    9. Xuan Bi & Gediminas Adomavicius & William Li & Annie Qu, 2022. "Improving Sales Forecasting Accuracy: A Tensor Factorization Approach with Demand Awareness," INFORMS Journal on Computing, INFORMS, vol. 34(3), pages 1644-1660, May.
    10. Alexander Robitzsch, 2021. "A Comprehensive Simulation Study of Estimation Methods for the Rasch Model," Stats, MDPI, vol. 4(4), pages 1-23, October.
    11. Xiaojun Mao & Zhonglei Wang & Shu Yang, 2023. "Matrix completion under complex survey sampling," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 75(3), pages 463-492, June.
    12. Paula Fariña & Jorge González & Ernesto San Martín, 2019. "The Use of an Identifiability-Based Strategy for the Interpretation of Parameters in the 1PL-G and Rasch Models," Psychometrika, Springer;The Psychometric Society, vol. 84(2), pages 511-528, June.
    13. Stefano Noventa & Luca Stefanutti & Giulio Vidotto, 2014. "An Analysis of Item Response Theory and Rasch Models Based on the Most Probable Distribution Method," Psychometrika, Springer;The Psychometric Society, vol. 79(3), pages 377-402, July.
    14. Zhang, Haoran & Chen, Yunxiao & Li, Xiaoou, 2020. "A note on exploratory item factor analysis by singular value decomposition," LSE Research Online Documents on Economics 104166, London School of Economics and Political Science, LSE Library.
    15. Christopher J. Urban & Daniel J. Bauer, 2021. "A Deep Learning Algorithm for High-Dimensional Exploratory Item Factor Analysis," Psychometrika, Springer;The Psychometric Society, vol. 86(1), pages 1-29, March.
    16. Klaas Sijtsma & Julius M. Pfadt, 2021. "Rejoinder: The Future of Reliability," Psychometrika, Springer;The Psychometric Society, vol. 86(4), pages 887-892, December.
    17. Ting Lin & Grace Yao, 2009. "Evaluating Item Discrimination Power of WHOQOL-BREF from an Item Response Model Perspectives," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 91(2), pages 141-153, April.
    18. Haoran Zhang & Yunxiao Chen & Xiaoou Li, 2020. "A Note on Exploratory Item Factor Analysis by Singular Value Decomposition," Psychometrika, Springer;The Psychometric Society, vol. 85(2), pages 358-372, June.
    19. Li, Xiaoou & Chen, Yunxiao & Chen, Xi & Liu, Jingchen & Ying, Zhiliang, 2021. "Optimal stopping and worker selection in crowdsourcing: an adaptive sequential probability ratio test framework," LSE Research Online Documents on Economics 100873, London School of Economics and Political Science, LSE Library.
    20. Ping Chen & Chun Wang, 2021. "Using EM Algorithm for Finite Mixtures and Reformed Supplemented EM for MIRT Calibration," Psychometrika, Springer;The Psychometric Society, vol. 86(1), pages 299-326, March.

    More about this item

    Keywords

    High-dimensional latent factor model; conrmatory factor analysis; identifiability of latent factors; structured low-rank matrix; large-scale psychological measurement; DMS-1712657;
    All these keywords.

    JEL classification:

    • C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ehl:lserod:101122. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: LSERO Manager (email available below). General contact details of provider: https://edirc.repec.org/data/lsepsuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.