IDEAS home Printed from https://ideas.repec.org/p/ehl/lserod/122638.html
   My bibliography  Save this paper

Pairwise stochastic approximation for confirmatory factor analysis of categorical data

Author

Listed:
  • Alfonzetti, Giuseppe
  • Bellio, Ruggero
  • Chen, Yunxiao
  • Moustaki, Irini

Abstract

Pairwise likelihood is a limited-information method widely used to estimate latent variable models, including factor analysis of categorical data. It can often avoid evaluating high-dimensional integrals and, thus, is computationally more efficient than relying on the full likelihood. Despite its computational advantage, the pairwise likelihood approach can still be demanding for large-scale problems that involve many observed variables. We tackle this challenge by employing an approximation of the pairwise likelihood estimator, which is derived from an optimization procedure relying on stochastic gradients. The stochastic gradients are constructed by subsampling the pairwise log-likelihood contributions, for which the subsampling scheme controls the per-iteration computational complexity. The stochastic estimator is shown to be asymptotically equivalent to the pairwise likelihood one. However, finite-sample performance can be improved by compounding the sampling variability of the data with the uncertainty introduced by the subsampling scheme. We demonstrate the performance of the proposed method using simulation studies and two real data applications.

Suggested Citation

  • Alfonzetti, Giuseppe & Bellio, Ruggero & Chen, Yunxiao & Moustaki, Irini, 2025. "Pairwise stochastic approximation for confirmatory factor analysis of categorical data," LSE Research Online Documents on Economics 122638, London School of Economics and Political Science, LSE Library.
  • Handle: RePEc:ehl:lserod:122638
    as

    Download full text from publisher

    File URL: http://eprints.lse.ac.uk/122638/
    File Function: Open access version.
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Zhang, Siliang & Chen, Yunxiao, 2022. "Computation for latent variable model estimation: a unified stochastic proximal framework," LSE Research Online Documents on Economics 114489, London School of Economics and Political Science, LSE Library.
    2. Katsikatsou, Myrsini & Moustaki, Irini & Md Jamil, Haziq, 2022. "Pairwise likelihood estimation for confirmatory factor analysis models with categorical variables and data that are missing at random," LSE Research Online Documents on Economics 108933, London School of Economics and Political Science, LSE Library.
    3. Katsikatsou, Myrsini & Moustaki, Irini & Yang-Wallentin, Fan & Jöreskog, Karl G., 2012. "Pairwise likelihood estimation for factor analysis models with ordinal data," Computational Statistics & Data Analysis, Elsevier, vol. 56(12), pages 4243-4258.
    4. Sik-Yum Lee & Wai-Yin Poon & P. Bentler, 1992. "Structural equation models with continuous and polytomous variables," Psychometrika, Springer;The Psychometric Society, vol. 57(1), pages 89-105, March.
    5. de Leon, A.R., 2005. "Pairwise likelihood approach to grouped continuous model and its extension," Statistics & Probability Letters, Elsevier, vol. 75(1), pages 49-57, November.
    6. D. R. Cox, 2004. "A note on pseudolikelihood constructed from marginal densities," Biometrika, Biometrika Trust, vol. 91(3), pages 729-737, September.
    7. Siliang Zhang & Yunxiao Chen, 2022. "Computation for Latent Variable Model Estimation: A Unified Stochastic Proximal Framework," Psychometrika, Springer;The Psychometric Society, vol. 87(4), pages 1473-1502, December.
    8. Lewandowski, Daniel & Kurowicka, Dorota & Joe, Harry, 2009. "Generating random correlation matrices based on vines and extended onion method," Journal of Multivariate Analysis, Elsevier, vol. 100(9), pages 1989-2001, October.
    9. Myrsini Katsikatsou & Irini Moustaki, 2016. "Pairwise Likelihood Ratio Tests and Model Selection Criteria for Structural Equation Models with Ordinal Variables," Psychometrika, Springer;The Psychometric Society, vol. 81(4), pages 1046-1068, December.
    10. Vassilis Vasdekis & Silvia Cagnone & Irini Moustaki, 2012. "A Composite Likelihood Inference in Latent Variable Models for Ordinal Longitudinal Responses," Psychometrika, Springer;The Psychometric Society, vol. 77(3), pages 425-441, July.
    11. Vasdekis, Vassilis G. S. & Rizopoulos, Dimitris & Moustaki, Irini, 2014. "Weighted pairwise likelihood estimation for a general class of random effects models," LSE Research Online Documents on Economics 56733, London School of Economics and Political Science, LSE Library.
    12. Katsikatsou, Myrsini & Moustaki, Irini & Yang-Wallentin, Fan & Jöreskog, Karl G., 2012. "Pairwise likelihood estimation for factor analysis models with ordinal data," LSE Research Online Documents on Economics 43182, London School of Economics and Political Science, LSE Library.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Papageorgiou, Ioulia & Moustaki, Irini, 2019. "Sampling of pairs in pairwise likelihood estimation for latent variable models with categorical observed variables," LSE Research Online Documents on Economics 87592, London School of Economics and Political Science, LSE Library.
    2. Myrsini Katsikatsou & Irini Moustaki, 2016. "Pairwise Likelihood Ratio Tests and Model Selection Criteria for Structural Equation Models with Ordinal Variables," Psychometrika, Springer;The Psychometric Society, vol. 81(4), pages 1046-1068, December.
    3. Monia Ranalli & Roberto Rocci, 2017. "A Model-Based Approach to Simultaneous Clustering and Dimensional Reduction of Ordinal Data," Psychometrika, Springer;The Psychometric Society, vol. 82(4), pages 1007-1034, December.
    4. K. Florios & I. Moustaki & D. Rizopoulos & V. Vasdekis, 2015. "A modified weighted pairwise likelihood estimator for a class of random effects models," METRON, Springer;Sapienza Università di Roma, vol. 73(2), pages 217-228, August.
    5. Katsikatsou, Myrsini & Moustaki, Irini & Yang-Wallentin, Fan & Jöreskog, Karl G., 2012. "Pairwise likelihood estimation for factor analysis models with ordinal data," Computational Statistics & Data Analysis, Elsevier, vol. 56(12), pages 4243-4258.
    6. Florian Schuberth & Jörg Henseler & Theo K. Dijkstra, 2018. "Partial least squares path modeling using ordinal categorical indicators," Quality & Quantity: International Journal of Methodology, Springer, vol. 52(1), pages 9-35, January.
    7. Battauz, Michela & Vidoni, Paolo, 2022. "A likelihood-based boosting algorithm for factor analysis models with binary data," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    8. Jerzy Grobelny & Rafal Michalski & Gerhard-Wilhelm Weber, 2021. "Modeling human thinking about similarities by neuromatrices in the perspective of fuzzy logic," WORking papers in Management Science (WORMS) WORMS/21/09, Department of Operations Research and Business Intelligence, Wroclaw University of Science and Technology.
    9. Zhang, Haoran & Chen, Yunxiao & Li, Xiaoou, 2020. "A note on exploratory item factor analysis by singular value decomposition," LSE Research Online Documents on Economics 104166, London School of Economics and Political Science, LSE Library.
    10. Haoran Zhang & Yunxiao Chen & Xiaoou Li, 2020. "A Note on Exploratory Item Factor Analysis by Singular Value Decomposition," Psychometrika, Springer;The Psychometric Society, vol. 85(2), pages 358-372, June.
    11. Nuo Xi & Michael W. Browne, 2014. "Contributions to the Underlying Bivariate Normal Method for Factor Analyzing Ordinal Data," Journal of Educational and Behavioral Statistics, , vol. 39(6), pages 583-611, December.
    12. Na Shan & Ping-Feng Xu, 2024. "Bayesian Adaptive Lasso for Detecting Item–Trait Relationship and Differential Item Functioning in Multidimensional Item Response Theory Models," Psychometrika, Springer;The Psychometric Society, vol. 89(4), pages 1337-1365, December.
    13. Katsikatsou, Myrsini & Moustaki, Irini & Md Jamil, Haziq, 2022. "Pairwise likelihood estimation for confirmatory factor analysis models with categorical variables and data that are missing at random," LSE Research Online Documents on Economics 108933, London School of Economics and Political Science, LSE Library.
    14. Ting Wang & Carolin Strobl & Achim Zeileis & Edgar C. Merkle, 2018. "Score-Based Tests of Differential Item Functioning via Pairwise Maximum Likelihood Estimation," Psychometrika, Springer;The Psychometric Society, vol. 83(1), pages 132-155, March.
    15. Alexander Robitzsch, 2021. "A Comprehensive Simulation Study of Estimation Methods for the Rasch Model," Stats, MDPI, vol. 4(4), pages 1-23, October.
    16. Zachary F. Fisher & Kenneth A. Bollen, 2020. "An Instrumental Variable Estimator for Mixed Indicators: Analytic Derivatives and Alternative Parameterizations," Psychometrika, Springer;The Psychometric Society, vol. 85(3), pages 660-683, September.
    17. Geminiani, Elena & Marra, Giampiero & Moustaki, Irini, 2021. "Single and multiple-group penalized factor analysis: a trust-region algorithm approach with integrated automatic multiple tuning parameter selection," LSE Research Online Documents on Economics 108873, London School of Economics and Political Science, LSE Library.
    18. Motonori Oka & Kensuke Okada, 2023. "Scalable Bayesian Approach for the Dina Q-Matrix Estimation Combining Stochastic Optimization and Variational Inference," Psychometrika, Springer;The Psychometric Society, vol. 88(1), pages 302-331, March.
    19. Joe, Harry & Lee, Youngjo, 2009. "On weighting of bivariate margins in pairwise likelihood," Journal of Multivariate Analysis, Elsevier, vol. 100(4), pages 670-685, April.
    20. Bhat, Chandra R. & Astroza, Sebastian & Bhat, Aarti C. & Nagel, Kai, 2016. "Incorporating a multiple discrete-continuous outcome in the generalized heterogeneous data model: Application to residential self-selection effects analysis in an activity time-use behavior model," Transportation Research Part B: Methodological, Elsevier, vol. 91(C), pages 52-76.

    More about this item

    Keywords

    item factor analysis; structural equation models; composite likelihood; asymptotic normality; stochastic gradient descent;
    All these keywords.

    JEL classification:

    • C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ehl:lserod:122638. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: LSERO Manager (email available below). General contact details of provider: https://edirc.repec.org/data/lsepsuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.