IDEAS home Printed from https://ideas.repec.org/p/ehl/lserod/119994.html
   My bibliography  Save this paper

A two-step estimator for multilevel latent class analysis with covariates

Author

Listed:
  • Di Mari, Roberto
  • Bakk, Zsuzsa
  • Oser, Jennifer
  • Kuha, Jouni

Abstract

We propose a two-step estimator for multilevel latent class analysis (LCA) with covariates. The measurement model for observed items is estimated in its first step, and in the second step covariates are added in the model, keeping the measurement model parameters fixed. We discuss model identification, and derive an Expectation Maximization algorithm for efficient implementation of the estimator. By means of an extensive simulation study we show that (1) this approach performs similarly to existing stepwise estimators for multilevel LCA but with much reduced computing time, and (2) it yields approximately unbiased parameter estimates with a negligible loss of efficiency compared to the one-step estimator. The proposal is illustrated with a cross-national analysis of predictors of citizenship norms.

Suggested Citation

  • Di Mari, Roberto & Bakk, Zsuzsa & Oser, Jennifer & Kuha, Jouni, 2023. "A two-step estimator for multilevel latent class analysis with covariates," LSE Research Online Documents on Economics 119994, London School of Economics and Political Science, LSE Library.
  • Handle: RePEc:ehl:lserod:119994
    as

    Download full text from publisher

    File URL: http://eprints.lse.ac.uk/119994/
    File Function: Open access version.
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Catania, Leopoldo & Di Mari, Roberto, 2021. "Hierarchical Markov-switching models for multivariate integer-valued time-series," Journal of Econometrics, Elsevier, vol. 221(1), pages 118-137.
    2. Bakk, Zsuzsa & Kuha, Jouni, 2018. "Two-step estimation of models between latent classes and external variables," LSE Research Online Documents on Economics 85161, London School of Economics and Political Science, LSE Library.
    3. Gourieroux,Christian & Monfort,Alain, 1995. "Statistics and Econometric Models," Cambridge Books, Cambridge University Press, number 9780521471626.
    4. Di Zio, Marco & Guarnera, Ugo & Rocci, Roberto, 2007. "A mixture of mixture models for a classification problem: The unity measure error," Computational Statistics & Data Analysis, Elsevier, vol. 51(5), pages 2573-2585, February.
    5. D. Oakes, 1999. "Direct calculation of the information matrix via the EM," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 61(2), pages 479-482, April.
    6. Vermunt, Jeroen K., 2010. "Latent Class Modeling with Covariates: Two Improved Three-Step Approaches," Political Analysis, Cambridge University Press, vol. 18(4), pages 450-469.
    7. Davide Morselli & Stephanie Glaeser, 2018. "Economic conditions and social trust climates in Europe over ten years: An ecological analysis of change," Journal of Trust Research, Taylor & Francis Journals, vol. 8(1), pages 68-86, January.
    8. Jing Ouyang & Gongjun Xu, 2022. "Identifiability of Latent Class Models with Covariates," Psychometrika, Springer;The Psychometric Society, vol. 87(4), pages 1343-1360, December.
    9. Guan-Hua Huang & Karen Bandeen-Roche, 2004. "Building an identifiable latent class model with covariate effects on underlying and measured variables," Psychometrika, Springer;The Psychometric Society, vol. 69(1), pages 5-32, March.
    10. Leopoldo Catania & Roberto Di Mari & Paolo Santucci de Magistris, 2022. "Dynamic Discrete Mixtures for High-Frequency Prices," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(2), pages 559-577, April.
    11. Skrondal, Anders & Kuha, Jouni, 2012. "Improved regression calibration," LSE Research Online Documents on Economics 44135, London School of Economics and Political Science, LSE Library.
    12. Bartolucci, Francesco & Montanari, Giorgio E. & Pandolfi, Silvia, 2015. "Three-step estimation of latent Markov models with covariates," Computational Statistics & Data Analysis, Elsevier, vol. 83(C), pages 287-301.
    13. Anders Skrondal & Jouni Kuha, 2012. "Improved Regression Calibration," Psychometrika, Springer;The Psychometric Society, vol. 77(4), pages 649-669, October.
    14. Leonor Costa & José Dias, 2015. "What do Europeans Believe to be the Causes of Poverty? A Multilevel Analysis of Heterogeneity Within and Between Countries," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 122(1), pages 1-20, May.
    15. Eddelbuettel, Dirk & Francois, Romain, 2011. "Rcpp: Seamless R and C++ Integration," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 40(i08).
    16. Gourieroux,Christian & Monfort,Alain, 1995. "Statistics and Econometric Models 2 volume set," Cambridge Books, Cambridge University Press, number 9780521478373, July.
    17. Zsuzsa Bakk & Jouni Kuha, 2018. "Two-Step Estimation of Models Between Latent Classes and External Variables," Psychometrika, Springer;The Psychometric Society, vol. 83(4), pages 871-892, December.
    18. Antonello Maruotti & Antonio Punzo, 2021. "Initialization of Hidden Markov and Semi‐Markov Models: A Critical Evaluation of Several Strategies," International Statistical Review, International Statistical Institute, vol. 89(3), pages 447-480, December.
    19. Biernacki, Christophe & Celeux, Gilles & Govaert, Gerard, 2003. "Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 41(3-4), pages 561-575, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Roberto Mari & Antonello Maruotti, 2022. "A two-step estimator for generalized linear models for longitudinal data with time-varying measurement error," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(2), pages 273-300, June.
    2. Zsuzsa Bakk & Jouni Kuha, 2018. "Two-Step Estimation of Models Between Latent Classes and External Variables," Psychometrika, Springer;The Psychometric Society, vol. 83(4), pages 871-892, December.
    3. Jalmar M.F. Carrasco & Silvia L.P. Ferrari & Reinaldo B. Arellano-Valle, 2014. "Errors-in-variables beta regression models," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(7), pages 1530-1547, July.
    4. Giorgio Eduardo Montanari & Marco Doretti & Maria Francesca Marino, 2022. "Model-based two-way clustering of second-level units in ordinal multilevel latent Markov models," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(2), pages 457-485, June.
    5. Brian Gin & Nicholas Sim & Anders Skrondal & Sophia Rabe-Hesketh, 2020. "A Dyadic IRT Model," Psychometrika, Springer;The Psychometric Society, vol. 85(3), pages 815-836, September.
    6. Bakk, Zsuzsa & Kuha, Jouni, 2020. "Relating latent class membership to external variables: an overview," LSE Research Online Documents on Economics 107564, London School of Economics and Political Science, LSE Library.
    7. Yajing Zhu & Fiona Steele & Irini Moustaki, 2020. "A multilevel structural equation model for the interrelationships between multiple latent dimensions of childhood socio‐economic circumstances, partnership transitions and mid‐life health," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(3), pages 1029-1050, June.
    8. Zhu, Yajing & Steele, Fiona & Moustaki, Irini, 2020. "A multilevel structural equation model for the interrelationships between multiple latent dimensions of childhood socio‐economic circumstances, partnership transitions and mid‐life health," LSE Research Online Documents on Economics 103104, London School of Economics and Political Science, LSE Library.
    9. Bakk, Zsuzsa & Kuha, Jouni, 2018. "Two-step estimation of models between latent classes and external variables," LSE Research Online Documents on Economics 85161, London School of Economics and Political Science, LSE Library.
    10. Bo E. Honoré & Luojia Hu, 2023. "The COVID-19 pandemic and Asian American employment," Empirical Economics, Springer, vol. 64(5), pages 2053-2083, May.
    11. Patrick Gagliardini & Christian Gouriéroux, 2011. "Approximate Derivative Pricing for Large Classes of Homogeneous Assets with Systematic Risk," Journal of Financial Econometrics, Oxford University Press, vol. 9(2), pages 237-280, Spring.
    12. Gerhard, Frank & Hess, Dieter & Pohlmeier, Winfried, 1998. "What a Difference a Day Makes: On the Common Market Microstructure of Trading Days," CoFE Discussion Papers 98/01, University of Konstanz, Center of Finance and Econometrics (CoFE).
    13. Jennifer Oser & Marc Hooghe & Zsuzsa Bakk & Roberto Mari, 2023. "Changing citizenship norms among adolescents, 1999-2009-2016: A two-step latent class approach with measurement equivalence testing," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(5), pages 4915-4933, October.
    14. Zhongqi Liang & Qihua Wang & Yuting Wei, 2022. "Robust model selection with covariables missing at random," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(3), pages 539-557, June.
    15. Shapiro, Dmitry & Shi, Xianwen & Zillante, Artie, 2014. "Level-k reasoning in a generalized beauty contest," Games and Economic Behavior, Elsevier, vol. 86(C), pages 308-329.
    16. Prosper Dovonon & Alastair Hall & Frank Kleibergen, 2018. "Inference in Second-Order Identi?ed Models," CIRANO Working Papers 2018s-36, CIRANO.
    17. Detering, Nils & Packham, Natalie, 2018. "Model risk of contingent claims," IRTG 1792 Discussion Papers 2018-036, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    18. Aronsson, Thomas & Jenderny, Katharina & Lanot, Gauthier, 2021. "Maximum Likelihood Bunching Estimators of the ETI," Umeå Economic Studies 987, Umeå University, Department of Economics.
    19. Jean-Fran�ois Richard, 2011. "Book Review: Econometric Modeling and Inference," Econometric Reviews, Taylor & Francis Journals, vol. 30(5), pages 577-581, October.
    20. Edvard Bakhitov, 2020. "Frequentist Shrinkage under Inequality Constraints," Papers 2001.10586, arXiv.org.

    More about this item

    Keywords

    multilevel latent class analysis; covariates; stepwise estimators; pseudo ML; European Union grant (ERC; PRD; project number 101077659).; Starting Grant FIRE; PIACERI 2020/2022;
    All these keywords.

    JEL classification:

    • C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ehl:lserod:119994. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: LSERO Manager (email available below). General contact details of provider: https://edirc.repec.org/data/lsepsuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.