IDEAS home Printed from https://ideas.repec.org/a/spr/psycho/v84y2019i3d10.1007_s11336-019-09679-0.html
   My bibliography  Save this article

Bayesian Comparison of Latent Variable Models: Conditional Versus Marginal Likelihoods

Author

Listed:
  • Edgar C. Merkle

    (University of Missouri)

  • Daniel Furr

    (University of California, Berkeley)

  • Sophia Rabe-Hesketh

    (University of California, Berkeley)

Abstract

Typical Bayesian methods for models with latent variables (or random effects) involve directly sampling the latent variables along with the model parameters. In high-level software code for model definitions (using, e.g., BUGS, JAGS, Stan), the likelihood is therefore specified as conditional on the latent variables. This can lead researchers to perform model comparisons via conditional likelihoods, where the latent variables are considered model parameters. In other settings, however, typical model comparisons involve marginal likelihoods where the latent variables are integrated out. This distinction is often overlooked despite the fact that it can have a large impact on the comparisons of interest. In this paper, we clarify and illustrate these issues, focusing on the comparison of conditional and marginal Deviance Information Criteria (DICs) and Watanabe–Akaike Information Criteria (WAICs) in psychometric modeling. The conditional/marginal distinction corresponds to whether the model should be predictive for the clusters that are in the data or for new clusters (where “clusters” typically correspond to higher-level units like people or schools). Correspondingly, we show that marginal WAIC corresponds to leave-one-cluster out cross-validation, whereas conditional WAIC corresponds to leave-one-unit out. These results lead to recommendations on the general application of the criteria to models with latent variables.

Suggested Citation

  • Edgar C. Merkle & Daniel Furr & Sophia Rabe-Hesketh, 2019. "Bayesian Comparison of Latent Variable Models: Conditional Versus Marginal Likelihoods," Psychometrika, Springer;The Psychometric Society, vol. 84(3), pages 802-829, September.
  • Handle: RePEc:spr:psycho:v:84:y:2019:i:3:d:10.1007_s11336-019-09679-0
    DOI: 10.1007/s11336-019-09679-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11336-019-09679-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11336-019-09679-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Russell B. Millar, 2009. "Comparison of Hierarchical Bayesian Models for Overdispersed Count Data using DIC and Bayes' Factors," Biometrics, The International Biometric Society, vol. 65(3), pages 962-969, September.
    2. Rabe-Hesketh, Sophia & Skrondal, Anders & Pickles, Andrew, 2005. "Maximum likelihood estimation of limited and discrete dependent variable models with nested random effects," Journal of Econometrics, Elsevier, vol. 128(2), pages 301-323, October.
    3. Zhenyu Zhao & Thomas A. Severini, 2017. "Integrated likelihood computation methods," Computational Statistics, Springer, vol. 32(1), pages 281-313, March.
    4. David J. Spiegelhalter & Nicola G. Best & Bradley P. Carlin & Angelika Van Der Linde, 2002. "Bayesian measures of model complexity and fit," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(4), pages 583-639, October.
    5. Rosseel, Yves, 2012. "lavaan: An R Package for Structural Equation Modeling," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 48(i02).
    6. Lancaster, Tony, 2000. "The incidental parameter problem since 1948," Journal of Econometrics, Elsevier, vol. 95(2), pages 391-413, April.
    7. Robert Mislevy, 1986. "Bayes modal estimation in item response models," Psychometrika, Springer;The Psychometric Society, vol. 51(2), pages 177-195, June.
    8. Ian R. White, 2010. "simsum: Analyses of simulation studies including Monte Carlo error," Stata Journal, StataCorp LP, vol. 10(3), pages 369-385, September.
    9. Paul Boeck, 2008. "Random Item IRT Models," Psychometrika, Springer;The Psychometric Society, vol. 73(4), pages 533-559, December.
    10. Denwood, Matthew J., 2016. "runjags: An R Package Providing Interface Utilities, Model Templates, Parallel Computing Methods and Additional Distributions for MCMC Models in JAGS," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 71(i09).
    11. J. C. Naylor & A. F. M. Smith, 1982. "Applications of a Method for the Efficient Computation of Posterior Distributions," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 31(3), pages 214-225, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ting Wang & Benjamin Graves & Yves Rosseel & Edgar C. Merkle, 2022. "Computation and application of generalized linear mixed model derivatives using lme4," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 1173-1193, September.
    2. Arnab Kumar Maity & Sanjib Basu & Santu Ghosh, 2021. "Bayesian criterion‐based variable selection," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(4), pages 835-857, August.
    3. Justin L. Kern & Steven Andrew Culpepper, 2020. "A Restricted Four-Parameter IRT Model: The Dyad Four-Parameter Normal Ogive (Dyad-4PNO) Model," Psychometrika, Springer;The Psychometric Society, vol. 85(3), pages 575-599, September.
    4. Fang Liu & Xiaojing Wang & Roeland Hancock & Ming-Hui Chen, 2022. "Bayesian Model Assessment for Jointly Modeling Multidimensional Response Data with Application to Computerized Testing," Psychometrika, Springer;The Psychometric Society, vol. 87(4), pages 1290-1317, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sun-Joo Cho & Paul Boeck & Susan Embretson & Sophia Rabe-Hesketh, 2014. "Additive Multilevel Item Structure Models with Random Residuals: Item Modeling for Explanation and Item Generation," Psychometrika, Springer;The Psychometric Society, vol. 79(1), pages 84-104, January.
    2. Li, Yong & Yu, Jun & Zeng, Tao, 2018. "Integrated Deviance Information Criterion for Latent Variable Models," Economics and Statistics Working Papers 6-2018, Singapore Management University, School of Economics.
    3. Cho, S.-J. & Rabe-Hesketh, S., 2011. "Alternating imputation posterior estimation of models with crossed random effects," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 12-25, January.
    4. Li Cai, 2010. "High-dimensional Exploratory Item Factor Analysis by A Metropolis–Hastings Robbins–Monro Algorithm," Psychometrika, Springer;The Psychometric Society, vol. 75(1), pages 33-57, March.
    5. Li, Yong & Yu, Jun & Zeng, Tao, 2020. "Deviance information criterion for latent variable models and misspecified models," Journal of Econometrics, Elsevier, vol. 216(2), pages 450-493.
    6. Yeojin Chung & Sophia Rabe-Hesketh & Vincent Dorie & Andrew Gelman & Jingchen Liu, 2013. "A Nondegenerate Penalized Likelihood Estimator for Variance Parameters in Multilevel Models," Psychometrika, Springer;The Psychometric Society, vol. 78(4), pages 685-709, October.
    7. Merkle, Edgar C. & Steyvers, Mark & Mellers, Barbara & Tetlock, Philip E., 2017. "A neglected dimension of good forecasting judgment: The questions we choose also matter," International Journal of Forecasting, Elsevier, vol. 33(4), pages 817-832.
    8. Dexen DZ. Xi & C.B. Dean & Stephen W. Taylor, 2020. "Modeling the duration and size of extended attack wildfires as dependent outcomes," Environmetrics, John Wiley & Sons, Ltd., vol. 31(5), August.
    9. Vitoratou, Silia & Ntzoufras, Ioannis & Moustaki, Irini, 2016. "Explaining the behavior of joint and marginal Monte Carlo estimators in latent variable models with independence assumptions," LSE Research Online Documents on Economics 57685, London School of Economics and Political Science, LSE Library.
    10. Chun Wang & Steven W. Nydick, 2020. "On Longitudinal Item Response Theory Models: A Didactic," Journal of Educational and Behavioral Statistics, , vol. 45(3), pages 339-368, June.
    11. Shinya Sugawara & Yasuhiro Omori, 2017. "An Econometric Analysis of Insurance Markets with Separate Identification for Moral Hazard and Selection Problems," Computational Economics, Springer;Society for Computational Economics, vol. 50(3), pages 473-502, October.
    12. Hanneke Geerlings & Cees Glas & Wim Linden, 2011. "Modeling Rule-Based Item Generation," Psychometrika, Springer;The Psychometric Society, vol. 76(2), pages 337-359, April.
    13. Royce Anders & William Batchelder, 2015. "Cultural Consensus Theory for the Ordinal Data Case," Psychometrika, Springer;The Psychometric Society, vol. 80(1), pages 151-181, March.
    14. Stephen Schilling & R. Bock, 2005. "High-dimensional maximum marginal likelihood item factor analysis by adaptive quadrature," Psychometrika, Springer;The Psychometric Society, vol. 70(3), pages 533-555, September.
    15. Ng'ombe, John, 2019. "Economics of the Greenseeder Hand Planter, Discrete Choice Modeling, and On-Farm Field Experimentation," Thesis Commons jckt7, Center for Open Science.
    16. Aghabazaz, Zeynab & Kazemi, Iraj, 2023. "Under-reported time-varying MINAR(1) process for modeling multivariate count series," Computational Statistics & Data Analysis, Elsevier, vol. 188(C).
    17. William Greene, 2007. "Discrete Choice Modeling," Working Papers 07-6, New York University, Leonard N. Stern School of Business, Department of Economics.
    18. Wladimir Raymond & Pierre Mohnen & Franz Palm & Sybrand Schim van der Loeff, 2007. "The Behavior of the Maximum Likelihood Estimator of Dynamic Panel Data Sample Selection Models," CIRANO Working Papers 2007s-06, CIRANO.
    19. Mahdiyeh, Zahra & Kazemi, Iraj, 2019. "An innovative strategy on the construction of multivariate multimodal linear mixed-effects models," Journal of Multivariate Analysis, Elsevier, vol. 174(C).
    20. Alexander Robitzsch, 2021. "A Comprehensive Simulation Study of Estimation Methods for the Rasch Model," Stats, MDPI, vol. 4(4), pages 1-23, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:psycho:v:84:y:2019:i:3:d:10.1007_s11336-019-09679-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.