IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v117y2018icp154-161.html
   My bibliography  Save this article

A scalable and efficient covariate selection criterion for mixed effects regression models with unknown random effects structure

Author

Listed:
  • Craiu, Radu V.
  • Duchesne, Thierry

Abstract

A new model selection criterion for mixed effects regression models is introduced. The criterion is computable even when the model is fitted with a two-step method or when the structure and the distribution of the random effects are unknown. The criterion is especially useful in the early stage of the model building process when one needs to decide which covariates should be included in a mixed effects regression model, but has no knowledge of the random effect structure. This is particularly relevant in substantive fields where variable selection is guided by information criteria rather than regularization. The calculation of the criterion requires only the evaluation of cluster-level log-likelihoods and does not rely on heavy numerical integration. Theoretical and numerical arguments are used to justify the method and its usefulness is illustrated by analysing data from a youth behaviour study.

Suggested Citation

  • Craiu, Radu V. & Duchesne, Thierry, 2018. "A scalable and efficient covariate selection criterion for mixed effects regression models with unknown random effects structure," Computational Statistics & Data Analysis, Elsevier, vol. 117(C), pages 154-161.
  • Handle: RePEc:eee:csdana:v:117:y:2018:i:c:p:154-161
    DOI: 10.1016/j.csda.2017.07.011
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S016794731730169X
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2017.07.011?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Bates, Douglas & Mächler, Martin & Bolker, Ben & Walker, Steve, 2015. "Fitting Linear Mixed-Effects Models Using lme4," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 67(i01).
    2. Cavanaugh, Joseph E., 1997. "Unifying the derivations for the Akaike and corrected Akaike information criteria," Statistics & Probability Letters, Elsevier, vol. 33(2), pages 201-208, April.
    3. Yu, Dalei & Zhang, Xinyu & Yau, Kelvin K.W., 2013. "Information based model selection criteria for generalized linear mixed models with unknown variance component parameters," Journal of Multivariate Analysis, Elsevier, vol. 116(C), pages 245-262.
    4. Joseph G. Ibrahim & Hongtu Zhu & Ramon I. Garcia & Ruixin Guo, 2011. "Fixed and Random Effects Selection in Mixed Effects Models," Biometrics, The International Biometric Society, vol. 67(2), pages 495-503, June.
    5. Inna Chervoneva & Boris Iglewicz & Terry Hyslop, 2006. "A General Approach for Two-Stage Analysis of Multilevel Clustered Non-Gaussian Data," Biometrics, The International Biometric Society, vol. 62(3), pages 752-759, September.
    6. Kristensen, Kasper & Nielsen, Anders & Berg, Casper W. & Skaug, Hans & Bell, Bradley M., 2016. "TMB: Automatic Differentiation and Laplace Approximation," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 70(i05).
    7. M. C. Donohue & R. Overholser & R. Xu & F. Vaida, 2011. "Conditional Akaike information under generalized linear and proportional hazards mixed models," Biometrika, Biometrika Trust, vol. 98(3), pages 685-700.
    8. Sonja Greven & Thomas Kneib, 2010. "On the behaviour of marginal and conditional AIC in linear mixed models," Biometrika, Biometrika Trust, vol. 97(4), pages 773-789.
    9. Molenberghs, Geert & Verbeke, Geert & Iddi, Samuel, 2011. "Pseudo-likelihood methodology for partitioned large and complex samples," Statistics & Probability Letters, Elsevier, vol. 81(7), pages 892-901, July.
    10. Florin Vaida & Suzette Blanchard, 2005. "Conditional Akaike information for mixed-effects models," Biometrika, Biometrika Trust, vol. 92(2), pages 351-370, June.
    11. Renard, Didier & Molenberghs, Geert & Geys, Helena, 2004. "A pairwise likelihood approach to estimation in multilevel probit models," Computational Statistics & Data Analysis, Elsevier, vol. 44(4), pages 649-667, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Øystein Sørensen & Anders M. Fjell & Kristine B. Walhovd, 2023. "Longitudinal Modeling of Age-Dependent Latent Traits with Generalized Additive Latent and Mixed Models," Psychometrika, Springer;The Psychometric Society, vol. 88(2), pages 456-486, June.
    2. Dalei Yu, 2016. "Conditional Akaike Information Criteria for a Class of Poisson Mixture Models with Random Effects," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(4), pages 1214-1235, December.
    3. Simona Buscemi & Antonella Plaia, 2020. "Model selection in linear mixed-effect models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 104(4), pages 529-575, December.
    4. Philipp F. M. Baumann & Enzo Rossi & Alexander Volkmann, 2020. "What Drives Inflation and How: Evidence from Additive Mixed Models Selected by cAIC," Papers 2006.06274, arXiv.org, revised Aug 2022.
    5. Kawakubo, Yuki & Kubokawa, Tatsuya, 2014. "Modified conditional AIC in linear mixed models," Journal of Multivariate Analysis, Elsevier, vol. 129(C), pages 44-56.
    6. Braun, Julia & Sabanés Bové, Daniel & Held, Leonhard, 2014. "Choice of generalized linear mixed models using predictive crossvalidation," Computational Statistics & Data Analysis, Elsevier, vol. 75(C), pages 190-202.
    7. Thanapong Champahom & Sajjakaj Jomnonkwao & Chinnakrit Banyong & Watanya Nambulee & Ampol Karoonsoontawong & Vatanavongs Ratanavaraha, 2021. "Analysis of Crash Frequency and Crash Severity in Thailand: Hierarchical Structure Models Approach," Sustainability, MDPI, vol. 13(18), pages 1-19, September.
    8. Chan, Moon-tong & Yu, Dalei & Yau, Kelvin K.W., 2015. "Multilevel cumulative logistic regression model with random effects: Application to British social attitudes panel survey data," Computational Statistics & Data Analysis, Elsevier, vol. 88(C), pages 173-186.
    9. Overholser, Rosanna & Xu, Ronghui, 2014. "Effective degrees of freedom and its application to conditional AIC for linear mixed-effects models with correlated error structures," Journal of Multivariate Analysis, Elsevier, vol. 132(C), pages 160-170.
    10. Yuki Kawakubo & Shonosuke Sugasawa & Tatsuya Kubokawa, 2014. "Conditional AIC under Covariate Shift with Application to Small Area Prediction," CIRJE F-Series CIRJE-F-944, CIRJE, Faculty of Economics, University of Tokyo.
    11. Colin Griesbach & Andreas Groll & Elisabeth Bergherr, 2021. "Addressing cluster-constant covariates in mixed effects models via likelihood-based boosting techniques," PLOS ONE, Public Library of Science, vol. 16(7), pages 1-17, July.
    12. Hang Lai & Xin Gao, 2023. "Modified BIC Criterion for Model Selection in Linear Mixed Models," Mathematics, MDPI, vol. 11(9), pages 1-26, May.
    13. Yu, Dalei & Zhang, Xinyu & Yau, Kelvin K.W., 2013. "Information based model selection criteria for generalized linear mixed models with unknown variance component parameters," Journal of Multivariate Analysis, Elsevier, vol. 116(C), pages 245-262.
    14. Sun-Joo Cho & Sarah Brown-Schmidt & Woo-yeol Lee, 2018. "Autoregressive Generalized Linear Mixed Effect Models with Crossed Random Effects: An Application to Intensive Binary Time Series Eye-Tracking Data," Psychometrika, Springer;The Psychometric Society, vol. 83(3), pages 751-771, September.
    15. Stefano Marchetti & Maciej Beręsewicz & Nicola Salvati & Marcin Szymkowiak & Łukasz Wawrowski, 2018. "The use of a three‐level M‐quantile model to map poverty at local administrative unit 1 in Poland," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(4), pages 1077-1104, October.
    16. Mojtaba Ganjali & Taban Baghfalaki, 2018. "Application of Penalized Mixed Model in Identification of Genes in Yeast Cell-Cycle Gene Expression Data," Biostatistics and Biometrics Open Access Journal, Juniper Publishers Inc., vol. 6(2), pages 38-41, April.
    17. Jonathan Bradley & Noel Cressie & Tao Shi, 2015. "Comparing and selecting spatial predictors using local criteria," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 24(1), pages 1-28, March.
    18. Alexander Robitzsch, 2021. "A Comprehensive Simulation Study of Estimation Methods for the Rasch Model," Stats, MDPI, vol. 4(4), pages 1-23, October.
    19. Wei, Yuting & Wang, Qihua & Duan, Xiaogang & Qin, Jing, 2021. "Bias-corrected Kullback–Leibler distance criterion based model selection with covariables missing at random," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    20. David L. Miller & Richard Glennie & Andrew E. Seaton, 2020. "Understanding the Stochastic Partial Differential Equation Approach to Smoothing," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 25(1), pages 1-16, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:117:y:2018:i:c:p:154-161. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.