IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v79y2023i3p1853-1867.html
   My bibliography  Save this article

Subset selection for linear mixed models

Author

Listed:
  • Daniel R. Kowal

Abstract

Linear mixed models (LMMs) are instrumental for regression analysis with structured dependence, such as grouped, clustered, or multilevel data. However, selection among the covariates—while accounting for this structured dependence—remains a challenge. We introduce a Bayesian decision analysis for subset selection with LMMs. Using a Mahalanobis loss function that incorporates the structured dependence, we derive optimal linear coefficients for (i) any given subset of variables and (ii) all subsets of variables that satisfy a cardinality constraint. Crucially, these estimates inherit shrinkage or regularization and uncertainty quantification from the underlying Bayesian model, and apply for any well‐specified Bayesian LMM. More broadly, our decision analysis strategy deemphasizes the role of a single “best” subset, which is often unstable and limited in its information content, and instead favors a collection of near‐optimal subsets. This collection is summarized by key member subsets and variable‐specific importance metrics. Customized subset search and out‐of‐sample approximation algorithms are provided for more scalable computing. These tools are applied to simulated data and a longitudinal physical activity dataset, and demonstrate excellent prediction, estimation, and selection ability.

Suggested Citation

  • Daniel R. Kowal, 2023. "Subset selection for linear mixed models," Biometrics, The International Biometric Society, vol. 79(3), pages 1853-1867, September.
  • Handle: RePEc:bla:biomet:v:79:y:2023:i:3:p:1853-1867
    DOI: 10.1111/biom.13707
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.13707
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.13707?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Satkartar K. Kinney & David B. Dunson, 2007. "Fixed and Random Effects Selection in Linear and Logistic Models," Biometrics, The International Biometric Society, vol. 63(3), pages 690-698, September.
    2. P. Richard Hahn & Carlos M. Carvalho, 2015. "Decoupling Shrinkage and Selection in Bayesian Linear Models: A Posterior Summary Perspective," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 435-448, March.
    3. Howard D. Bondell & Arun Krishna & Sujit K. Ghosh, 2010. "Joint Variable Selection for Fixed and Random Effects in Linear Mixed-Effects Models," Biometrics, The International Biometric Society, vol. 66(4), pages 1069-1077, December.
    4. Joseph G. Ibrahim & Hongtu Zhu & Ramon I. Garcia & Ruixin Guo, 2011. "Fixed and Random Effects Selection in Mixed Effects Models," Biometrics, The International Biometric Society, vol. 67(2), pages 495-503, June.
    5. Zhen Chen & David B. Dunson, 2003. "Random Effects Selection in Linear Mixed Models," Biometrics, The International Biometric Society, vol. 59(4), pages 762-769, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Simona Buscemi & Antonella Plaia, 2020. "Model selection in linear mixed-effect models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 104(4), pages 529-575, December.
    2. Mojtaba Ganjali & Taban Baghfalaki, 2018. "Application of Penalized Mixed Model in Identification of Genes in Yeast Cell-Cycle Gene Expression Data," Biostatistics and Biometrics Open Access Journal, Juniper Publishers Inc., vol. 6(2), pages 38-41, April.
    3. Mingan Yang & Min Wang & Guanghui Dong, 2020. "Bayesian variable selection for mixed effects model with shrinkage prior," Computational Statistics, Springer, vol. 35(1), pages 227-243, March.
    4. Yang, Mingan, 2012. "Bayesian variable selection for logistic mixed model with nonparametric random effects," Computational Statistics & Data Analysis, Elsevier, vol. 56(9), pages 2663-2674.
    5. Abhik Ghosh & Magne Thoresen, 2018. "Non-concave penalization in linear mixed-effect models and regularized selection of fixed effects," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 102(2), pages 179-210, April.
    6. Imori, Shinpei & Rosen, Dietrich von, 2015. "Covariance components selection in high-dimensional growth curve model with random coefficients," Journal of Multivariate Analysis, Elsevier, vol. 136(C), pages 86-94.
    7. Tsai-Hung Fan & Yi-Fu Wang & Yi-Chen Zhang, 2014. "Bayesian model selection in linear mixed effects models with autoregressive(p) errors using mixture priors," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(8), pages 1814-1829, August.
    8. Jieyi Yi & Niansheng Tang, 2022. "Variational Bayesian Inference in High-Dimensional Linear Mixed Models," Mathematics, MDPI, vol. 10(3), pages 1-19, January.
    9. Armagan, Artin & Dunson, David, 2011. "Sparse variational analysis of linear mixed models for large data sets," Statistics & Probability Letters, Elsevier, vol. 81(8), pages 1056-1062, August.
    10. Ping Wu & Xinchao Luo & Peirong Xu & Lixing Zhu, 2017. "New variable selection for linear mixed-effects models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(3), pages 627-646, June.
    11. Kramlinger, Peter & Schneider, Ulrike & Krivobokova, Tatyana, 2023. "Uniformly valid inference based on the Lasso in linear mixed models," Journal of Multivariate Analysis, Elsevier, vol. 198(C).
    12. Joseph G. Ibrahim & Hongtu Zhu & Ramon I. Garcia & Ruixin Guo, 2011. "Fixed and Random Effects Selection in Mixed Effects Models," Biometrics, The International Biometric Society, vol. 67(2), pages 495-503, June.
    13. Shakhawat Hossain & Trevor Thomson & Ejaz Ahmed, 2018. "Shrinkage estimation in linear mixed models for longitudinal data," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 81(5), pages 569-586, July.
    14. Mingan Yang, 2020. "Bayesian Mixed Effects Model with Variable Selection," Biostatistics and Biometrics Open Access Journal, Juniper Publishers Inc., vol. 10(2), pages 27-29, August.
    15. Xinyu Zhang & Hua Liang & Anna Liu & David Ruppert & Guohua Zou, 2016. "Selection Strategy for Covariance Structure of Random Effects in Linear Mixed-effects Models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(1), pages 275-291, March.
    16. Zangdong He & Wanzhu Tu & Sijian Wang & Haoda Fu & Zhangsheng Yu, 2015. "Simultaneous variable selection for joint models of longitudinal and survival outcomes," Biometrics, The International Biometric Society, vol. 71(1), pages 178-187, March.
    17. Jiehuan Sun & Jose D. Herazo‐Maya & Philip L. Molyneaux & Toby M. Maher & Naftali Kaminski & Hongyu Zhao, 2019. "Regularized Latent Class Model for Joint Analysis of High‐Dimensional Longitudinal Biomarkers and a Time‐to‐Event Outcome," Biometrics, The International Biometric Society, vol. 75(1), pages 69-77, March.
    18. Jan Pablo Burgard & Joscha Krause & Ralf Münnich, 2019. "Penalized Small Area Models for the Combination of Unit- and Area-level Data," Research Papers in Economics 2019-05, University of Trier, Department of Economics.
    19. Rohart, Florian & San Cristobal, Magali & Laurent, Béatrice, 2014. "Selection of fixed effects in high dimensional linear mixed models using a multicycle ECM algorithm," Computational Statistics & Data Analysis, Elsevier, vol. 80(C), pages 209-222.
    20. Ollier, Edouard, 2022. "Fast selection of nonlinear mixed effect models using penalized likelihood," Computational Statistics & Data Analysis, Elsevier, vol. 167(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:79:y:2023:i:3:p:1853-1867. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.