IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v40y2025i6d10.1007_s00180-021-01177-1.html
   My bibliography  Save this article

Robust order selection of mixtures of regression models with random effects

Author

Listed:
  • Luísa Novais

    (University of Minho)

  • Susana Faria

    (University of Minho)

Abstract

Finite mixtures of regression models with random effects are a very flexible statistical tool to model data, as these models allow to model the heterogeneity of the population and to account for multiple correlated observations from the same individual at the same time. The selection of the number of components for these models has been a long-standing challenging problem in statistics. However, the majority of the existent methods for the estimation of the number of components are not robust and, therefore, are quite sensitive to outliers. In this article we study a robust estimation of the number of components for mixtures of regression models with random effects, investigating the performance of trimmed information and classification criteria comparatively to the performance of the traditional information and classification criteria. The simulation study and a real-world application showcase the superiority of the trimmed information and classification criteria in the presence of contaminated data.

Suggested Citation

  • Luísa Novais & Susana Faria, 2025. "Robust order selection of mixtures of regression models with random effects," Computational Statistics, Springer, vol. 40(6), pages 3205-3228, July.
  • Handle: RePEc:spr:compst:v:40:y:2025:i:6:d:10.1007_s00180-021-01177-1
    DOI: 10.1007/s00180-021-01177-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-021-01177-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-021-01177-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Meng Li & Sijia Xiang & Weixin Yao, 2016. "Robust estimation of the number of components for mixtures of linear regression models," Computational Statistics, Springer, vol. 31(4), pages 1539-1555, December.
    2. Gilles Celeux & Gilda Soromenho, 1996. "An entropy criterion for assessing the number of clusters in a mixture model," Journal of Classification, Springer;The Classification Society, vol. 13(2), pages 195-212, September.
    3. Joseph E. Cavanaugh, 2004. "Criteria for Linear Model Selection Based on Kullback's Symmetric Divergence," Australian & New Zealand Journal of Statistics, Australian Statistical Publishing Association Inc., vol. 46(2), pages 257-274, June.
    4. Cavanaugh, Joseph E., 1999. "A large-sample model selection criterion based on Kullback's symmetric divergence," Statistics & Probability Letters, Elsevier, vol. 42(4), pages 333-343, May.
    5. Chun Yu & Weixin Yao & Guangren Yang, 2020. "A Selective Overview and Comparison of Robust Mixture Regression Estimators," International Statistical Review, International Statistical Institute, vol. 88(1), pages 176-202, April.
    6. Derek S. Young & David R. Hunter, 2015. "Random effects regression mixtures for analyzing infant habituation," Journal of Applied Statistics, Taylor & Francis Journals, vol. 42(7), pages 1421-1441, July.
    7. Hiroyuki Kasahara & Katsumi Shimotsu, 2015. "Testing the Number of Components in Normal Mixture Regression Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1632-1645, December.
    8. Koller, Manuel, 2016. "robustlmm: An R Package for Robust Estimation of Linear Mixed-Effects Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 75(i06).
    9. Francis K.C. Hui & David I. Warton & Scott D. Foster, 2015. "Order selection in finite mixture models: complete or observed likelihood information criteria?," Biometrika, Biometrika Trust, vol. 102(3), pages 724-730.
    10. Hamparsum Bozdogan, 1987. "Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions," Psychometrika, Springer;The Psychometric Society, vol. 52(3), pages 345-370, September.
    11. Stanley Sclove, 1987. "Application of model-selection criteria to some problems in multivariate analysis," Psychometrika, Springer;The Psychometric Society, vol. 52(3), pages 333-343, September.
    12. Neykov, N. & Filzmoser, P. & Dimova, R. & Neytchev, P., 2007. "Robust fitting of mixtures using the trimmed likelihood estimator," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 299-308, September.
    13. Nicolas Depraetere & Martina Vandebroek, 2014. "Order selection in finite mixtures of linear regressions," Statistical Papers, Springer, vol. 55(3), pages 871-911, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. David Fernando Muñoz & Verónica Andrea González-López & Jürgen Symanzik, 2025. "Editorial on the special issue on the V Latin American conference on statistical computing," Computational Statistics, Springer, vol. 40(6), pages 2849-2856, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nicolas Depraetere & Martina Vandebroek, 2014. "Order selection in finite mixtures of linear regressions," Statistical Papers, Springer, vol. 55(3), pages 871-911, August.
    2. Marianna Virtanen & Jussi Vahtera & Jenny Head & Rosemary Dray-Spira & Annaleena Okuloff & Adam G Tabak & Marcel Goldberg & Jenni Ervasti & Markus Jokela & Archana Singh-Manoux & Jaana Pentti & Marie , 2015. "Work Disability among Employees with Diabetes: Latent Class Analysis of Risk Factors in Three Prospective Cohort Studies," PLOS ONE, Public Library of Science, vol. 10(11), pages 1-14, November.
    3. Meng Li & Sijia Xiang & Weixin Yao, 2016. "Robust estimation of the number of components for mixtures of linear regression models," Computational Statistics, Springer, vol. 31(4), pages 1539-1555, December.
    4. Danks, Nicholas P. & Sharma, Pratyush N. & Sarstedt, Marko, 2020. "Model selection uncertainty and multimodel inference in partial least squares structural equation modeling (PLS-SEM)," Journal of Business Research, Elsevier, vol. 113(C), pages 13-24.
    5. Morgan, Grant B. & Hodge, Kari J. & Baggett, Aaron R., 2016. "Latent profile analysis with nonnormal mixtures: A Monte Carlo examination of model selection using fit indices," Computational Statistics & Data Analysis, Elsevier, vol. 93(C), pages 146-161.
    6. repec:jss:jstsof:06:i02 is not listed on IDEAS
    7. Michael T. Baglivio & Kevin T. Wolff, 2021. "Adverse Childhood Experiences Distinguish Violent Juvenile Sexual Offenders’ Victim Typologies," IJERPH, MDPI, vol. 18(21), pages 1-14, October.
    8. Md. Matiar Rahman & Mahbubul Muttakin & Animesh Pal & Abu Zar Shafiullah & Bidyut Baran Saha, 2019. "A Statistical Approach to Determine Optimal Models for IUPAC-Classified Adsorption Isotherms," Energies, MDPI, vol. 12(23), pages 1-34, November.
    9. Marhuenda, Yolanda & Morales, Domingo & del Carmen Pardo, María, 2014. "Information criteria for Fay–Herriot model selection," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 268-280.
    10. Kim, Daeyoung & Kim, Jong-Min & Liao, Shu-Min & Jung, Yoon-Sung, 2013. "Mixture of D-vine copulas for modeling dependence," Computational Statistics & Data Analysis, Elsevier, vol. 64(C), pages 1-19.
    11. Ana Oliveira-Brochado & Francisco Vitorino Martins, 2014. "Identifying Small Market Segments with Mixture Regression Models," International Journal of Finance, Insurance and Risk Management, International Journal of Finance, Insurance and Risk Management, vol. 4(4), pages 812-812.
    12. Po-Hsien Huang, 2017. "Asymptotics of AIC, BIC, and RMSEA for Model Selection in Structural Equation Modeling," Psychometrika, Springer;The Psychometric Society, vol. 82(2), pages 407-426, June.
    13. Qi Chen & Wen Luo & Gregory J. Palardy & Ryan Glaman & Amber McEnturff, 2017. "The Efficacy of Common Fit Indices for Enumerating Classes in Growth Mixture Models When Nested Data Structure Is Ignored," SAGE Open, , vol. 7(1), pages 21582440177, March.
    14. Hafidi, Bezza, 2006. "A small-sample criterion based on Kullback's symmetric divergence for vector autoregressive modeling," Statistics & Probability Letters, Elsevier, vol. 76(15), pages 1647-1654, September.
    15. Xiaotong Wen & Yixiang Lin & Yuchen Liu & Katie Starcevich & Fang Yuan & Xiuzhu Wang & Xiaoxu Xie & Zhaokang Yuan, 2020. "A Latent Profile Analysis of Anxiety among Junior High School Students in Less Developed Rural Regions of China," IJERPH, MDPI, vol. 17(11), pages 1-14, June.
    16. Scott, Carol F. & Bay-Cheng, Laina Y. & Nochajski, Thomas H. & Lorraine Collins, R., 2024. "Emerging adults’ social media engagement & alcohol misuse: A multidimensional, person-centered analysis of risk," Children and Youth Services Review, Elsevier, vol. 159(C).
    17. Wang, Wan-Lun & Castro, Luis M. & Lin, Tsung-I, 2017. "Automated learning of t factor analysis models with complete and incomplete data," Journal of Multivariate Analysis, Elsevier, vol. 161(C), pages 157-171.
    18. Zachary K. Collier & Haobai Zhang & Bridgette Johnson, 2021. "Finite Mixture Modeling for Program Evaluation: Resampling and Pre-processing Approaches," Evaluation Review, , vol. 45(6), pages 309-333, December.
    19. Kim, Daeyoung & Seo, Byungtae, 2014. "Assessment of the number of components in Gaussian mixture models in the presence of multiple local maximizers," Journal of Multivariate Analysis, Elsevier, vol. 125(C), pages 100-120.
    20. Bartolucci, Francesco & Bacci, Silvia & Pigini, Claudia, 2017. "Misspecification test for random effects in generalized linear finite-mixture models for clustered binary and ordered data," Econometrics and Statistics, Elsevier, vol. 3(C), pages 112-131.
    21. Terry Elrod & Gerald Häubl & Steven Tipps, 2012. "Parsimonious Structural Equation Models for Repeated Measures Data, with Application to the Study of Consumer Preferences," Psychometrika, Springer;The Psychometric Society, vol. 77(2), pages 358-387, April.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:40:y:2025:i:6:d:10.1007_s00180-021-01177-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.