IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v190y2022ics0047259x22000069.html
   My bibliography  Save this article

Mixture regression for longitudinal data based on joint mean–covariance model

Author

Listed:
  • Yu, Jing
  • Nummi, Tapio
  • Pan, Jianxin

Abstract

In the process of modeling longitudinal data, we focus on the case that the studied population is comprised of different groups of individuals and individuals within the same group share the similar kind of mean progression trajectories, where finite mixture models (FMM) are often used to address this kind of unobserved heterogeneity in terms of mean. Existing methods, such as parametric and semiparametric mixture regression, usually model the mean in each subpopulation with assumption that observations sharing a common trajectory are independent or their covariance structure is pre-specified, but less research considers modeling of covariance structures while accounting for heterogeneity. In this paper, we introduce a joint model which models the mean and covariance structures simultaneously in a finite normal mixture regression, demonstrating how important the within-subject correlation is in clustering longitudinal data. Model parameters are estimated with an iteratively re-weighted least squares EM (IRLS-EM) algorithm. Our estimators are shown to be consistent and asymptotically normal. We can identify different mean trajectories and covariance structures in all clusters. Simulations show that the proposed method performs well and gives more accurate clustering results by introducing covariance modeling. Real data analysis is also used to illustrate the usefulness of the proposed method, and it presents good performance in clustering COVID-19 deaths for European countries in terms of progression trajectory.

Suggested Citation

  • Yu, Jing & Nummi, Tapio & Pan, Jianxin, 2022. "Mixture regression for longitudinal data based on joint mean–covariance model," Journal of Multivariate Analysis, Elsevier, vol. 190(C).
  • Handle: RePEc:eee:jmvana:v:190:y:2022:i:c:s0047259x22000069
    DOI: 10.1016/j.jmva.2022.104956
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X22000069
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2022.104956?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gilles Celeux & Gilda Soromenho, 1996. "An entropy criterion for assessing the number of clusters in a mixture model," Journal of Classification, Springer;The Classification Society, vol. 13(2), pages 195-212, September.
    2. Jianxin Pan, 2003. "On modelling mean-covariance structures in longitudinal studies," Biometrika, Biometrika Trust, vol. 90(1), pages 239-244, March.
    3. Mohsen Pourahmadi, 2007. "Cholesky Decompositions and Estimation of A Covariance Matrix: Orthogonality of Variance--Correlation Parameters," Biometrika, Biometrika Trust, vol. 94(4), pages 1006-1013.
    4. Mian Huang & Runze Li & Shaoli Wang, 2013. "Nonparametric Mixture of Regression Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(503), pages 929-941, September.
    5. Huajun Ye & Jianxin Pan, 2006. "Modelling of covariance structures in generalised estimating equations for longitudinal data," Biometrika, Biometrika Trust, vol. 93(4), pages 927-941, December.
    6. Khalili, Abbas & Chen, Jiahua, 2007. "Variable Selection in Finite Mixture of Regression Models," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 1025-1038, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Guney, Yesim & Arslan, Olcay & Yavuz, Fulya Gokalp, 2022. "Robust estimation in multivariate heteroscedastic regression models with autoregressive covariance structures using EM algorithm," Journal of Multivariate Analysis, Elsevier, vol. 191(C).
    2. Xueying Zheng & Wing Fung & Zhongyi Zhu, 2013. "Robust estimation in joint mean–covariance regression model for longitudinal data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 65(4), pages 617-638, August.
    3. Dengke Xu & Zhongzhan Zhang & Liucang Wu, 2014. "Bayesian analysis of joint mean and covariance models for longitudinal data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(11), pages 2504-2514, November.
    4. Feng, Sanying & Lian, Heng & Xue, Liugen, 2016. "A new nested Cholesky decomposition and estimation for the covariance matrix of bivariate longitudinal data," Computational Statistics & Data Analysis, Elsevier, vol. 102(C), pages 98-109.
    5. Abbas Khalili & Farhad Shokoohi & Masoud Asgharian & Shili Lin, 2023. "Sparse estimation in semiparametric finite mixture of varying coefficient regression models," Biometrics, The International Biometric Society, vol. 79(4), pages 3445-3457, December.
    6. Ahonen, Ilmari & Nevalainen, Jaakko & Larocque, Denis, 2019. "Prediction with a flexible finite mixture-of-regressions," Computational Statistics & Data Analysis, Elsevier, vol. 132(C), pages 212-224.
    7. Lam, Clifford, 2020. "High-dimensional covariance matrix estimation," LSE Research Online Documents on Economics 101667, London School of Economics and Political Science, LSE Library.
    8. Yixin Chen & Weixin Yao, 2017. "Unified Inference for Sparse and Dense Longitudinal Data in Time-varying Coefficient Models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 44(1), pages 268-284, March.
    9. Xu, Lin & Xiang, Sijia & Yao, Weixin, 2019. "Robust maximum Lq-likelihood estimation of joint mean–covariance models for longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 397-411.
    10. Marco Berrettini & Giuliano Galimberti & Saverio Ranciati, 2023. "Semiparametric finite mixture of regression models with Bayesian P-splines," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 17(3), pages 745-775, September.
    11. Daniels, M.J. & Pourahmadi, M., 2009. "Modeling covariance matrices via partial autocorrelations," Journal of Multivariate Analysis, Elsevier, vol. 100(10), pages 2352-2363, November.
    12. Lin, Lijing & Higham, Nicholas J. & Pan, Jianxin, 2014. "Covariance structure regularization via entropy loss function," Computational Statistics & Data Analysis, Elsevier, vol. 72(C), pages 315-327.
    13. Lee, Keunbaik & Baek, Changryong & Daniels, Michael J., 2017. "ARMA Cholesky factor models for the covariance matrix of linear models," Computational Statistics & Data Analysis, Elsevier, vol. 115(C), pages 267-280.
    14. Hongmei Lin & Wenchao Xu & Riquan Zhang & Jianhong Shi & Yuedong Wang, 2017. "Multiple-index varying-coefficient models for longitudinal data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 44(11), pages 1960-1978, August.
    15. Luo, Renwen & Pan, Jianxin, 2022. "Conditional generalized estimating equations of mean-variance-correlation for clustered data," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    16. Lu, Fei & Xue, Liugen & Cai, Xiong, 2020. "GEE analysis in joint mean-covariance model for longitudinal data," Statistics & Probability Letters, Elsevier, vol. 160(C).
    17. Julian Aichholzer & Sylvia Kritzinger & Carolina Plescia, 2021. "National identity profiles and support for the European Union," European Union Politics, , vol. 22(2), pages 293-315, June.
    18. Adrian Bruhin & Ernst Fehr & Daniel Schunk, 2019. "The many Faces of Human Sociality: Uncovering the Distribution and Stability of Social Preferences," Journal of the European Economic Association, European Economic Association, vol. 17(4), pages 1025-1069.
    19. Nicoleta Serban & Huijing Jiang, 2012. "Multilevel Functional Clustering Analysis," Biometrics, The International Biometric Society, vol. 68(3), pages 805-814, September.
    20. Jing Lv & Chaohui Guo, 2017. "Efficient parameter estimation via modified Cholesky decomposition for quantile regression with longitudinal data," Computational Statistics, Springer, vol. 32(3), pages 947-975, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:190:y:2022:i:c:s0047259x22000069. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.