IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v171y2019icp452-474.html
   My bibliography  Save this article

Finite mixture of varying coefficient model: Estimation and component selection

Author

Listed:
  • Ye, Mao
  • Lu, Zhao-Hua
  • Li, Yimei
  • Song, Xinyuan

Abstract

Heterogeneous longitudinal data have become prevalent in medical, biological, and social studies. This paper proposes a finite mixture of varying coefficient models for handling heterogeneous populations. Each component of the mixture is modeled by a varying coefficient mixed-effect model that characterizes the longitudinal relations among variables. The identifiability of the mixture model is studied. Regression splines with equally spaced knots are used to approximate the varying coefficient functions, and a nested expectation maximization algorithm is developed to obtain the maximum likelihood estimation. We propose a penalized likelihood method based on the smoothly clipped absolute deviation (SCAD) penalty for the component selection of finite mixture of varying coefficient model. A modified BIC-based criterion based on the SCAD penalty, the BICSCAD, is proposed for selecting the penalty parameter and spline space simultaneously. The asymptotic properties of parameter estimation and component selection consistency are studied under mild conditions. Simulation studies are conducted to illustrate the component selection, parameter estimation, and inference of the proposed method. The model is then applied to a heterogeneous longitudinal data set from a study of the treatment effect on the use of heroin in the California Civil Addict Program.

Suggested Citation

  • Ye, Mao & Lu, Zhao-Hua & Li, Yimei & Song, Xinyuan, 2019. "Finite mixture of varying coefficient model: Estimation and component selection," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 452-474.
  • Handle: RePEc:eee:jmvana:v:171:y:2019:i:c:p:452-474
    DOI: 10.1016/j.jmva.2019.01.013
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X18302124
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2019.01.013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Woo, Mi-Ja & Sriram, T.N., 2006. "Robust Estimation of Mixture Complexity," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1475-1486, December.
    2. Chiang C-T. & Rice J. A & Wu C. O, 2001. "Smoothing Spline Estimation for Varying Coefficient Models With Repeatedly Measured Dependent Variables," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 605-619, June.
    3. David J. Spiegelhalter & Nicola G. Best & Bradley P. Carlin & Angelika Van Der Linde, 2002. "Bayesian measures of model complexity and fit," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(4), pages 583-639, October.
    4. R. L. Eubank & Chunfeng Huang & Y. Muñoz Maldonado & Naisyin Wang & Suojin Wang & R. J. Buchanan, 2004. "Smoothing spline estimation in varying‐coefficient models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 66(3), pages 653-667, August.
    5. Cai, Jing-Heng & Song, Xin-Yuan & Lam, Kwok-Hap & Ip, Edward Hak-Sing, 2011. "A mixture of generalized latent variable models for mixed mode and heterogeneous data," Computational Statistics & Data Analysis, Elsevier, vol. 55(11), pages 2889-2907, November.
    6. Mian Huang & Weixin Yao, 2012. "Mixture of Regression Models With Varying Mixing Proportions: A Semiparametric Approach," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(498), pages 711-724, June.
    7. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    8. Nishii, R., 1988. "Maximum likelihood principle and model selection when the true model is unspecified," Journal of Multivariate Analysis, Elsevier, vol. 27(2), pages 392-403, November.
    9. Lu, Zudi & Tjøstheim, Dag & Yao, Qiwei, 2007. "Adaptive varying-coefficient linear models for stochastic processes: asymptotic theory," LSE Research Online Documents on Economics 5411, London School of Economics and Political Science, LSE Library.
    10. John A. Rice & Colin O. Wu, 2001. "Nonparametric Mixed Effects Models for Unequally Sampled Noisy Curves," Biometrics, The International Biometric Society, vol. 57(1), pages 253-259, March.
    11. Mian Huang & Runze Li & Shaoli Wang, 2013. "Nonparametric Mixture of Regression Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(503), pages 929-941, September.
    12. Hanfeng Chen & Jiahua Chen & John D. Kalbfleisch, 2001. "A modified likelihood ratio test for homogeneity in finite mixture models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 63(1), pages 19-29.
    13. Li, Pengfei & Chen, Jiahua, 2010. "Testing the Order of a Finite Mixture," Journal of the American Statistical Association, American Statistical Association, vol. 105(491), pages 1084-1092.
    14. Jianhua Z. Huang, 2002. "Varying-coefficient models and basis function approximations for the analysis of repeated measurements," Biometrika, Biometrika Trust, vol. 89(1), pages 111-128, March.
    15. Ying Zhang & Lei Hua & Jian Huang, 2010. "A Spline‐Based Semiparametric Maximum Likelihood Estimation Method for the Cox Model with Interval‐Censored Data," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 37(2), pages 338-354, June.
    16. Hiroyuki Kasahara & Katsumi Shimotsu, 2015. "Testing the Number of Components in Normal Mixture Regression Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1632-1645, December.
    17. Wang, Shaoli & Huang, Mian & Wu, Xing & Yao, Weixin, 2016. "Mixture of functional linear models and its application to CO2-GDP functional data," Computational Statistics & Data Analysis, Elsevier, vol. 97(C), pages 1-15.
    18. Lo, Yungtai, 2005. "Likelihood ratio tests of the number of components in a normal mixture with unequal variances," Statistics & Probability Letters, Elsevier, vol. 71(3), pages 225-235, March.
    19. Mian Huang & Weixin Yao & Shaoli Wang & Yixin Chen, 2018. "Statistical Inference and Applications of Mixture of Varying Coefficient Models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 45(3), pages 618-643, September.
    20. Hansheng Wang & Runze Li & Chih-Ling Tsai, 2007. "Tuning parameter selectors for the smoothly clipped absolute deviation method," Biometrika, Biometrika Trust, vol. 94(3), pages 553-568.
    21. Lu, Minggen, 2010. "Spline-based sieve maximum likelihood estimation in the partly linear model under monotonicity constraints," Journal of Multivariate Analysis, Elsevier, vol. 101(10), pages 2528-2542, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mahdiyeh, Zahra & Kazemi, Iraj, 2019. "An innovative strategy on the construction of multivariate multimodal linear mixed-effects models," Journal of Multivariate Analysis, Elsevier, vol. 174(C).
    2. Liu, Hefei & Song, Xinyuan & Zhang, Baoxue, 2022. "Varying-coefficient hidden Markov models with zero-effect regions," Computational Statistics & Data Analysis, Elsevier, vol. 173(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yanyuan Ma & Shaoli Wang & Lin Xu & Weixin Yao, 2021. "Semiparametric mixture regression with unspecified error distributions," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(2), pages 429-444, June.
    2. Jun Jin & Tiefeng Ma & Jiajia Dai, 2021. "New efficient spline estimation for varying-coefficient models with two-step knot number selection," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(5), pages 693-712, July.
    3. Rahul Ghosal & Arnab Maity & Timothy Clark & Stefano B. Longo, 2020. "Variable selection in functional linear concurrent regression," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(3), pages 565-587, June.
    4. Zhaoping Hong & Yuao Hu & Heng Lian, 2013. "Variable selection for high-dimensional varying coefficient partially linear models via nonconcave penalty," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 76(7), pages 887-908, October.
    5. Lian, Heng, 2015. "Quantile regression for dynamic partially linear varying coefficient time series models," Journal of Multivariate Analysis, Elsevier, vol. 141(C), pages 49-66.
    6. Tang, Yanlin & Wang, Huixia Judy & Zhu, Zhongyi, 2013. "Variable selection in quantile varying coefficient models with longitudinal data," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 435-449.
    7. Wang, Shaoli & Huang, Mian & Wu, Xing & Yao, Weixin, 2016. "Mixture of functional linear models and its application to CO2-GDP functional data," Computational Statistics & Data Analysis, Elsevier, vol. 97(C), pages 1-15.
    8. Sphiwe B. Skhosana & Salomon M. Millard & Frans H. J. Kanfer, 2023. "A Novel EM-Type Algorithm to Estimate Semi-Parametric Mixtures of Partially Linear Models," Mathematics, MDPI, vol. 11(5), pages 1-20, February.
    9. Meng Li & Sijia Xiang & Weixin Yao, 2016. "Robust estimation of the number of components for mixtures of linear regression models," Computational Statistics, Springer, vol. 31(4), pages 1539-1555, December.
    10. Tang Qingguo & Cheng Longsheng, 2008. "M-estimation and B-spline approximation for varying coefficient models with longitudinal data," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 20(7), pages 611-625.
    11. Qiu, Jia & Li, Degao & You, Jinhong, 2015. "SCAD-penalized regression for varying-coefficient models with autoregressive errors," Journal of Multivariate Analysis, Elsevier, vol. 137(C), pages 100-118.
    12. Abbas Khalili & Farhad Shokoohi & Masoud Asgharian & Shili Lin, 2023. "Sparse estimation in semiparametric finite mixture of varying coefficient regression models," Biometrics, The International Biometric Society, vol. 79(4), pages 3445-3457, December.
    13. Hoshino Tadao & Yanagi Takahide, 2022. "Estimating marginal treatment effects under unobserved group heterogeneity," Journal of Causal Inference, De Gruyter, vol. 10(1), pages 197-216, January.
    14. Liu, Hefei & Song, Xinyuan & Zhang, Baoxue, 2022. "Varying-coefficient hidden Markov models with zero-effect regions," Computational Statistics & Data Analysis, Elsevier, vol. 173(C).
    15. Weihua Zhao & Weiping Zhang & Heng Lian, 2020. "Marginal quantile regression for varying coefficient models with longitudinal data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 72(1), pages 213-234, February.
    16. Hidetoshi Matsui & Toshihiro Misumi, 2015. "Variable selection for varying-coefficient models with the sparse regularization," Computational Statistics, Springer, vol. 30(1), pages 43-55, March.
    17. Long Feng & Changliang Zou & Zhaojun Wang & Xianwu Wei & Bin Chen, 2015. "Robust spline-based variable selection in varying coefficient model," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 78(1), pages 85-118, January.
    18. Liu, Lili & Lin, Lu, 2019. "Subgroup analysis for heterogeneous additive partially linear models and its application to car sales data," Computational Statistics & Data Analysis, Elsevier, vol. 138(C), pages 239-259.
    19. Peixin Zhao & Liugen Xue, 2011. "Variable selection for varying coefficient models with measurement errors," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 74(2), pages 231-245, September.
    20. Kong, Dehan & Bondell, Howard D. & Wu, Yichao, 2015. "Domain selection for the varying coefficient model via local polynomial regression," Computational Statistics & Data Analysis, Elsevier, vol. 83(C), pages 236-250.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:171:y:2019:i:c:p:452-474. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.