IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v156y2021ics0167947320302061.html
   My bibliography  Save this article

Variable selection for generalized odds rate mixture cure models with interval-censored failure time data

Author

Listed:
  • Xu, Yang
  • Zhao, Shishun
  • Hu, Tao
  • Sun, Jianguo

Abstract

Variable selection for failure time data with a cured fraction has been discussed by many authors but most of existing methods apply only to right-censored failure time data. In this paper, we consider variable selection when one faces interval-censored failure time data arising from a general class of generalized odds rate mixture cure models, and we propose a penalized variable selection method by maximizing a derived penalized likelihood function. In the method, the sieve approach is employed to approximate the unknown function, and it is implemented using a novel penalized expectation–maximization (EM) algorithm. Also the asymptotic properties of the proposed estimators of regression parameters, including the oracle property, are obtained. Furthermore, a simulation study is conducted to assess the finite sample performance of the proposed method, and the results indicate that it works well in practice. Finally, the approach is applied to a set of real data on childhood mortality taken from the Nigeria Demographic and Health Survey.

Suggested Citation

  • Xu, Yang & Zhao, Shishun & Hu, Tao & Sun, Jianguo, 2021. "Variable selection for generalized odds rate mixture cure models with interval-censored failure time data," Computational Statistics & Data Analysis, Elsevier, vol. 156(C).
  • Handle: RePEc:eee:csdana:v:156:y:2021:i:c:s0167947320302061
    DOI: 10.1016/j.csda.2020.107115
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947320302061
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2020.107115?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Meinshausen, Nicolai & Meier, Lukas & Bühlmann, Peter, 2009. "p-Values for High-Dimensional Regression," Journal of the American Statistical Association, American Statistical Association, vol. 104(488), pages 1671-1681.
    2. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    3. Hui Zhao & Qiwei Wu & Gang Li & Jianguo Sun, 2020. "Simultaneous Estimation and Variable Selection for Interval-Censored Data With Broken Adaptive Ridge Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(529), pages 204-216, January.
    4. Scolas, Sylvie & El Ghouch, Anouar & Legrand, Catherine & Oulhaj, Abderrahim, 2016. "Variable selection in a flexible parametric mixture cure model with interval-censored data," LIDAM Reprints ISBA 2016016, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    5. Minggen Lu & Ying Zhang & Jian Huang, 2007. "Estimation of the mean function with panel count data using monotone polynomial splines," Biometrika, Biometrika Trust, vol. 94(3), pages 705-718.
    6. Sudipto Banerjee & Bradley P. Carlin, 2004. "Parametric Spatial Cure Rate Models for Interval-Censored Time-to-Relapse Data," Biometrics, The International Biometric Society, vol. 60(1), pages 268-275, March.
    7. Hu, Tao & Xiang, Liming, 2013. "Efficient estimation for semiparametric cure models with interval-censored data," Journal of Multivariate Analysis, Elsevier, vol. 121(C), pages 139-151.
    8. Liu, Hao & Shen, Yu, 2009. "A Semiparametric Regression Cure Model for Interval-Censored Data," Journal of the American Statistical Association, American Statistical Association, vol. 104(487), pages 1168-1178.
    9. Ying Wu & Richard J. Cook, 2015. "Penalized regression for interval‐censored times of disease progression: Selection of HLA markers in psoriatic arthritis," Biometrics, The International Biometric Society, vol. 71(3), pages 782-791, September.
    10. Wang, Naichen & Wang, Lianming & McMahan, Christopher S., 2015. "Regression analysis of bivariate current status data under the Gamma-frailty proportional hazards model using the EM algorithm," Computational Statistics & Data Analysis, Elsevier, vol. 83(C), pages 140-150.
    11. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    12. Lianming Wang & Christopher S. McMahan & Michael G. Hudgens & Zaina P. Qureshi, 2016. "A flexible, computationally efficient method for fitting the proportional hazards model to interval-censored data," Biometrics, The International Biometric Society, vol. 72(1), pages 222-231, March.
    13. Kneib, Thomas, 2006. "Mixed model-based inference in geoadditive hazard regression for interval-censored survival times," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 777-792, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Du, Mingyue & Zhao, Xingqiu & Sun, Jianguo, 2022. "Variable selection for case-cohort studies with informatively interval-censored outcomes," Computational Statistics & Data Analysis, Elsevier, vol. 172(C).
    2. Fengting Yi & Niansheng Tang & Jianguo Sun, 2022. "Simultaneous variable selection and estimation for joint models of longitudinal and failure time data with interval censoring," Biometrics, The International Biometric Society, vol. 78(1), pages 151-164, March.
    3. Liuquan Sun & Shuwei Li & Lianming Wang & Xinyuan Song & Xuemei Sui, 2022. "Simultaneous variable selection in regression analysis of multivariate interval‐censored data," Biometrics, The International Biometric Society, vol. 78(4), pages 1402-1413, December.
    4. Xiaoguang Wang & Ziwen Wang, 2021. "EM algorithm for the additive risk mixture cure model with interval-censored data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 27(1), pages 91-130, January.
    5. Rong Liu & Shishun Zhao & Tao Hu & Jianguo Sun, 2022. "Variable Selection for Generalized Linear Models with Interval-Censored Failure Time Data," Mathematics, MDPI, vol. 10(5), pages 1-18, February.
    6. Peter Bühlmann & Jacopo Mandozzi, 2014. "High-dimensional variable screening and bias in subsequent inference, with an empirical comparison," Computational Statistics, Springer, vol. 29(3), pages 407-430, June.
    7. Hu, Tao & Xiang, Liming, 2016. "Partially linear transformation cure models for interval-censored data," Computational Statistics & Data Analysis, Elsevier, vol. 93(C), pages 257-269.
    8. Li, Shuwei & Hu, Tao & Zhao, Xingqiu & Sun, Jianguo, 2019. "A class of semiparametric transformation cure models for interval-censored failure time data," Computational Statistics & Data Analysis, Elsevier, vol. 133(C), pages 153-165.
    9. Fan Feng & Guanghui Cheng & Jianguo Sun, 2023. "Variable Selection for Length-Biased and Interval-Censored Failure Time Data," Mathematics, MDPI, vol. 11(22), pages 1-20, November.
    10. Gabriel E Hoffman & Benjamin A Logsdon & Jason G Mezey, 2013. "PUMA: A Unified Framework for Penalized Multiple Regression Analysis of GWAS Data," PLOS Computational Biology, Public Library of Science, vol. 9(6), pages 1-19, June.
    11. Prabhashi W. Withana Gamage & Monica Chaudari & Christopher S. McMahan & Edwin H. Kim & Michael R. Kosorok, 2020. "An extended proportional hazards model for interval-censored data subject to instantaneous failures," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 26(1), pages 158-182, January.
    12. Ying Wu & Richard J. Cook, 2018. "Variable selection and prediction in biased samples with censored outcomes," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 24(1), pages 72-93, January.
    13. Feng Zou & Hengjian Cui, 2020. "Error density estimation in high-dimensional sparse linear model," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 72(2), pages 427-449, April.
    14. Gamage, Prabhashi W. Withana & McMahan, Christopher S. & Wang, Lianming & Tu, Wanzhu, 2018. "A Gamma-frailty proportional hazards model for bivariate interval-censored data," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 354-366.
    15. Liu, Xiaoyu & Xiang, Liming, 2021. "Generalized accelerated hazards mixture cure models with interval-censored data," Computational Statistics & Data Analysis, Elsevier, vol. 161(C).
    16. Li‐Pang Chen & Bangxu Qiu, 2023. "Analysis of length‐biased and partly interval‐censored survival data with mismeasured covariates," Biometrics, The International Biometric Society, vol. 79(4), pages 3929-3940, December.
    17. Yuxiang Wu & Hui Zhao & Jianguo Sun, 2023. "Group variable selection for the Cox model with interval‐censored failure time data," Biometrics, The International Biometric Society, vol. 79(4), pages 3082-3095, December.
    18. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    19. Margherita Giuzio, 2017. "Genetic algorithm versus classical methods in sparse index tracking," Decisions in Economics and Finance, Springer;Associazione per la Matematica, vol. 40(1), pages 243-256, November.
    20. Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-dimensional econometrics and regularized GMM," CeMMAP working papers CWP35/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:156:y:2021:i:c:s0167947320302061. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.