Simultaneous variable selection in regression analysis of multivariate interval‐censored data

Author

Listed:
  • Liuquan Sun
  • Shuwei Li
  • Lianming Wang
  • Xinyuan Song
  • Xuemei Sui

Abstract

Multivariate interval‐censored data arise when each subject under study can potentially experience multiple events and the onset time of each event is not observed exactly but is known to lie in a certain time interval formed by adjacent examination times with changed statuses of the event. This type of incomplete and complex data structure poses a substantial challenge in practical data analysis. In addition, many potential risk factors exist in numerous studies. Thus, conducting variable selection for event‐specific covariates simultaneously becomes useful in identifying important variables and assessing their effects on the events of interest. In this paper, we develop a variable selection technique for multivariate interval‐censored data under a general class of semiparametric transformation frailty models. The minimum information criterion (MIC) method is embedded in the optimization step of the proposed expectation‐maximization (EM) algorithm to obtain the parameter estimator. The proposed EM algorithm greatly reduces the computational burden in maximizing the observed likelihood function, and the MIC naturally avoids selecting the optimal tuning parameter as needed in many other popular penalties, making the proposed algorithm promising and reliable. The proposed method is evaluated through extensive simulation studies and illustrated by an analysis of patient data from the Aerobics Center Longitudinal Study.
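
The data structure described in the abstract can be illustrated with a minimal sketch. It is not the authors' code, and it does not implement their EM algorithm or the MIC penalty. It only shows how an onset time is bracketed by the two adjacent examination times at which the event status changes. The function name `censoring_interval`, the use of 0 as the study start, and the event labels `event_1` and `event_2` are assumptions made for illustration.

```python
import math
from typing import Dict, List, Tuple


def censoring_interval(exam_times: List[float],
                       statuses: List[bool]) -> Tuple[float, float]:
    """Return (left, right] bracketing the unobserved onset time of one event.

    exam_times: increasing examination times for one subject.
    statuses:   statuses[i] is True if the event had already occurred
                by exam_times[i].
    Assumption: the study starts at time 0, so an event already present at
    the first examination is bracketed by (0, first exam time].
    """
    left, right = 0.0, math.inf
    for t, occurred in zip(exam_times, statuses):
        if occurred:
            right = t      # first examination at which the event is seen
            break
        left = t           # last examination at which the event is absent
    return left, right


# One subject, two events observed at the same visits (hypothetical values).
exams = [1.0, 2.0, 3.0, 4.0]
subject: Dict[str, List[bool]] = {
    "event_1": [False, False, True, True],    # status changes between 2 and 3
    "event_2": [False, False, False, False],  # never observed: right-censored
}
intervals = {name: censoring_interval(exams, s) for name, s in subject.items()}
print(intervals)
```

An event that is never observed yields an interval with an infinite right endpoint, which corresponds to right censoring as a special case of interval censoring.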

Suggested Citation

  • Liuquan Sun & Shuwei Li & Lianming Wang & Xinyuan Song & Xuemei Sui, 2022. "Simultaneous variable selection in regression analysis of multivariate interval‐censored data," Biometrics, The International Biometric Society, vol. 78(4), pages 1402-1413, December.
  • Handle: RePEc:bla:biomet:v:78:y:2022:i:4:p:1402-1413
    DOI: 10.1111/biom.13548


    Citations


    Cited by:

    1. Fan Feng & Guanghui Cheng & Jianguo Sun, 2023. "Variable Selection for Length-Biased and Interval-Censored Failure Time Data," Mathematics, MDPI, vol. 11(22), pages 1-20, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Du, Mingyue & Zhao, Xingqiu & Sun, Jianguo, 2022. "Variable selection for case-cohort studies with informatively interval-censored outcomes," Computational Statistics & Data Analysis, Elsevier, vol. 172(C).
    2. Xu, Yang & Zhao, Shishun & Hu, Tao & Sun, Jianguo, 2021. "Variable selection for generalized odds rate mixture cure models with interval-censored failure time data," Computational Statistics & Data Analysis, Elsevier, vol. 156(C).
    3. Fan Feng & Guanghui Cheng & Jianguo Sun, 2023. "Variable Selection for Length-Biased and Interval-Censored Failure Time Data," Mathematics, MDPI, vol. 11(22), pages 1-20, November.
    4. Fengting Yi & Niansheng Tang & Jianguo Sun, 2022. "Simultaneous variable selection and estimation for joint models of longitudinal and failure time data with interval censoring," Biometrics, The International Biometric Society, vol. 78(1), pages 151-164, March.
    5. Qingning Zhou & Jianwen Cai & Haibo Zhou, 2018. "Outcome‐dependent sampling with interval‐censored failure time data," Biometrics, The International Biometric Society, vol. 74(1), pages 58-67, March.
    6. Wang Zhu & Wang C.Y., 2010. "Buckley-James Boosting for Survival Analysis with High-Dimensional Biomarker Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 9(1), pages 1-33, June.
    7. Joseph G. Ibrahim & Hongtu Zhu & Ramon I. Garcia & Ruixin Guo, 2011. "Fixed and Random Effects Selection in Mixed Effects Models," Biometrics, The International Biometric Society, vol. 67(2), pages 495-503, June.
    8. Qu, Lianqiang & Song, Xinyuan & Sun, Liuquan, 2018. "Identification of local sparsity and variable selection for varying coefficient additive hazards models," Computational Statistics & Data Analysis, Elsevier, vol. 125(C), pages 119-135.
    9. T. Cai & J. Huang & L. Tian, 2009. "Regularized Estimation for the Accelerated Failure Time Model," Biometrics, The International Biometric Society, vol. 65(2), pages 394-404, June.
    10. Chun Yin Lee & Kin Yau Wong & Kwok Fai Lam & Dipankar Bandyopadhyay, 2023. "A semiparametric joint model for cluster size and subunit‐specific interval‐censored outcomes," Biometrics, The International Biometric Society, vol. 79(3), pages 2010-2022, September.
    11. Li, Shuwei & Hu, Tao & Zhao, Xingqiu & Sun, Jianguo, 2019. "A class of semiparametric transformation cure models for interval-censored failure time data," Computational Statistics & Data Analysis, Elsevier, vol. 133(C), pages 153-165.
    12. Engler David & Li Yi, 2009. "Survival Analysis with High-Dimensional Covariates: An Application in Microarray Studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-22, February.
    13. Wei Wang & Shou‐En Lu & Jerry Q. Cheng & Minge Xie & John B. Kostis, 2022. "Multivariate survival analysis in big data: A divide‐and‐combine approach," Biometrics, The International Biometric Society, vol. 78(3), pages 852-866, September.
    14. Fei Gao & Kwun Chuen Gary Chan, 2019. "Semiparametric regression analysis of length‐biased interval‐censored data," Biometrics, The International Biometric Society, vol. 75(1), pages 121-132, March.
    15. Cheng, Chao & Feng, Xingdong & Huang, Jian & Jiao, Yuling & Zhang, Shuang, 2022. "ℓ0-Regularized high-dimensional accelerated failure time model," Computational Statistics & Data Analysis, Elsevier, vol. 170(C).
    16. Zhihua Sun & Yi Liu & Kani Chen & Gang Li, 2022. "Broken adaptive ridge regression for right-censored survival data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(1), pages 69-91, February.
    17. Yudong Wang & Zhi‐Sheng Ye & Hongyuan Cao, 2021. "On computation of semiparametric maximum likelihood estimators with shape constraints," Biometrics, The International Biometric Society, vol. 77(1), pages 113-124, March.
    18. Rong Liu & Shishun Zhao & Tao Hu & Jianguo Sun, 2022. "Variable Selection for Generalized Linear Models with Interval-Censored Failure Time Data," Mathematics, MDPI, vol. 10(5), pages 1-18, February.
    19. Li‐Pang Chen & Grace Y. Yi, 2021. "Analysis of noisy survival data with graphical proportional hazards measurement error models," Biometrics, The International Biometric Society, vol. 77(3), pages 956-969, September.
    20. Shuwei Li & Limin Peng, 2023. "Instrumental variable estimation of complier causal treatment effect with interval‐censored data," Biometrics, The International Biometric Society, vol. 79(1), pages 253-263, March.
