IDEAS home Printed from https://ideas.repec.org/p/hum/wpaper/sfb649dp2012-061.html
   My bibliography  Save this paper

Variable selection in Cox regression models with varying coefficients

Author

Listed:
  • Toshio Honda
  • Wolfgang Karl Härdle

Abstract

We deal with two kinds of Cox regression models with varying coefficients. The coefficients vary with time in one model. In the other model, there is an important random variable called an index variable and the coefficients vary with the variable. In both models, we have p-dimensional covariates and p increases moderately. However, it is the case that only a small part of the covariates are relevant in these situations. We carry out variable selection and estimation of the coefficient functions by using the group SCAD-type estimator and the adaptive group Lasso estimator. We examine the theoretical properties of the estimators, especially the L2 convergence rate, the sparsity, and the oracle property. Simulation studies and a real data analysis show the performance of these new techniques.

Suggested Citation

  • Toshio Honda & Wolfgang Karl Härdle, 2012. "Variable selection in Cox regression models with varying coefficients," SFB 649 Discussion Papers SFB649DP2012-061, Sonderforschungsbereich 649, Humboldt University, Berlin, Germany.
  • Handle: RePEc:hum:wpaper:sfb649dp2012-061
    as

    Download full text from publisher

    File URL: http://sfb649.wiwi.hu-berlin.de/papers/pdf/SFB649DP2012-061.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    2. Jianwen Cai & Jianqing Fan & Jiancheng Jiang & Haibo Zhou, 2008. "Partially linear hazard regression with varying coefficients for multivariate survival data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(1), pages 141-158, February.
    3. Lukas Meier & Sara Van De Geer & Peter Bühlmann, 2008. "The group lasso for logistic regression," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(1), pages 53-71, February.
    4. Wang, Lifeng & Li, Hongzhe & Huang, Jianhua Z., 2008. "Variable Selection in Nonparametric Varying-Coefficient Models for Analysis of Repeated Measurements," Journal of the American Statistical Association, American Statistical Association, vol. 103(484), pages 1556-1569.
    5. S. Wang & B. Nan & N. Zhu & J. Zhu, 2009. "Hierarchically penalized Cox regression with grouped variables," Biometrika, Biometrika Trust, vol. 96(2), pages 307-322.
    6. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    7. Hao Helen Zhang & Wenbin Lu, 2007. "Adaptive Lasso for Cox's proportional hazards model," Biometrika, Biometrika Trust, vol. 94(3), pages 691-703.
    8. Jun Yan & Jian Huang, 2012. "Model Selection for Cox Models with Time-Varying Coefficients," Biometrics, The International Biometric Society, vol. 68(2), pages 419-428, June.
    9. Zhang, Hao Helen & Cheng, Guang & Liu, Yufeng, 2011. "Linear or Nonlinear? Automatic Structure Discovery for Partially Linear Models," Journal of the American Statistical Association, American Statistical Association, vol. 106(495), pages 1099-1112.
    10. Jianwen Cai & Jianqing Fan & Runze Li & Haibo Zhou, 2005. "Variable selection for multivariate failure time data," Biometrika, Biometrika Trust, vol. 92(2), pages 303-316, June.
    11. Ming Yuan & Yi Lin, 2006. "Model selection and estimation in regression with grouped variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(1), pages 49-67, February.
    12. Cai, Jianwen & Fan, Jianqing & Jiang, Jiancheng & Zhou, Haibo, 2007. "Partially Linear Hazard Regression for Multivariate Survival Data," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 538-551, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. HONDA, Toshio & 本田, 敏雄 & YABE, Ryota & 矢部, 竜太, 2017. "Variable selection and structure identification for varying coefficient Cox models," Discussion Papers 2016-05, Graduate School of Economics, Hitotsubashi University.
    2. Ling Zhou & Lu Tang & Angela T. Song & Diane M. Cibrik & Peter X.-K. Song, 2017. "A LASSO Method to Identify Protein Signature Predicting Post-transplant Renal Graft Survival," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(2), pages 431-452, December.
    3. Honda, Toshio & 本田, 敏雄, 2019. "The de-biased group Lasso estimation for varying coefficient models," Discussion Papers 2018-04, Graduate School of Economics, Hitotsubashi University.
    4. Toshio Honda, 2021. "The de-biased group Lasso estimation for varying coefficient models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(1), pages 3-29, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qu, Lianqiang & Song, Xinyuan & Sun, Liuquan, 2018. "Identification of local sparsity and variable selection for varying coefficient additive hazards models," Computational Statistics & Data Analysis, Elsevier, vol. 125(C), pages 119-135.
    2. Xin Cheng & Wenbin Lu & Mengling Liu, 2015. "Identification of homogeneous and heterogeneous variables in pooled cohort studies," Biometrics, The International Biometric Society, vol. 71(2), pages 397-403, June.
    3. Heng Lian & Peng Lai & Hua Liang, 2013. "Partially Linear Structure Selection in Cox Models with Varying Coefficients," Biometrics, The International Biometric Society, vol. 69(2), pages 348-357, June.
    4. Kaida Cai & Hua Shen & Xuewen Lu, 2022. "Adaptive bi-level variable selection for multivariate failure time model with a diverging number of covariates," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(4), pages 968-993, December.
    5. Jun Yan & Jian Huang, 2012. "Model Selection for Cox Models with Time-Varying Coefficients," Biometrics, The International Biometric Society, vol. 68(2), pages 419-428, June.
    6. Joseph G. Ibrahim & Hongtu Zhu & Ramon I. Garcia & Ruixin Guo, 2011. "Fixed and Random Effects Selection in Mixed Effects Models," Biometrics, The International Biometric Society, vol. 67(2), pages 495-503, June.
    7. Yongxiu Cao & Jian Huang & Yanyan Liu & Xingqiu Zhao, 2016. "Sieve estimation of Cox models with latent structures," Biometrics, The International Biometric Society, vol. 72(4), pages 1086-1097, December.
    8. Heng Lian & Xin Chen & Jian-Yi Yang, 2012. "Identification of Partially Linear Structure in Additive Models with an Application to Gene Expression Prediction from Sequences," Biometrics, The International Biometric Society, vol. 68(2), pages 437-445, June.
    9. Yanfang Zhang & Chuanhua Wei & Xiaolin Liu, 2022. "Group Logistic Regression Models with l p,q Regularization," Mathematics, MDPI, vol. 10(13), pages 1-15, June.
    10. Young Joo Yoon & Cheolwoo Park & Erik Hofmeister & Sangwook Kang, 2012. "Group variable selection in cardiopulmonary cerebral resuscitation data for veterinary patients," Journal of Applied Statistics, Taylor & Francis Journals, vol. 39(7), pages 1605-1621, January.
    11. Wenyan Zhong & Xuewen Lu & Jingjing Wu, 2021. "Bi-level variable selection in semiparametric transformation models with right-censored data," Computational Statistics, Springer, vol. 36(3), pages 1661-1692, September.
    12. Fabian Scheipl & Thomas Kneib & Ludwig Fahrmeir, 2013. "Penalized likelihood and Bayesian function selection in regression models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 97(4), pages 349-385, October.
    13. G. Yi & J. Q. Shi & T. Choi, 2011. "Penalized Gaussian Process Regression and Classification for High-Dimensional Nonlinear Data," Biometrics, The International Biometric Society, vol. 67(4), pages 1285-1294, December.
    14. Lian, Heng & Li, Jianbo & Tang, Xingyu, 2014. "SCAD-penalized regression in additive partially linear proportional hazards models with an ultra-high-dimensional linear part," Journal of Multivariate Analysis, Elsevier, vol. 125(C), pages 50-64.
    15. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    16. Yize Zhao & Matthias Chung & Brent A. Johnson & Carlos S. Moreno & Qi Long, 2016. "Hierarchical Feature Selection Incorporating Known and Novel Biological Information: Identifying Genomic Features Related to Prostate Cancer Recurrence," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1427-1439, October.
    17. Zhang, Tao & Zhang, Qingzhao & Wang, Qihua, 2014. "Model detection for functional polynomial regression," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 183-197.
    18. Lian, Heng & Li, Jianbo & Hu, Yuao, 2013. "Shrinkage variable selection and estimation in proportional hazards models with additive structure and high dimensionality," Computational Statistics & Data Analysis, Elsevier, vol. 63(C), pages 99-112.
    19. Zanhua Yin, 2020. "Variable selection for sparse logistic regression," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 83(7), pages 821-836, October.
    20. Lichun Wang & Yuan You & Heng Lian, 2015. "Convergence and sparsity of Lasso and group Lasso in high-dimensional generalized linear models," Statistical Papers, Springer, vol. 56(3), pages 819-828, August.

    More about this item

    Keywords

    Cox regression model; high-dimensional data; sparsity; oracle estimator; B-splines; group SCAD; adaptive group Lasso; L2 convergence rate;
    All these keywords.

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C24 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Truncated and Censored Models; Switching Regression Models; Threshold Regression Models

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hum:wpaper:sfb649dp2012-061. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: RDC-Team (email available below). General contact details of provider: https://edirc.repec.org/data/sohubde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.