IDEAS home Printed from https://ideas.repec.org/a/spr/metrik/v81y2018i7d10.1007_s00184-018-0660-5.html
   My bibliography  Save this article

Nonparametric independence feature screening for ultrahigh-dimensional survival data

Author

Listed:
  • Jing Pan

    (Shanghai University of Finance and Economics)

  • Yuan Yu

    (Shanghai University of Finance and Economics)

  • Yong Zhou

    (East China Normal University)

Abstract

With the explosion of digital information, high-dimensional data is frequently collected in prevalent domains, in which the dimension of covariates can be much larger than the sample size. Many effective methods have been developed to reduce the dimension of such data recently, however, few methods might perform well for survival data with censoring. In this article, we develop a novel nonparametric feature screening procedure based on ultrahigh-dimensional survival data by incorporating the inverse probability weighting scheme to tackle the issue of censoring. The proposed method is model-free and hence can be implemented for extensive survival models. Moreover, it is robust to heterogeneity and invariant to monotone increasing transformations of the response. The sure screening property and ranking consistency property are also established under mild conditions. The competence and robustness of our method is further confirmed through comprehensive simulation studies and an analysis of a real data example.

Suggested Citation

  • Jing Pan & Yuan Yu & Yong Zhou, 2018. "Nonparametric independence feature screening for ultrahigh-dimensional survival data," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 81(7), pages 821-847, October.
  • Handle: RePEc:spr:metrik:v:81:y:2018:i:7:d:10.1007_s00184-018-0660-5
    DOI: 10.1007/s00184-018-0660-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00184-018-0660-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00184-018-0660-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lin, Huazhen & Peng, Heng, 2013. "Smoothed rank correlation of the linear transformation regression model," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 615-630.
    2. Emre Barut & Jianqing Fan & Anneleen Verhasselt, 2016. "Conditional Sure Independence Screening," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 1266-1277, July.
    3. Yuanshan Wu & Guosheng Yin, 2015. "Conditional quantile screening in ultrahigh-dimensional heterogeneous data," Biometrika, Biometrika Trust, vol. 102(1), pages 65-76.
    4. Zhao, Sihai Dave & Li, Yi, 2012. "Principled sure independence screening for Cox models with ultra-high-dimensional covariates," Journal of Multivariate Analysis, Elsevier, vol. 105(1), pages 397-411.
    5. Rui Song & Wenbin Lu & Shuangge Ma & X. Jessie Jeng, 2014. "Censored rank independence screening for high-dimensional survival data," Biometrika, Biometrika Trust, vol. 101(4), pages 799-814.
    6. Fan, Jianqing & Feng, Yang & Song, Rui, 2011. "Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Additive Models," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 544-557.
    7. Peng, Limin & Fine, Jason P., 2009. "Competing Risks Quantile Regression," Journal of the American Statistical Association, American Statistical Association, vol. 104(488), pages 1440-1453.
    8. Runze Li & Wei Zhong & Liping Zhu, 2012. "Feature Screening via Distance Correlation Learning," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(499), pages 1129-1139, September.
    9. Jianqing Fan & Yunbei Ma & Wei Dai, 2014. "Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Varying Coefficient Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 1270-1284, September.
    10. Anders Gorst-Rasmussen & Thomas Scheike, 2013. "Independent screening for single-index hazard rate models with ultrahigh dimensional features," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(2), pages 217-246, March.
    11. Shujie Ma & Runze Li & Chih-Ling Tsai, 2017. "Variable Screening via Quantile Partial Correlation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 650-663, April.
    12. Jianqing Fan & Jinchi Lv, 2008. "Sure independence screening for ultrahigh dimensional feature space," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(5), pages 849-911, November.
    13. Zhang, Jing & Liu, Yanyan & Wu, Yuanshan, 2017. "Correlation rank screening for ultrahigh-dimensional survival data," Computational Statistics & Data Analysis, Elsevier, vol. 108(C), pages 121-132.
    14. Hyokyoung G. Hong & Jian Kang & Yi Li, 2018. "Conditional screening for ultra-high dimensional covariates with survival outcomes," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 24(1), pages 45-71, January.
    15. D. Zeng & D. Y. Lin, 2007. "Maximum likelihood estimation in semiparametric regression models with censored data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(4), pages 507-564, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yang Qu & Yu Cheng, 2023. "Volume under the ROC surface for high-dimensional independent screening with ordinal competing risk outcomes," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 29(4), pages 735-751, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jing Zhang & Haibo Zhou & Yanyan Liu & Jianwen Cai, 2021. "Conditional screening for ultrahigh-dimensional survival data in case-cohort studies," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 27(4), pages 632-661, October.
    2. Zhong, Wei & Wang, Jiping & Chen, Xiaolin, 2021. "Censored mean variance sure independence screening for ultrahigh dimensional survival data," Computational Statistics & Data Analysis, Elsevier, vol. 159(C).
    3. Zhang, Shen & Zhao, Peixin & Li, Gaorong & Xu, Wangli, 2019. "Nonparametric independence screening for ultra-high dimensional generalized varying coefficient models with longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 37-52.
    4. Xiaolin Chen & Yi Liu & Qihua Wang, 2019. "Joint feature screening for ultra-high-dimensional sparse additive hazards model by the sparsity-restricted pseudo-score estimator," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(5), pages 1007-1031, October.
    5. Zhang, Shucong & Zhou, Yong, 2018. "Variable screening for ultrahigh dimensional heterogeneous data via conditional quantile correlations," Journal of Multivariate Analysis, Elsevier, vol. 165(C), pages 1-13.
    6. Chen, Xiaolin & Chen, Xiaojing & Wang, Hong, 2018. "Robust feature screening for ultra-high dimensional right censored data via distance correlation," Computational Statistics & Data Analysis, Elsevier, vol. 119(C), pages 118-138.
    7. Jing Zhang & Guosheng Yin & Yanyan Liu & Yuanshan Wu, 2018. "Censored cumulative residual independent screening for ultrahigh-dimensional survival data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 24(2), pages 273-292, April.
    8. Jing Zhang & Yanyan Liu & Hengjian Cui, 2021. "Model-free feature screening via distance correlation for ultrahigh dimensional survival data," Statistical Papers, Springer, vol. 62(6), pages 2711-2738, December.
    9. Jing Zhang & Haibo Zhou & Yanyan Liu & Jianwen Cai, 2021. "Feature screening for case‐cohort studies with failure time outcome," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(1), pages 349-370, March.
    10. Zhang, Jing & Liu, Yanyan & Wu, Yuanshan, 2017. "Correlation rank screening for ultrahigh-dimensional survival data," Computational Statistics & Data Analysis, Elsevier, vol. 108(C), pages 121-132.
    11. Pan, Yingli, 2022. "Feature screening and FDR control with knockoff features for ultrahigh-dimensional right-censored data," Computational Statistics & Data Analysis, Elsevier, vol. 173(C).
    12. Liu, Yanyan & Zhang, Jing & Zhao, Xingqiu, 2018. "A new nonparametric screening method for ultrahigh-dimensional survival data," Computational Statistics & Data Analysis, Elsevier, vol. 119(C), pages 74-85.
    13. Xiaolin Chen & Catherine Chunling Liu & Sheng Xu, 2021. "An efficient algorithm for joint feature screening in ultrahigh-dimensional Cox’s model," Computational Statistics, Springer, vol. 36(2), pages 885-910, June.
    14. Jing Zhang & Qihua Wang & Xuan Wang, 2022. "Surrogate-variable-based model-free feature screening for survival data under the general censoring mechanism," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(2), pages 379-397, April.
    15. Jinfeng Xu & Wai Keung Li & Zhiliang Ying, 2020. "Variable screening for survival data in the presence of heterogeneous censoring," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 47(4), pages 1171-1191, December.
    16. Jianglin Fang, 2021. "Feature screening for ultrahigh-dimensional survival data when failure indicators are missing at random," Statistical Papers, Springer, vol. 62(3), pages 1141-1166, June.
    17. Chen, Xiaolin & Zhang, Yahui & Chen, Xiaojing & Liu, Yi, 2019. "A simple model-free survival conditional feature screening," Statistics & Probability Letters, Elsevier, vol. 146(C), pages 156-160.
    18. Guo, Chaohui & Lv, Jing & Wu, Jibo, 2021. "Composite quantile regression for ultra-high dimensional semiparametric model averaging," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    19. Qu, Lianqiang & Wang, Xiaoyu & Sun, Liuquan, 2022. "Variable screening for varying coefficient models with ultrahigh-dimensional survival data," Computational Statistics & Data Analysis, Elsevier, vol. 172(C).
    20. Xiaochao Xia & Hao Ming, 2022. "A Flexibly Conditional Screening Approach via a Nonparametric Quantile Partial Correlation," Mathematics, MDPI, vol. 10(24), pages 1-32, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:metrik:v:81:y:2018:i:7:d:10.1007_s00184-018-0660-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.