IDEAS home Printed from https://ideas.repec.org/a/spr/testjl/v32y2023i1d10.1007_s11749-022-00829-8.html
   My bibliography  Save this article

Robust censored regression with $$\ell _1$$ ℓ 1 -norm regularization

Author

Listed:
  • Jad Beyhum

    (KU Leuven)

  • Ingrid Keilegom

    (KU Leuven)

Abstract

This paper considers inference in a linear regression model with random right censoring and outliers. The number of outliers can grow with the sample size while their proportion goes to zero. We make only very mild assumptions on the distribution of the error term, contrary to most other existing approaches in the literature. We propose to penalize the estimator proposed by Stute for censored linear regression by the $$\ell _1$$ ℓ 1 -norm. We derive rates of convergence and establish asymptotic normality of the estimator of the regression coefficients. Our estimator has the same asymptotic variance as Stute’s estimator in the censored linear model without outliers. Hence, there is no loss of efficiency as a result of robustness. Tests and confidence sets can therefore rely on the theory developed by Stute. The outlined procedure is also computationally advantageous, since it amounts to solving a convex optimization program. We also propose a second estimator which uses the proposed penalized Stute estimator as a first step to detect outliers. It has similar theoretical properties but better performance in finite samples as assessed by simulations. We apply the outlined procedures on data from the Ohio State transplant center.

Suggested Citation

  • Jad Beyhum & Ingrid Keilegom, 2023. "Robust censored regression with $$\ell _1$$ ℓ 1 -norm regularization," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(1), pages 146-162, March.
  • Handle: RePEc:spr:testjl:v:32:y:2023:i:1:d:10.1007_s11749-022-00829-8
    DOI: 10.1007/s11749-022-00829-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11749-022-00829-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11749-022-00829-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Stute, W., 1993. "Consistent Estimation Under Random Censorship When Covariables Are Present," Journal of Multivariate Analysis, Elsevier, vol. 45(1), pages 89-103, April.
    2. Chiou, Sy Han & Kang, Sangwook & Yan, Jun, 2014. "Fitting Accelerated Failure Time Models in Routine Survival Analysis with R Package aftgee," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 61(i11).
    3. Locatelli, Isabella & Marazzi, Alfio & Yohai, Victor J., 2011. "Robust accelerated failure time regression," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 874-887, January.
    4. Sanjoy K. Sinha, 2019. "Robust estimation in accelerated failure time models," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 25(1), pages 52-78, January.
    5. She, Yiyuan & Owen, Art B., 2011. "Outlier Detection Using Nonconvex Penalized Regression," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 626-639.
    6. Heller, Glenn, 2007. "Smoothed Rank Regression With Censored Data," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 552-559, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhiping Qiu & Jing Qin & Yong Zhou, 2016. "Composite Estimating Equation Method for the Accelerated Failure Time Model with Length-biased Sampling Data," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(2), pages 396-415, June.
    2. Yize Zhao & Matthias Chung & Brent A. Johnson & Carlos S. Moreno & Qi Long, 2016. "Hierarchical Feature Selection Incorporating Known and Novel Biological Information: Identifying Genomic Features Related to Prostate Cancer Recurrence," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1427-1439, October.
    3. Qiu, Zhiping & Peng, Limin & Manatunga, Amita & Guo, Ying, 2019. "A smooth nonparametric approach to determining cut-points of a continuous scale," Computational Statistics & Data Analysis, Elsevier, vol. 134(C), pages 186-210.
    4. Adam Braima S. Mastor & Abdulaziz S. Alghamdi & Oscar Ngesa & Joseph Mung’atu & Christophe Chesneau & Ahmed Z. Afify, 2023. "The Extended Exponential-Weibull Accelerated Failure Time Model with Application to Sudan COVID-19 Data," Mathematics, MDPI, vol. 11(2), pages 1-26, January.
    5. Umberto Amato & Anestis Antoniadis & Italia De Feis & Irene Gijbels, 2021. "Penalised robust estimators for sparse and high-dimensional linear models," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(1), pages 1-48, March.
    6. Wang Zhu & Wang C.Y., 2010. "Buckley-James Boosting for Survival Analysis with High-Dimensional Biomarker Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 9(1), pages 1-33, June.
    7. Young‐Geun Choi & Lawrence P. Hanrahan & Derek Norton & Ying‐Qi Zhao, 2022. "Simultaneous spatial smoothing and outlier detection using penalized regression, with application to childhood obesity surveillance from electronic health records," Biometrics, The International Biometric Society, vol. 78(1), pages 324-336, March.
    8. Ruoqing Zhu & Ying-Qi Zhao & Guanhua Chen & Shuangge Ma & Hongyu Zhao, 2017. "Greedy outcome weighted tree learning of optimal personalized treatment rules," Biometrics, The International Biometric Society, vol. 73(2), pages 391-400, June.
    9. Guessoum Zohra & Ould-Said Elias, 2009. "On nonparametric estimation of the regression function under random censorship model," Statistics & Risk Modeling, De Gruyter, vol. 26(3), pages 159-177, April.
    10. Weiyu Li & Valentin Patilea, 2018. "A dimension reduction approach for conditional Kaplan–Meier estimators," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(2), pages 295-315, June.
    11. Wentao Qu & Xianchao Xiu & Huangyue Chen & Lingchen Kong, 2023. "A Survey on High-Dimensional Subspace Clustering," Mathematics, MDPI, vol. 11(2), pages 1-39, January.
    12. Sungwan Bang & Soo-Heang Eo & Yong Mee Cho & Myoungshic Jhun & HyungJun Cho, 2016. "Non-crossing weighted kernel quantile regression with right censored data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 22(1), pages 100-121, January.
    13. Olivier Lopez & Xavier Milhaud & Pierre-Emmanuel Thérond, 2015. "Tree-based censored regression with applications to insurance," Working Papers hal-01141228, HAL.
    14. Cao, Yongxiu & Yu, Jichang, 2023. "Adjusting for unmeasured confounding in survival causal effect using validation data," Computational Statistics & Data Analysis, Elsevier, vol. 180(C).
    15. Amor Keziou & Aida Toma, 2021. "A Robust Version of the Empirical Likelihood Estimator," Mathematics, MDPI, vol. 9(8), pages 1-19, April.
    16. Seokho Lee & Hyejin Shin & Sang Han Lee, 2016. "Label‐noise resistant logistic regression for functional data classification with an application to Alzheimer's disease study," Biometrics, The International Biometric Society, vol. 72(4), pages 1325-1335, December.
    17. Wenceslao González Manteiga & Cédric Heuchenne & César Sánchez Sellero & Alessandro Beretta, 2020. "Goodness-of-fit tests for censored regression based on artificial data points," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(2), pages 599-615, June.
    18. Amorim, Ana Paula & de Uña-Álvarez, Jacobo & Meira-Machado, Luís, 2011. "Presmoothing the transition probabilities in the illness-death model," Statistics & Probability Letters, Elsevier, vol. 81(7), pages 797-806, July.
    19. Uña-Álvarez, Jacobo de & González-Manteiga, Wenceslao, 1999. "Strong consistency under proportional censorship when covariables are present," Statistics & Probability Letters, Elsevier, vol. 42(3), pages 283-292, April.
    20. Feriel Bouhadjera & Mohamed Lemdani & Elias Ould Saïd, 2023. "Strong uniform consistency of the local linear relative error regression estimator under left truncation," Statistical Papers, Springer, vol. 64(2), pages 421-447, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:testjl:v:32:y:2023:i:1:d:10.1007_s11749-022-00829-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.