IDEAS home Printed from https://ideas.repec.org/a/taf/jnlasa/v112y2017i519p1221-1235.html
   My bibliography  Save this article

Estimation in the Semiparametric Accelerated Failure Time Model With Missing Covariates: Improving Efficiency Through Augmentation

Author

Listed:
  • Jon Arni Steingrimsson
  • Robert L. Strawderman

Abstract

This article considers linear regression with missing covariates and a right censored outcome. We first consider a general two-phase outcome sampling design, where full covariate information is only ascertained for subjects in phase two and sampling occurs under an independent Bernoulli sampling scheme with known subject-specific sampling probabilities that depend on phase one information (e.g., survival time, failure status and covariates). The semiparametric information bound is derived for estimating the regression parameter in this setting. We also introduce a more practical class of augmented estimators that is shown to improve asymptotic efficiency over simple but inefficient inverse probability of sampling weighted estimators. Estimation for known sampling weights and extensions to the case of estimated sampling weights are both considered. The allowance for estimated sampling weights permits covariates to be missing at random according to a monotone but unknown mechanism. The asymptotic properties of the augmented estimators are derived and simulation results demonstrate substantial efficiency improvements over simpler inverse probability of sampling weighted estimators in the indicated settings. With suitable modification, the proposed methodology can also be used to improve augmented estimators previously used for missing covariates in a Cox regression model. Supplementary materials for this article are available online.

Suggested Citation

  • Jon Arni Steingrimsson & Robert L. Strawderman, 2017. "Estimation in the Semiparametric Accelerated Failure Time Model With Missing Covariates: Improving Efficiency Through Augmentation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1221-1235, July.
  • Handle: RePEc:taf:jnlasa:v:112:y:2017:i:519:p:1221-1235
    DOI: 10.1080/01621459.2016.1205500
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/01621459.2016.1205500
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/01621459.2016.1205500?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zhezhen Jin, 2003. "Rank-based inference for the accelerated failure time model," Biometrika, Biometrika Trust, vol. 90(2), pages 341-353, June.
    2. Tianxi Cai & Yingye Zheng, 2013. "Resampling Procedures for Making Inference Under Nested Case--Control Studies," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(504), pages 1532-1544, December.
    3. Lynn M. Johnson & Robert L. Strawderman, 2009. "Induced smoothing for the semiparametric accelerated failure time model: asymptotics and extensions to clustered data," Biometrika, Biometrika Trust, vol. 96(3), pages 577-590.
    4. Donglin Zeng & D. Y. Lin, 2014. "Efficient Estimation of Semiparametric Transformation Models for Two-Phase Cohort Studies," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 371-383, March.
    5. C. Y. Wang & Hua Yun Chen, 2001. "Augmented Inverse Probability Weighted Estimator for Cox Missing Covariate Regression," Biometrics, The International Biometric Society, vol. 57(2), pages 414-419, June.
    6. Bin Nan & Menggang Yu & John D. Kalbfleisch, 2006. "Censored linear regression for case-cohort studies," Biometrika, Biometrika Trust, vol. 93(4), pages 747-762, December.
    7. Zeng, Donglin & Lin, D.Y., 2007. "Efficient Estimation for the Accelerated Failure Time Model," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 1387-1396, December.
    8. Norman E. Breslow & Jon A. Wellner, 2007. "Weighted Likelihood for Semiparametric Models and Two‐phase Stratified Samples, with Application to Cox Regression," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 34(1), pages 86-102, March.
    9. Jianwen Cai & Donglin Zeng, 2007. "Power Calculation for Case–Cohort Studies with Nonrare Events," Biometrics, The International Biometric Society, vol. 63(4), pages 1288-1295, December.
    10. Michal Kulich & D.Y. Lin, 2004. "Improving the Efficiency of Relative-Risk Estimation in Case-Cohort Studies," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 832-844, January.
    11. Qi, Lihong & Wang, C.Y. & Prentice, Ross L., 2005. "Weighted Estimators for Proportional Hazards Regression With Missing Covariates," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 1250-1263, December.
    12. Xiaodong Luo & Wei Yann Tsai & Qiang Xu, 2009. "Pseudo-partial likelihood estimators for the Cox regression model with missing covariates," Biometrika, Biometrika Trust, vol. 96(3), pages 617-633.
    13. Xu, Qiang & Paik, Myunghee Cho & Luo, Xiaodong & Tsai, Wei-Yann, 2009. "Reweighting Estimators for Cox Regression With Missing Covariates," Journal of the American Statistical Association, American Statistical Association, vol. 104(487), pages 1155-1167.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Menggang Yu & Bin Nan, 2010. "Regression Calibration in Semiparametric Accelerated Failure Time Models," Biometrics, The International Biometric Society, vol. 66(2), pages 405-414, June.
    2. Jieli Ding & Tsui-Shan Lu & Jianwen Cai & Haibo Zhou, 2017. "Recent progresses in outcome-dependent sampling with failure time data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(1), pages 57-82, January.
    3. Jichang Yu & Haibo Zhou & Jianwen Cai, 2021. "Accelerated failure time model for data from outcome-dependent sampling," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 27(1), pages 15-37, January.
    4. Du, Mingyue & Li, Huiqiong & Sun, Jianguo, 2021. "Regression analysis of censored data with nonignorable missing covariates and application to Alzheimer Disease," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
    5. Shanshan Li & Yang Ning, 2015. "Estimation of covariate‐specific time‐dependent ROC curves in the presence of missing biomarkers," Biometrics, The International Biometric Society, vol. 71(3), pages 666-676, September.
    6. Na Hu & Xuerong Chen & Jianguo Sun, 2015. "Regression Analysis of Length-biased and Right-censored Failure Time Data with Missing Covariates," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(2), pages 438-452, June.
    7. Torben Martinussen & Klaus K. Holst & Thomas H. Scheike, 2016. "Cox regression with missing covariate data using a modified partial likelihood method," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 22(4), pages 570-588, October.
    8. Qingning Zhou & Jianwen Cai & Haibo Zhou, 2018. "Outcome†dependent sampling with interval†censored failure time data," Biometrics, The International Biometric Society, vol. 74(1), pages 58-67, March.
    9. Yanyao Yi & Ting Ye & Menggang Yu & Jun Shao, 2020. "Cox regression with survival‐time‐dependent missing covariate values," Biometrics, The International Biometric Society, vol. 76(2), pages 460-471, June.
    10. Yanqing Sun & Xiyuan Qian & Qiong Shou & Peter B. Gilbert, 2017. "Analysis of two-phase sampling data with semiparametric additive hazards models," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(3), pages 377-399, July.
    11. Yayun Xu & Soyoung Kim & Mei-Jie Zhang & David Couper & Kwang Woo Ahn, 2022. "Competing risks regression models with covariates-adjusted censoring weight under the generalized case-cohort design," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 28(2), pages 241-262, April.
    12. Xiaolin Chen & Jianwen Cai, 2018. "Reweighted estimators for additive hazard model with censoring indicators missing at random," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 24(2), pages 224-249, April.
    13. Deng, Lifeng & Ding, Jieli & Liu, Yanyan & Wei, Chengdong, 2018. "Regression analysis for the proportional hazards model with parameter constraints under case-cohort design," Computational Statistics & Data Analysis, Elsevier, vol. 117(C), pages 194-206.
    14. Jing Zhang & Haibo Zhou & Yanyan Liu & Jianwen Cai, 2021. "Conditional screening for ultrahigh-dimensional survival data in case-cohort studies," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 27(4), pages 632-661, October.
    15. Jing Zhang & Haibo Zhou & Yanyan Liu & Jianwen Cai, 2021. "Feature screening for case‐cohort studies with failure time outcome," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(1), pages 349-370, March.
    16. Ying Yan & Haibo Zhou & Jianwen Cai, 2017. "Improving efficiency of parameter estimation in case-cohort studies with multivariate failure time data," Biometrics, The International Biometric Society, vol. 73(3), pages 1042-1052, September.
    17. Suhyun Kang & Wenbin Lu & Mengling Liu, 2017. "Efficient estimation for accelerated failure time model under case-cohort and nested case-control sampling," Biometrics, The International Biometric Society, vol. 73(1), pages 114-123, March.
    18. Yize Zhao & Matthias Chung & Brent A. Johnson & Carlos S. Moreno & Qi Long, 2016. "Hierarchical Feature Selection Incorporating Known and Novel Biological Information: Identifying Genomic Features Related to Prostate Cancer Recurrence," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1427-1439, October.
    19. Xu, Linzhi & Zhang, Jiajia, 2010. "An EM-like algorithm for the semiparametric accelerated failure time gamma frailty model," Computational Statistics & Data Analysis, Elsevier, vol. 54(6), pages 1467-1474, June.
    20. Zhiping Qiu & Jing Qin & Yong Zhou, 2016. "Composite Estimating Equation Method for the Accelerated Failure Time Model with Length-biased Sampling Data," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(2), pages 396-415, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:jnlasa:v:112:y:2017:i:519:p:1221-1235. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/UASA20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.