IDEAS home Printed from https://ideas.repec.org/a/spr/stabio/v17y2025i2d10.1007_s12561-024-09442-9.html
   My bibliography  Save this article

Analyzing Left-Truncated Samples with the Cox Model in the Presence of Missing Covariates

Author

Listed:
  • Omar Vazquez

    (University of Pennsylvania)

  • Hayley M. Locke

    (University of Pennsylvania)

  • Sharon X. Xie

    (University of Pennsylvania)

Abstract

Delayed enrollment of subjects into a time-to-event study may result in a sample with biased outcome and covariate distributions. Additionally, missing covariate data may arise in these studies when information is difficult to collect due to patient burden or high testing costs. Some common missing data strategies, such as multiple imputation (MI) and augmented inverse probability weighting (AIPW), involve modeling the distribution of the missing covariate, which may be inaccurate with a left-truncated sample. Through simulation studies, we explore the performance of these methods in estimating Cox regression parameters under a variety of truncation and missing data scenarios. We find that MI and AIPW may be approximately unbiased when truncation is very low. Otherwise, biased estimation of the covariate distribution can cause MI to perform poorly. Similarly, AIPW may rely more heavily on correct estimation of the probability of having non-missing covariates in some settings. We apply these approaches to a Parkinson’s disease dementia biomarker study. In analyzing left-truncated data with missing covariates, careful consideration of the data characteristics and method assumptions is needed to obtain valid results.

Suggested Citation

  • Omar Vazquez & Hayley M. Locke & Sharon X. Xie, 2025. "Analyzing Left-Truncated Samples with the Cox Model in the Presence of Missing Covariates," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 17(2), pages 555-574, July.
  • Handle: RePEc:spr:stabio:v:17:y:2025:i:2:d:10.1007_s12561-024-09442-9
    DOI: 10.1007/s12561-024-09442-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12561-024-09442-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s12561-024-09442-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. C. Y. Wang & Hua Yun Chen, 2001. "Augmented Inverse Probability Weighted Estimator for Cox Missing Covariate Regression," Biometrics, The International Biometric Society, vol. 57(2), pages 414-419, June.
    2. Qian, Jing & Betensky, Rebecca A., 2014. "Assumptions regarding right censoring in the presence of left truncation," Statistics & Probability Letters, Elsevier, vol. 87(C), pages 12-17.
    3. Bergeron, Pierre-Jerome & Asgharian, Masoud & Wolfson, David B., 2008. "Covariate Bias Induced by Length-Biased Sampling of Failure Times," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 737-742, June.
    4. Qi, Lihong & Wang, C.Y. & Prentice, Ross L., 2005. "Weighted Estimators for Proportional Hazards Regression With Missing Covariates," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 1250-1263, December.
    5. Na Hu & Xuerong Chen & Jianguo Sun, 2015. "Regression Analysis of Length-biased and Right-censored Failure Time Data with Missing Covariates," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(2), pages 438-452, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Du, Mingyue & Li, Huiqiong & Sun, Jianguo, 2021. "Regression analysis of censored data with nonignorable missing covariates and application to Alzheimer Disease," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
    2. Torben Martinussen & Klaus K. Holst & Thomas H. Scheike, 2016. "Cox regression with missing covariate data using a modified partial likelihood method," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 22(4), pages 570-588, October.
    3. Jon Arni Steingrimsson & Robert L. Strawderman, 2017. "Estimation in the Semiparametric Accelerated Failure Time Model With Missing Covariates: Improving Efficiency Through Augmentation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1221-1235, July.
    4. Shanshan Li & Yang Ning, 2015. "Estimation of covariate‐specific time‐dependent ROC curves in the presence of missing biomarkers," Biometrics, The International Biometric Society, vol. 71(3), pages 666-676, September.
    5. Menggang Yu & Bin Nan, 2010. "Regression Calibration in Semiparametric Accelerated Failure Time Models," Biometrics, The International Biometric Society, vol. 66(2), pages 405-414, June.
    6. Lihong Qi & Xu Zhang & Yanqing Sun & Lu Wang & Yichuan Zhao, 2019. "Weighted estimating equations for additive hazards models with missing covariates," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(2), pages 365-387, April.
    7. Donglin Zeng & Qingxia Chen, 2010. "Adjustment for Missingness Using Auxiliary Information in Semiparametric Regression," Biometrics, The International Biometric Society, vol. 66(1), pages 115-122, March.
    8. Lou, Yichen & Ma, Yuqing & Xiang, Liming & Sun, Jianguo, 2025. "A multiple imputation approach for flexible modelling of interval-censored data with missing and censored covariates," Computational Statistics & Data Analysis, Elsevier, vol. 209(C).
    9. Na Hu & Xuerong Chen & Jianguo Sun, 2015. "Regression Analysis of Length-biased and Right-censored Failure Time Data with Missing Covariates," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(2), pages 438-452, June.
    10. Yanyao Yi & Ting Ye & Menggang Yu & Jun Shao, 2020. "Cox regression with survival‐time‐dependent missing covariate values," Biometrics, The International Biometric Society, vol. 76(2), pages 460-471, June.
    11. Huang, Bin & Wang, Qihua, 2010. "Semiparametric analysis based on weighted estimating equations for transformation models with missing covariates," Journal of Multivariate Analysis, Elsevier, vol. 101(9), pages 2078-2090, October.
    12. Ertefaie Ashkan & Asgharian Masoud & Stephens David A., 2015. "Double Bias: Estimation of Causal Effects from Length-Biased Samples in the Presence of Confounding," The International Journal of Biostatistics, De Gruyter, vol. 11(1), pages 69-89, May.
    13. Chi-Chung Wen & Chien-Tai Lin, 2011. "Analysis of Current Status Data with Missing Covariates," Biometrics, The International Biometric Society, vol. 67(3), pages 760-769, September.
    14. Jieli Ding & Tsui-Shan Lu & Jianwen Cai & Haibo Zhou, 2017. "Recent progresses in outcome-dependent sampling with failure time data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(1), pages 57-82, January.
    15. Frank Eriksson & Torben Martinussen & Søren Feodor Nielsen, 2020. "Large sample results for frequentist multiple imputation for Cox regression with missing covariate data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 72(4), pages 969-996, August.
    16. Kwun Chuen Gary Chan & Mei-Cheng Wang, 2012. "Estimating Incident Population Distribution from Prevalent Data," Biometrics, The International Biometric Society, vol. 68(2), pages 521-531, June.
    17. Kuhn, Peter J. & Shen, Kailing, 2010. "Gender Discrimination in Job Ads: Theory and Evidence," IZA Discussion Papers 5195, Institute of Labor Economics (IZA).
    18. Micha Mandel & Ya'akov Ritov, 2010. "The Accelerated Failure Time Model Under Biased Sampling," Biometrics, The International Biometric Society, vol. 66(4), pages 1306-1308, December.
    19. Peter Kuhn & Kailing Shen, 2009. "Employers' Preferences for Gender, Age, Height and Beauty: Direct Evidence," NBER Working Papers 15564, National Bureau of Economic Research, Inc.
    20. Bella Vakulenko‐Lagun & Jing Qian & Sy Han Chiou & Nancy Wang & Rebecca A. Betensky, 2022. "Nonparametric estimation of the survival distribution under covariate‐induced dependent truncation," Biometrics, The International Biometric Society, vol. 78(4), pages 1390-1401, December.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stabio:v:17:y:2025:i:2:d:10.1007_s12561-024-09442-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.