IDEAS home Printed from https://ideas.repec.org/a/spr/stabio/v17y2025i2d10.1007_s12561-024-09442-9.html
   My bibliography  Save this article

Analyzing Left-Truncated Samples with the Cox Model in the Presence of Missing Covariates

Author

Listed:
  • Omar Vazquez

    (University of Pennsylvania)

  • Hayley M. Locke

    (University of Pennsylvania)

  • Sharon X. Xie

    (University of Pennsylvania)

Abstract

Delayed enrollment of subjects into a time-to-event study may result in a sample with biased outcome and covariate distributions. Additionally, missing covariate data may arise in these studies when information is difficult to collect due to patient burden or high testing costs. Some common missing data strategies, such as multiple imputation (MI) and augmented inverse probability weighting (AIPW), involve modeling the distribution of the missing covariate, which may be inaccurate with a left-truncated sample. Through simulation studies, we explore the performance of these methods in estimating Cox regression parameters under a variety of truncation and missing data scenarios. We find that MI and AIPW may be approximately unbiased when truncation is very low. Otherwise, biased estimation of the covariate distribution can cause MI to perform poorly. Similarly, AIPW may rely more heavily on correct estimation of the probability of having non-missing covariates in some settings. We apply these approaches to a Parkinson’s disease dementia biomarker study. In analyzing left-truncated data with missing covariates, careful consideration of the data characteristics and method assumptions is needed to obtain valid results.

Suggested Citation

  • Omar Vazquez & Hayley M. Locke & Sharon X. Xie, 2025. "Analyzing Left-Truncated Samples with the Cox Model in the Presence of Missing Covariates," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 17(2), pages 555-574, July.
  • Handle: RePEc:spr:stabio:v:17:y:2025:i:2:d:10.1007_s12561-024-09442-9
    DOI: 10.1007/s12561-024-09442-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12561-024-09442-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s12561-024-09442-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Bergeron, Pierre-Jerome & Asgharian, Masoud & Wolfson, David B., 2008. "Covariate Bias Induced by Length-Biased Sampling of Failure Times," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 737-742, June.
    2. Qi, Lihong & Wang, C.Y. & Prentice, Ross L., 2005. "Weighted Estimators for Proportional Hazards Regression With Missing Covariates," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 1250-1263, December.
    3. C. Y. Wang & Hua Yun Chen, 2001. "Augmented Inverse Probability Weighted Estimator for Cox Missing Covariate Regression," Biometrics, The International Biometric Society, vol. 57(2), pages 414-419, June.
    4. Na Hu & Xuerong Chen & Jianguo Sun, 2015. "Regression Analysis of Length-biased and Right-censored Failure Time Data with Missing Covariates," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(2), pages 438-452, June.
    5. Qian, Jing & Betensky, Rebecca A., 2014. "Assumptions regarding right censoring in the presence of left truncation," Statistics & Probability Letters, Elsevier, vol. 87(C), pages 12-17.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Du, Mingyue & Li, Huiqiong & Sun, Jianguo, 2021. "Regression analysis of censored data with nonignorable missing covariates and application to Alzheimer Disease," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
    2. Torben Martinussen & Klaus K. Holst & Thomas H. Scheike, 2016. "Cox regression with missing covariate data using a modified partial likelihood method," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 22(4), pages 570-588, October.
    3. Yanyao Yi & Ting Ye & Menggang Yu & Jun Shao, 2020. "Cox regression with survival‐time‐dependent missing covariate values," Biometrics, The International Biometric Society, vol. 76(2), pages 460-471, June.
    4. Jon Arni Steingrimsson & Robert L. Strawderman, 2017. "Estimation in the Semiparametric Accelerated Failure Time Model With Missing Covariates: Improving Efficiency Through Augmentation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1221-1235, July.
    5. Huang, Bin & Wang, Qihua, 2010. "Semiparametric analysis based on weighted estimating equations for transformation models with missing covariates," Journal of Multivariate Analysis, Elsevier, vol. 101(9), pages 2078-2090, October.
    6. Shanshan Li & Yang Ning, 2015. "Estimation of covariate‐specific time‐dependent ROC curves in the presence of missing biomarkers," Biometrics, The International Biometric Society, vol. 71(3), pages 666-676, September.
    7. Menggang Yu & Bin Nan, 2010. "Regression Calibration in Semiparametric Accelerated Failure Time Models," Biometrics, The International Biometric Society, vol. 66(2), pages 405-414, June.
    8. Ertefaie Ashkan & Asgharian Masoud & Stephens David A., 2015. "Double Bias: Estimation of Causal Effects from Length-Biased Samples in the Presence of Confounding," The International Journal of Biostatistics, De Gruyter, vol. 11(1), pages 69-89, May.
    9. Chi-Chung Wen & Chien-Tai Lin, 2011. "Analysis of Current Status Data with Missing Covariates," Biometrics, The International Biometric Society, vol. 67(3), pages 760-769, September.
    10. Lihong Qi & Xu Zhang & Yanqing Sun & Lu Wang & Yichuan Zhao, 2019. "Weighted estimating equations for additive hazards models with missing covariates," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(2), pages 365-387, April.
    11. Donglin Zeng & Qingxia Chen, 2010. "Adjustment for Missingness Using Auxiliary Information in Semiparametric Regression," Biometrics, The International Biometric Society, vol. 66(1), pages 115-122, March.
    12. Jieli Ding & Tsui-Shan Lu & Jianwen Cai & Haibo Zhou, 2017. "Recent progresses in outcome-dependent sampling with failure time data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(1), pages 57-82, January.
    13. Lou, Yichen & Ma, Yuqing & Xiang, Liming & Sun, Jianguo, 2025. "A multiple imputation approach for flexible modelling of interval-censored data with missing and censored covariates," Computational Statistics & Data Analysis, Elsevier, vol. 209(C).
    14. Na Hu & Xuerong Chen & Jianguo Sun, 2015. "Regression Analysis of Length-biased and Right-censored Failure Time Data with Missing Covariates," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(2), pages 438-452, June.
    15. Yu Shen & Jing Ning & Jing Qin, 2017. "Nonparametric and semiparametric regression estimation for length-biased survival data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(1), pages 3-24, January.
    16. Jing Qin & Yu Shen, 2010. "Statistical Methods for Analyzing Right-Censored Length-Biased Data under Cox Model," Biometrics, The International Biometric Society, vol. 66(2), pages 382-392, June.
    17. Frank Eriksson & Torben Martinussen & Søren Feodor Nielsen, 2020. "Large sample results for frequentist multiple imputation for Cox regression with missing covariate data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 72(4), pages 969-996, August.
    18. Kwun Chuen Gary Chan & Mei-Cheng Wang, 2012. "Estimating Incident Population Distribution from Prevalent Data," Biometrics, The International Biometric Society, vol. 68(2), pages 521-531, June.
    19. X. Hu & Bin Zhang, 2012. "Pseudolikelihood ratio test with biased observations," Statistical Papers, Springer, vol. 53(2), pages 387-400, May.
    20. Kuhn, Peter J. & Shen, Kailing, 2010. "Gender Discrimination in Job Ads: Theory and Evidence," IZA Discussion Papers 5195, Institute of Labor Economics (IZA).

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stabio:v:17:y:2025:i:2:d:10.1007_s12561-024-09442-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.