IDEAS home Printed from https://ideas.repec.org/a/bla/scjsta/v51y2024i3p1086-1102.html

Confidence bands for survival curves from outcome‐dependent stratified samples

Author

Listed:
  • Takumi Saegusa
  • Peter Nandori

Abstract

We consider the construction of confidence bands for survival curves under the outcome‐dependent stratified sampling. A main challenge of this design is that data are a biased dependent sample due to stratification and sampling without replacement. Most literature on regression approximates this design by Bernoulli sampling but variance is generally overestimated. Even with this approximation, the limiting distribution of the inverse probability weighted Kaplan–Meier estimator involves a general Gaussian process, and hence quantiles of its supremum is not analytically available. In this paper, we provide a rigorous asymptotic theory for the weighted Kaplan–Meier estimator accounting for dependence in the sample. We propose the novel hybrid method to both simulate and bootstrap parts of the limiting process to compute confidence bands with asymptotically correct coverage probability. Simulation study indicates that the proposed bands are appropriate for practical use. A Wilms tumor example is presented.

Suggested Citation

  • Takumi Saegusa & Peter Nandori, 2024. "Confidence bands for survival curves from outcome‐dependent stratified samples," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 51(3), pages 1086-1102, September.
  • Handle: RePEc:bla:scjsta:v:51:y:2024:i:3:p:1086-1102
    DOI: 10.1111/sjos.12700
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/sjos.12700
    Download Restriction: no

    File URL: https://libkey.io/10.1111/sjos.12700?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Qingning Zhou & Jianwen Cai & Haibo Zhou, 2018. "Outcome†dependent sampling with interval†censored failure time data," Biometrics, The International Biometric Society, vol. 74(1), pages 58-67, March.
    2. Tianxi Cai & Yingye Zheng, 2013. "Resampling Procedures for Making Inference Under Nested Case--Control Studies," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(504), pages 1532-1544, December.
    3. Winnett A. & Sasieni P., 2002. "Adjusted Nelson-Aalen Estimates With Retrospective Matching," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 245-256, March.
    4. Ma, Shuangge & Kosorok, Michael R., 2005. "Robust semiparametric M-estimation and the weighted bootstrap," Journal of Multivariate Analysis, Elsevier, vol. 96(1), pages 190-217, September.
    5. Dennis Dobler & Markus Pauly & ThomasH. Scheike, 2019. "Confidence bands for multiplicative hazards models: Flexible resampling approaches," Biometrics, The International Biometric Society, vol. 75(3), pages 906-916, September.
    6. Donglin Zeng & D. Y. Lin, 2014. "Efficient Estimation of Semiparametric Transformation Models for Two-Phase Cohort Studies," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 371-383, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jon Arni Steingrimsson & Robert L. Strawderman, 2017. "Estimation in the Semiparametric Accelerated Failure Time Model With Missing Covariates: Improving Efficiency Through Augmentation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1221-1235, July.
    2. J. Feifel & D. Dobler, 2021. "Dynamic inference in general nested case‐control designs," Biometrics, The International Biometric Society, vol. 77(1), pages 175-185, March.
    3. Qingning Zhou & Jianwen Cai & Haibo Zhou, 2020. "Semiparametric inference for a two-stage outcome-dependent sampling design with interval-censored failure time data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 26(1), pages 85-108, January.
    4. Fei Gao & Kwun Chuen Gary Chan, 2024. "Efficient Estimation of Semiparametric Transformation Model with Interval-Censored Data in Two-Phase Cohort Studies," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 16(1), pages 203-220, April.
    5. repec:hum:wpaper:sfb649dp2014-067 is not listed on IDEAS
    6. Qingning Zhou & Jianwen Cai & Haibo Zhou, 2018. "Outcome†dependent sampling with interval†censored failure time data," Biometrics, The International Biometric Society, vol. 74(1), pages 58-67, March.
    7. Salim Bouzebda & Issam Elhattab & Anouar Abdeldjaoued Ferfache, 2022. "General M-Estimator Processes and their m out of n Bootstrap with Functional Nuisance Parameters," Methodology and Computing in Applied Probability, Springer, vol. 24(4), pages 2961-3005, December.
    8. Chen, Xiaohong & Pouzo, Demian, 2009. "Efficient estimation of semiparametric conditional moment models with possibly nonsmooth residuals," Journal of Econometrics, Elsevier, vol. 152(1), pages 46-60, September.
    9. Giovanni Compiani & Philip Haile & Marcelo Sant’Anna, 2020. "Common Values, Unobserved Heterogeneity, and Endogenous Entry in US Offshore Oil Lease Auctions," Journal of Political Economy, University of Chicago Press, vol. 128(10), pages 3872-3912.
    10. Mingzhe Wu & Ming Zheng & Wen Yu & Ruofan Wu, 2018. "Estimation and variable selection for semiparametric transformation models under a more efficient cohort sampling design," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(3), pages 570-596, September.
    11. Pereda-Fernández, Santiago, 2023. "Identification and estimation of triangular models with a binary treatment," Journal of Econometrics, Elsevier, vol. 234(2), pages 585-623.
    12. Gianluca Frasso & Jonathan Jaeger & Philippe Lambert, 2016. "Parameter estimation and inference in dynamic systems described by linear partial differential equations," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 100(3), pages 259-287, July.
    13. Demian Pouzo, 2014. "Bootstrap Consistency for Quadratic Forms of Sample Averages with Increasing Dimension," Papers 1411.2701, arXiv.org, revised Aug 2015.
    14. Chernozhukov, Victor & Fernández-Val, Iván & Hoderlein, Stefan & Holzmann, Hajo & Newey, Whitney, 2015. "Nonparametric identification in panels using quantiles," Journal of Econometrics, Elsevier, vol. 188(2), pages 378-392.
    15. Jessica G. Young & Nicholas P. Jewell & Steven J. Samuels, 2008. "Regression Analysis of a Disease Onset Distribution Using Diagnosis Data," Biometrics, The International Biometric Society, vol. 64(1), pages 20-28, March.
    16. Han, Bo & Wang, Xiaoguang, 2020. "Semiparametric estimation for the non-mixture cure model in case-cohort and nested case-control studies," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    17. Yichen Lou & Peijie Wang & Jianguo Sun, 2023. "A semi-parametric weighted likelihood approach for regression analysis of bivariate interval-censored outcomes from case-cohort studies," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 29(3), pages 628-653, July.
    18. J. F. Lawless, 2018. "Two-phase outcome-dependent studies for failure times and testing for effects of expensive covariates," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 24(1), pages 28-44, January.
    19. Chernozhukov, Victor & Fernández-Val, Iván & Kowalski, Amanda E., 2015. "Quantile regression with censoring and endogeneity," Journal of Econometrics, Elsevier, vol. 186(1), pages 201-221.
    20. Hsu, Yu-Chin & Shiu, Ji-Liang & Wan, Yuanyuan, 2024. "Testing identification conditions of LATE in fuzzy regression discontinuity designs," Journal of Econometrics, Elsevier, vol. 241(1).
    21. De Silva, Dakshina G. & Kosmopoulou, Georgia & Lamarche, Carlos, 2017. "Subcontracting and the survival of plants in the road construction industry: A panel quantile regression analysis," Journal of Economic Behavior & Organization, Elsevier, vol. 137(C), pages 113-131.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:scjsta:v:51:y:2024:i:3:p:1086-1102. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0303-6898 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.