IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v78y2022i1p179-191.html
   My bibliography  Save this article

Weight calibration to improve efficiency for estimating pure risks from the additive hazards model with the nested case‐control design

Author

Listed:
  • Yei Eun Shin
  • Ruth M. Pfeiffer
  • Barry I. Graubard
  • Mitchell H. Gail

Abstract

We study the efficiency of covariate‐specific estimates of pure risk (one minus the survival function) when some covariates are only available for case‐control samples nested in a cohort. We focus on the semiparametric additive hazards model in which the hazard function equals a baseline hazard plus a linear combination of covariates with either time‐varying or time‐invariant coefficients. A published approach uses the design‐based inclusion probabilities to reweight the nested case‐control data. We obtain more efficient estimates of pure risks by calibrating the design weights to data available in the entire cohort, for both time‐varying and time‐invariant covariate coefficients. We develop explicit variance formulas for the weight‐calibrated estimates based on influence functions. Simulations show the improvement in precision by using weight calibration and confirm the consistency of variance estimators and the validity of inference based on asymptotic normality. Examples are provided using data from the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial Study (PLCO).

Suggested Citation

  • Yei Eun Shin & Ruth M. Pfeiffer & Barry I. Graubard & Mitchell H. Gail, 2022. "Weight calibration to improve efficiency for estimating pure risks from the additive hazards model with the nested case‐control design," Biometrics, The International Biometric Society, vol. 78(1), pages 179-191, March.
  • Handle: RePEc:bla:biomet:v:78:y:2022:i:1:p:179-191
    DOI: 10.1111/biom.13413
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.13413
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.13413?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Wu C. & Sitter R. R, 2001. "A Model-Calibration Approach to Using Complete Auxiliary Information From Survey Data," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 185-193, March.
    2. Michal Kulich & D.Y. Lin, 2004. "Improving the Efficiency of Relative-Risk Estimation in Case-Cohort Studies," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 832-844, January.
    3. Thomas Lumley & Pamela A. Shaw & James Y. Dai, 2011. "Connections between Survey Calibration Estimators and Semiparametric Models for Incomplete Data," International Statistical Review, International Statistical Institute, vol. 79(2), pages 200-220, August.
    4. Yanqing Sun & Xiyuan Qian & Qiong Shou & Peter B. Gilbert, 2017. "Analysis of two-phase sampling data with semiparametric additive hazards models," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(3), pages 377-399, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jason P. Estes & Bhramar Mukherjee & Jeremy M. G. Taylor, 2018. "Empirical Bayes Estimation and Prediction Using Summary-Level Information From External Big Data Sources Adjusting for Violations of Transportability," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 10(3), pages 568-586, December.
    2. Qingning Zhou & Jianwen Cai & Haibo Zhou, 2020. "Semiparametric inference for a two-stage outcome-dependent sampling design with interval-censored failure time data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 26(1), pages 85-108, January.
    3. Tan, Zhiqiang, 2014. "Second-order asymptotic theory for calibration estimators in sampling and missing-data problems," Journal of Multivariate Analysis, Elsevier, vol. 131(C), pages 240-253.
    4. Yei Eun Shin & Ruth M. Pfeiffer & Barry I. Graubard & Mitchell H. Gail, 2020. "Weight calibration to improve the efficiency of pure risk estimates from case‐control samples nested in a cohort," Biometrics, The International Biometric Society, vol. 76(4), pages 1087-1097, December.
    5. Debashis Ghosh & Michael S. Sabel, 2022. "A Weighted Sample Framework to Incorporate External Calculators for Risk Modeling," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 14(3), pages 363-379, December.
    6. Sarjinder Singh, 2012. "On the calibration of design weights using a displacement function," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 75(1), pages 85-107, January.
    7. Peisong Han & Linglong Kong & Jiwei Zhao & Xingcai Zhou, 2019. "A general framework for quantile estimation with incomplete data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 81(2), pages 305-333, April.
    8. Adane F. Wogu & Haolin Li & Shanshan Zhao & Hazel B. Nichols & Jianwen Cai, 2023. "Additive subdistribution hazards regression for competing risks data in case‐cohort studies," Biometrics, The International Biometric Society, vol. 79(4), pages 3010-3022, December.
    9. Alkaya Aylin & Ayhan H. Öztaş & Esin Alptekin, 2017. "Sequential Data Weighting Procedures for Combined Ratio Estimators in Complex Sample Surveys," Statistics in Transition New Series, Statistics Poland, vol. 18(2), pages 247-270, June.
    10. Jing Zhang & Haibo Zhou & Yanyan Liu & Jianwen Cai, 2021. "Conditional screening for ultrahigh-dimensional survival data in case-cohort studies," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 27(4), pages 632-661, October.
    11. Luis Castro-Martín & María del Mar Rueda & Ramón Ferri-García & César Hernando-Tamayo, 2021. "On the Use of Gradient Boosting Methods to Improve the Estimation with Data Obtained with Self-Selection Procedures," Mathematics, MDPI, vol. 9(23), pages 1-23, November.
    12. Eli Ben-Michael & Avi Feller & Jesse Rothstein, 2021. "The Augmented Synthetic Control Method," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(536), pages 1789-1803, October.
    13. Gustavo Amorim & Ran Tao & Sarah Lotspeich & Pamela A. Shaw & Thomas Lumley & Bryan E. Shepherd, 2021. "Two‐phase sampling designs for data validation in settings with covariate measurement error and continuous outcome," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(4), pages 1368-1389, October.
    14. Brick J. Michael, 2013. "Unit Nonresponse and Weighting Adjustments: A Critical Review," Journal of Official Statistics, Sciendo, vol. 29(3), pages 329-353, June.
    15. Lihong Qi & Xu Zhang & Yanqing Sun & Lu Wang & Yichuan Zhao, 2019. "Weighted estimating equations for additive hazards models with missing covariates," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(2), pages 365-387, April.
    16. Jing Zhang & Haibo Zhou & Yanyan Liu & Jianwen Cai, 2021. "Feature screening for case‐cohort studies with failure time outcome," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(1), pages 349-370, March.
    17. Lothian Jack & Holmberg Anders & Seyb Allyson, 2019. "An Evolutionary Schema for Using “it-is-what-it-is” Data in Official Statistics," Journal of Official Statistics, Sciendo, vol. 35(1), pages 137-165, March.
    18. Domingo Morales & María del Mar Rueda & Dolores Esteban, 2018. "Model-Assisted Estimation of Small Area Poverty Measures: An Application within the Valencia Region in Spain," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 138(3), pages 873-900, August.
    19. Lee, Unkyung & Sun, Yanqing & Scheike, Thomas H. & Gilbert, Peter B., 2018. "Analysis of generalized semiparametric regression models for cumulative incidence functions with missing covariates," Computational Statistics & Data Analysis, Elsevier, vol. 122(C), pages 59-79.
    20. Ying Sheng & Yifei Sun & Chiung‐Yu Huang & Mi‐Ok Kim, 2022. "Synthesizing external aggregated information in the presence of population heterogeneity: A penalized empirical likelihood approach," Biometrics, The International Biometric Society, vol. 78(2), pages 679-690, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:78:y:2022:i:1:p:179-191. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.