IDEAS home Printed from https://ideas.repec.org/a/spr/lifeda/v23y2017i3d10.1007_s10985-016-9363-2.html
   My bibliography  Save this article

Analysis of two-phase sampling data with semiparametric additive hazards models

Author

Listed:
  • Yanqing Sun

    (University of North Carolina at Charlotte)

  • Xiyuan Qian

    (East China University of Science and Technology)

  • Qiong Shou

    (Merck China & Co., Inc.)

  • Peter B. Gilbert

    (University of Washington and Fred Hutchinson Cancer Research Center)

Abstract

Under the case-cohort design introduced by Prentice (Biometrica 73:1–11, 1986), the covariate histories are ascertained only for the subjects who experience the event of interest (i.e., the cases) during the follow-up period and for a relatively small random sample from the original cohort (i.e., the subcohort). The case-cohort design has been widely used in clinical and epidemiological studies to assess the effects of covariates on failure times. Most statistical methods developed for the case-cohort design use the proportional hazards model, and few methods allow for time-varying regression coefficients. In addition, most methods disregard data from subjects outside of the subcohort, which can result in inefficient inference. Addressing these issues, this paper proposes an estimation procedure for the semiparametric additive hazards model with case-cohort/two-phase sampling data, allowing the covariates of interest to be missing for cases as well as for non-cases. A more flexible form of the additive model is considered that allows the effects of some covariates to be time varying while specifying the effects of others to be constant. An augmented inverse probability weighted estimation procedure is proposed. The proposed method allows utilizing the auxiliary information that correlates with the phase-two covariates to improve efficiency. The asymptotic properties of the proposed estimators are established. An extensive simulation study shows that the augmented inverse probability weighted estimation is more efficient than the widely adopted inverse probability weighted complete-case estimation method. The method is applied to analyze data from a preventive HIV vaccine efficacy trial.

Suggested Citation

  • Yanqing Sun & Xiyuan Qian & Qiong Shou & Peter B. Gilbert, 2017. "Analysis of two-phase sampling data with semiparametric additive hazards models," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(3), pages 377-399, July.
  • Handle: RePEc:spr:lifeda:v:23:y:2017:i:3:d:10.1007_s10985-016-9363-2
    DOI: 10.1007/s10985-016-9363-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10985-016-9363-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10985-016-9363-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zhiguo Li & Peter Gilbert & Bin Nan, 2008. "Weighted Likelihood Method for Grouped Survival Data in Case–Cohort Studies with Application to HIV Vaccine Trials," Biometrics, The International Biometric Society, vol. 64(4), pages 1247-1255, December.
    2. Yanqing Sun & Peter B. Gilbert, 2012. "Estimation of Stratified Mark‐Specific Proportional Hazards Models with Missing Marks," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 39(1), pages 34-52, March.
    3. Zhezhen Jin, 2003. "Rank-based inference for the accelerated failure time model," Biometrika, Biometrika Trust, vol. 90(2), pages 341-353, June.
    4. Kani Chen, 2001. "Generalized case–cohort sampling," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 63(4), pages 791-809.
    5. Lan Kong & Jianwen Cai, 2009. "Case–Cohort Analysis with Accelerated Failure Time Model," Biometrics, The International Biometric Society, vol. 65(1), pages 135-142, March.
    6. Norman E. Breslow & Jon A. Wellner, 2007. "Weighted Likelihood for Semiparametric Models and Two‐phase Stratified Samples, with Application to Cox Regression," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 34(1), pages 86-102, March.
    7. Michal Kulich & D.Y. Lin, 2004. "Improving the Efficiency of Relative-Risk Estimation in Case-Cohort Studies," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 832-844, January.
    8. Guozhi Gao & Anastasios A. Tsiatis, 2005. "Semiparametric estimators for the regression coefficients in the linear transformation competing risks model with missing cause of failure," Biometrika, Biometrika Trust, vol. 92(4), pages 875-891, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Qingning Zhou & Jianwen Cai & Haibo Zhou, 2020. "Semiparametric inference for a two-stage outcome-dependent sampling design with interval-censored failure time data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 26(1), pages 85-108, January.
    2. Lihong Qi & Xu Zhang & Yanqing Sun & Lu Wang & Yichuan Zhao, 2019. "Weighted estimating equations for additive hazards models with missing covariates," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(2), pages 365-387, April.
    3. Lee, Unkyung & Sun, Yanqing & Scheike, Thomas H. & Gilbert, Peter B., 2018. "Analysis of generalized semiparametric regression models for cumulative incidence functions with missing covariates," Computational Statistics & Data Analysis, Elsevier, vol. 122(C), pages 59-79.
    4. Fei Heng & Yanqing Sun & Seunggeun Hyun & Peter B. Gilbert, 2020. "Analysis of the time-varying Cox model for the cause-specific hazard functions with missing causes," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 26(4), pages 731-760, October.
    5. Yei Eun Shin & Ruth M. Pfeiffer & Barry I. Graubard & Mitchell H. Gail, 2022. "Weight calibration to improve efficiency for estimating pure risks from the additive hazards model with the nested case‐control design," Biometrics, The International Biometric Society, vol. 78(1), pages 179-191, March.
    6. Yanqing Sun & Qiong Shou & Peter B. Gilbert & Fei Heng & Xiyuan Qian, 2023. "Semiparametric additive time‐varying coefficients model for longitudinal data with censored time origin," Biometrics, The International Biometric Society, vol. 79(2), pages 695-710, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jieli Ding & Tsui-Shan Lu & Jianwen Cai & Haibo Zhou, 2017. "Recent progresses in outcome-dependent sampling with failure time data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(1), pages 57-82, January.
    2. Qingning Zhou & Jianwen Cai & Haibo Zhou, 2018. "Outcome†dependent sampling with interval†censored failure time data," Biometrics, The International Biometric Society, vol. 74(1), pages 58-67, March.
    3. Jon Arni Steingrimsson & Robert L. Strawderman, 2017. "Estimation in the Semiparametric Accelerated Failure Time Model With Missing Covariates: Improving Efficiency Through Augmentation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1221-1235, July.
    4. Yanqing Sun & Li Qi & Fei Heng & Peter B. Gilbert, 2020. "A hybrid approach for the stratified mark‐specific proportional hazards model with missing covariates and missing marks, with application to vaccine efficacy trials," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(4), pages 791-814, August.
    5. Zheng, Ming & Zhao, Ziqiang & Yu, Wen, 2013. "Quantile regression analysis of case-cohort data," Journal of Multivariate Analysis, Elsevier, vol. 122(C), pages 20-34.
    6. Jing Zhang & Haibo Zhou & Yanyan Liu & Jianwen Cai, 2021. "Conditional screening for ultrahigh-dimensional survival data in case-cohort studies," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 27(4), pages 632-661, October.
    7. Guangren Yang & Yanqing Sun & Li Qi & Peter B. Gilbert, 2017. "Estimation of Stratified Mark-Specific Proportional Hazards Models Under Two-Phase Sampling with Application to HIV Vaccine Efficacy Trials," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(1), pages 259-283, June.
    8. Gongjun Xu & Tony Sit & Lan Wang & Chiung-Yu Huang, 2017. "Estimation and Inference of Quantile Regression for Survival Data Under Biased Sampling," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(520), pages 1571-1586, October.
    9. Ying Yan & Haibo Zhou & Jianwen Cai, 2017. "Improving efficiency of parameter estimation in case-cohort studies with multivariate failure time data," Biometrics, The International Biometric Society, vol. 73(3), pages 1042-1052, September.
    10. Jichang Yu & Haibo Zhou & Jianwen Cai, 2021. "Accelerated failure time model for data from outcome-dependent sampling," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 27(1), pages 15-37, January.
    11. Mingzhe Wu & Ming Zheng & Wen Yu & Ruofan Wu, 2018. "Estimation and variable selection for semiparametric transformation models under a more efficient cohort sampling design," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(3), pages 570-596, September.
    12. Han, Bo & Wang, Xiaoguang, 2020. "Semiparametric estimation for the non-mixture cure model in case-cohort and nested case-control studies," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    13. Erik T. Parner & Per K. Andersen & Morten Overgaard, 2020. "Cumulative risk regression in case–cohort studies using pseudo-observations," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 26(4), pages 639-658, October.
    14. Menggang Yu & Bin Nan, 2010. "Regression Calibration in Semiparametric Accelerated Failure Time Models," Biometrics, The International Biometric Society, vol. 66(2), pages 405-414, June.
    15. Soyoung Kim & Yayun Xu & Mei‐Jie Zhang & Kwang‐Woo Ahn, 2020. "Stratified proportional subdistribution hazards model with covariate‐adjusted censoring weight for case‐cohort studies," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 47(4), pages 1222-1242, December.
    16. Fei Heng & Yanqing Sun & Seunggeun Hyun & Peter B. Gilbert, 2020. "Analysis of the time-varying Cox model for the cause-specific hazard functions with missing causes," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 26(4), pages 731-760, October.
    17. Sangwook Kang & Jianwen Cai, 2009. "Marginal Hazards Regression for Retrospective Studies within Cohort with Possibly Correlated Failure Time Data," Biometrics, The International Biometric Society, vol. 65(2), pages 405-414, June.
    18. Jing Zhang & Haibo Zhou & Yanyan Liu & Jianwen Cai, 2021. "Feature screening for case‐cohort studies with failure time outcome," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(1), pages 349-370, March.
    19. Qingning Zhou & Jianwen Cai & Haibo Zhou, 2020. "Semiparametric inference for a two-stage outcome-dependent sampling design with interval-censored failure time data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 26(1), pages 85-108, January.
    20. Zhiguo Li & Peter Gilbert & Bin Nan, 2008. "Weighted Likelihood Method for Grouped Survival Data in Case–Cohort Studies with Application to HIV Vaccine Trials," Biometrics, The International Biometric Society, vol. 64(4), pages 1247-1255, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:lifeda:v:23:y:2017:i:3:d:10.1007_s10985-016-9363-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.