IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v180y2023ics0167947322002407.html
   My bibliography  Save this article

Adjusting for unmeasured confounding in survival causal effect using validation data

Author

Listed:
  • Cao, Yongxiu
  • Yu, Jichang

Abstract

Unmeasured confounding is an important problem in observational studies, which brings a great challenge to eliminate or reduce bias. A large main data set with unmeasured confounders and a smaller validation data set with detailed information on these confounders are combined to estimate the survival causal effect. The initial estimator based on the small validation data set under the ignorable treatment assignment and the error-prone estimator based on the large main data set are both obtained by the doubly robust method. Then, the proposed estimator is obtained by leveraging the correlation between the initial estimator and the error-prone estimator. The large sample theory of the proposed estimator is established. Simulation studies are conducted to show the good performance of the proposed method. A real data of breast cancer from the cBio Cancer Genomics Portal is analyzed to illustrate the proposed method.

Suggested Citation

  • Cao, Yongxiu & Yu, Jichang, 2023. "Adjusting for unmeasured confounding in survival causal effect using validation data," Computational Statistics & Data Analysis, Elsevier, vol. 180(C).
  • Handle: RePEc:eee:csdana:v:180:y:2023:i:c:s0167947322002407
    DOI: 10.1016/j.csda.2022.107660
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947322002407
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2022.107660?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lawrence C. McCandless & Sylvia Richardson & Nicky Best, 2012. "Adjustment for Missing Confounders Using External Validation Data and Propensity Scores," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(497), pages 40-51, March.
    2. Stute, W., 1993. "Consistent Estimation Under Random Censorship When Covariables Are Present," Journal of Multivariate Analysis, Elsevier, vol. 45(1), pages 89-103, April.
    3. Chatterjee N. & Chen Y-H. & Breslow N.E., 2003. "A Pseudoscore Estimator for Regression Problems With Two-Phase Sampling," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 158-168, January.
    4. Heejung Bang & James M. Robins, 2005. "Doubly Robust Estimation in Missing Data and Causal Inference Models," Biometrics, The International Biometric Society, vol. 61(4), pages 962-973, December.
    5. Yi‐Hau Chen, 2002. "Cox regression in cohort studies with validation sampling," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(1), pages 51-62, January.
    6. Guido W. Imbens & Tony Lancaster, 1994. "Combining Micro and Macro Data in Microeconometric Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 61(4), pages 655-680.
    7. Nilanjan Chatterjee & Yi-Hau Chen & Paige Maas & Raymond J. Carroll, 2016. "Constrained Maximum Likelihood Estimation for Model Calibration Using Summary-Level Information From External Big Data Sources," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(513), pages 107-117, March.
    8. Heejung Bang & Anastasios A. Tsiatis, 2002. "Median Regression with Censored Cost Data," Biometrics, The International Biometric Society, vol. 58(3), pages 643-649, September.
    9. Yingchao Zhong & Douglas E. Schaubel, 2022. "Restricted mean survival time as a function of restriction time," Biometrics, The International Biometric Society, vol. 78(1), pages 192-201, March.
    10. Weiwei Wang & Daniel Scharfstein & Zhiqiang Tan & Ellen J. MacKenzie, 2009. "Causal inference in outcome‐dependent two‐phase sampling designs," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(5), pages 947-969, November.
    11. Shu Yang & Peng Ding, 2020. "Combining Multiple Observational Data Sources to Estimate Causal Effects," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(531), pages 1540-1554, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tian Gu & Jeremy Michael George Taylor & Bhramar Mukherjee, 2023. "A synthetic data integration framework to leverage external summary‐level information from heterogeneous populations," Biometrics, The International Biometric Society, vol. 79(4), pages 3831-3845, December.
    2. Sungwan Bang & Soo-Heang Eo & Yong Mee Cho & Myoungshic Jhun & HyungJun Cho, 2016. "Non-crossing weighted kernel quantile regression with right censored data," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 22(1), pages 100-121, January.
    3. Wenqin Pan & Donglin Zeng, 2011. "Estimating Mean Cost Using Auxiliary Covariates," Biometrics, The International Biometric Society, vol. 67(3), pages 996-1006, September.
    4. Chixiang Chen & Ming Wang & Shuo Chen, 2023. "An efficient data integration scheme for synthesizing information from multiple secondary datasets for the parameter inference of the main analysis," Biometrics, The International Biometric Society, vol. 79(4), pages 2947-2960, December.
    5. Hu, Jianwei & Chai, Hao, 2013. "Adjusted regularized estimation in the accelerated failure time model with high dimensional covariates," Journal of Multivariate Analysis, Elsevier, vol. 122(C), pages 96-114.
    6. Corwin M. Zigler & Krista Watts & Robert W. Yeh & Yun Wang & Brent A. Coull & Francesca Dominici, 2013. "Model Feedback in Bayesian Propensity Score Estimation," Biometrics, The International Biometric Society, vol. 69(1), pages 263-273, March.
    7. Ruoyu Wang & Qihua Wang & Wang Miao, 2023. "A robust fusion-extraction procedure with summary statistics in the presence of biased sources," Biometrika, Biometrika Trust, vol. 110(4), pages 1023-1040.
    8. Fei Gao & K. C. G. Chan, 2023. "Noniterative adjustment to regression estimators with population‐based auxiliary information for semiparametric models," Biometrics, The International Biometric Society, vol. 79(1), pages 140-150, March.
    9. Han Zhang & Lu Deng & William Wheeler & Jing Qin & Kai Yu, 2022. "Integrative analysis of multiple case‐control studies," Biometrics, The International Biometric Society, vol. 78(3), pages 1080-1091, September.
    10. Ziqi Chen & Jing Ning & Yu Shen & Jing Qin, 2021. "Combining primary cohort data with external aggregate information without assuming comparability," Biometrics, The International Biometric Society, vol. 77(3), pages 1024-1036, September.
    11. Bryan S. Graham & Cristine Campos De Xavier Pinto & Daniel Egel, 2012. "Inverse Probability Tilting for Moment Condition Models with Missing Data," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 79(3), pages 1053-1079.
    12. Yu‐Jen Cheng & Yen‐Chun Liu & Chang‐Yu Tsai & Chiung‐Yu Huang, 2023. "Semiparametric estimation of the transformation model by leveraging external aggregate data in the presence of population heterogeneity," Biometrics, The International Biometric Society, vol. 79(3), pages 1996-2009, September.
    13. Sant’Anna, Pedro H.C. & Zhao, Jun, 2020. "Doubly robust difference-in-differences estimators," Journal of Econometrics, Elsevier, vol. 219(1), pages 101-122.
    14. Ruoxuan Xiong & Allison Koenecke & Michael Powell & Zhu Shen & Joshua T. Vogelstein & Susan Athey, 2021. "Federated Causal Inference in Heterogeneous Observational Data," Papers 2107.11732, arXiv.org, revised Apr 2023.
    15. Debashis Ghosh & Michael S. Sabel, 2022. "A Weighted Sample Framework to Incorporate External Calculators for Risk Modeling," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 14(3), pages 363-379, December.
    16. Jelena Bradic & Weijie Ji & Yuqian Zhang, 2021. "High-dimensional Inference for Dynamic Treatment Effects," Papers 2110.04924, arXiv.org, revised May 2023.
    17. Lawrence C. McCandless & Sylvia Richardson & Nicky Best, 2012. "Adjustment for Missing Confounders Using External Validation Data and Propensity Scores," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(497), pages 40-51, March.
    18. Wang, Xuan & Wang, Qihua, 2015. "Semiparametric linear transformation model with differential measurement error and validation sampling," Journal of Multivariate Analysis, Elsevier, vol. 141(C), pages 67-80.
    19. Leora Friedberg & Steven Stern, 2014. "Marriage, Divorce, And Asymmetric Information," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 55(4), pages 1155-1199, November.
    20. L. Tian & J. Liu & Y. Zhao & L. J. Wei, 2004. "Statistical inference based on non-smooth estimating functions," Biometrika, Biometrika Trust, vol. 91(4), pages 943-954, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:180:y:2023:i:c:s0167947322002407. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.