Long-term Causal Inference Under Persistent Confounding via Data Combination

Authors

  • Guido Imbens
  • Nathan Kallus
  • Xiaojie Mao
  • Yuhao Wang

Abstract

We study the identification and estimation of long-term treatment effects when both experimental and observational data are available. Since the long-term outcome is observed only after a long delay, it is not measured in the experimental data, but only recorded in the observational data. However, both types of data include observations of some short-term outcomes. In this paper, we uniquely tackle the challenge of persistent unmeasured confounders, i.e., some unmeasured confounders that can simultaneously affect the treatment, short-term outcomes and the long-term outcome, noting that they invalidate identification strategies in previous literature. To address this challenge, we exploit the sequential structure of multiple short-term outcomes, and develop three novel identification strategies for the average long-term treatment effect. We further propose three corresponding estimators and prove their asymptotic consistency and asymptotic normality. We finally apply our methods to estimate the effect of a job training program on long-term employment using semi-synthetic data. We numerically show that our proposals outperform existing methods that fail to handle persistent confounders.
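
To make the data setup in the abstract concrete, here is a minimal simulation sketch. It is not the paper's estimator: the data-generating process, the coefficients, and the names `simulate`, `draw_outcomes`, and `diff_in_means` are all assumptions chosen for illustration. It only shows why a persistent unmeasured confounder `U` biases a naive comparison in the observational data, while the experimental data alone cannot help because the long-term outcome is not recorded there.

```python
import numpy as np


def simulate(n=200_000, seed=0, tau=1.0):
    """Toy data-combination setup with a persistent unmeasured confounder U.

    All coefficients are illustrative assumptions, not taken from the paper.
    """
    rng = np.random.default_rng(seed)

    def draw_outcomes(a, u):
        # U affects both the short-term outcome S and the long-term outcome Y.
        s = 0.5 * a + u + rng.normal(size=a.size)
        y = tau * a + 2.0 * u + rng.normal(size=a.size)
        return s, y

    def diff_in_means(y, a):
        return y[a == 1].mean() - y[a == 0].mean()

    # Observational sample: U also drives treatment uptake, so A is confounded.
    u_obs = rng.normal(size=n)
    p_treat = 1.0 / (1.0 + np.exp(-2.0 * u_obs))
    a_obs = rng.binomial(1, p_treat)
    s_obs, y_obs = draw_outcomes(a_obs, u_obs)

    # Experimental sample: A is randomized. In practice Y is not recorded here;
    # it is simulated only to provide an oracle benchmark.
    u_exp = rng.normal(size=n)
    a_exp = rng.binomial(1, 0.5, size=n)
    s_exp, y_exp_hidden = draw_outcomes(a_exp, u_exp)

    return {
        "tau": tau,
        "naive_observational": diff_in_means(y_obs, a_obs),
        "oracle_experimental": diff_in_means(y_exp_hidden, a_exp),
        "short_term_experimental": diff_in_means(s_exp, a_exp),
    }


if __name__ == "__main__":
    for name, value in simulate().items():
        print(f"{name}: {value:.3f}")
```

In this toy setup the naive observational difference in means overstates the true effect `tau`, because treated units tend to have larger `U`. The oracle experimental contrast recovers `tau` but relies on a long-term outcome that would not be available in a real experiment, which is the gap the paper's identification strategies aim to close.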

Suggested Citation

  • Guido Imbens & Nathan Kallus & Xiaojie Mao & Yuhao Wang, 2022. "Long-term Causal Inference Under Persistent Confounding via Data Combination," Papers 2202.07234, arXiv.org, revised Aug 2023.
  • Handle: RePEc:arx:papers:2202.07234

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2202.07234
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Hansen, Lars Peter, 1982. "Large Sample Properties of Generalized Method of Moments Estimators," Econometrica, Econometric Society, vol. 50(4), pages 1029-1054, July.
    2. Constantine E. Frangakis & Donald B. Rubin, 2002. "Principal Stratification in Causal Inference," Biometrics, The International Biometric Society, vol. 58(1), pages 21-29, March.
    3. Card, David & Krueger, Alan B, 1994. "Minimum Wages and Employment: A Case Study of the Fast-Food Industry in New Jersey and Pennsylvania," American Economic Review, American Economic Association, vol. 84(4), pages 772-793, September.
    4. Brenda L. Price & Peter B. Gilbert & Mark J. van der Laan, 2018. "Estimation of the optimal surrogate based on a randomized trial," Biometrics, The International Biometric Society, vol. 74(4), pages 1271-1281, December.
    5. Guido Imbens & Nathan Kallus & Xiaojie Mao, 2021. "Controlling for Unmeasured Confounding in Panel Data Using Minimal Bridge Functions: From Two-Way Fixed Effects to Factor Models," Papers 2108.03849, arXiv.org.
    6. Susan Athey & Raj Chetty & Guido Imbens, 2020. "Combining Experimental and Observational Data to Estimate Treatment Effects on Long Term Outcomes," Papers 2006.09676, arXiv.org.
    7. Raj Chetty & John N. Friedman & Nathaniel Hilger & Emmanuel Saez & Diane Whitmore Schanzenbach & Danny Yagan, 2011. "How Does Your Kindergarten Classroom Affect Your Earnings? Evidence from Project Star," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 126(4), pages 1593-1660.
    8. Susan Athey & Raj Chetty & Guido W. Imbens & Hyunseung Kang, 2019. "The Surrogate Index: Combining Short-Term Proxies to Estimate Long-Term Treatment Effects More Rapidly and Precisely," NBER Working Papers 26463, National Bureau of Economic Research, Inc.
    9. Nishanth Dikkala & Greg Lewis & Lester Mackey & Vasilis Syrgkanis, 2020. "Minimax Estimation of Conditional Moment Models," Papers 2006.07201, arXiv.org.
    10. Rahul Singh, 2022. "Generalized Kernel Ridge Regression for Long Term Causal Inference: Treatment Effects, Dose Responses, and Counterfactual Distributions," Papers 2201.05139, arXiv.org.
    11. Marshall M. Joffe & Tom Greene, 2009. "Related Causal Frameworks for Surrogate Outcomes," Biometrics, The International Biometric Society, vol. 65(2), pages 530-538, June.
    12. AmirEmad Ghassami & Alan Yang & David Richardson & Ilya Shpitser & Eric Tchetgen Tchetgen, 2022. "Combining Experimental and Observational Data for Identification and Estimation of Long-Term Causal Effects," Papers 2201.10743, arXiv.org, revised Apr 2022.
    13. Hua Chen & Zhi Geng & Jinzhu Jia, 2007. "Criteria for surrogate end points," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(5), pages 919-932, November.
    14. Bryan S. Graham & Cristine Campos de Xavier Pinto & Daniel Egel, 2016. "Efficient Estimation of Data Combination Models by the Method of Auxiliary-to-Study Tilting (AST)," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(2), pages 288-301, April.
    15. Simon, Noah & Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2011. "Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 39(i05).
    16. Tyler J. VanderWeele, 2013. "Surrogate Measures and Consistent Surrogates," Biometrics, The International Biometric Society, vol. 69(3), pages 561-565, September.
    17. Carrasco, Marine & Florens, Jean-Pierre & Renault, Eric, 2007. "Linear Inverse Problems in Structural Econometrics Estimation Based on Spectral Decomposition and Regularization," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 77, Elsevier.
    18. Isaac Meza & Rahul Singh, 2021. "Nested Nonparametric Instrumental Variable Regression: Long Term, Mediated, and Time Varying Treatment Effects," Papers 2112.14249, arXiv.org, revised Mar 2024.
    19. Xuan Wang & Layla Parast & Lu Tian & Tianxi Cai, 2020. "Model-free approach to quantifying the proportion of treatment effect explained by a surrogate marker," Biometrika, Biometrika Trust, vol. 107(1), pages 107-122.
    20. Chunrong Ai & Xiaohong Chen, 2003. "Efficient Estimation of Models with Conditional Moment Restrictions Containing Unknown Functions," Econometrica, Econometric Society, vol. 71(6), pages 1795-1843, November.
    21. Xu Shi & Wang Miao & Jennifer C. Nelson & Eric J. Tchetgen Tchetgen, 2020. "Multiply robust causal inference with double‐negative control adjustment for categorical unmeasured confounding," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(2), pages 521-540, April.
    22. Ben Deaner, 2018. "Proxy Controls and Panel Data," Papers 1810.00283, arXiv.org, revised Nov 2023.

    Citations


    Cited by:

    1. Shan Huang & Chen Wang & Yuan Yuan & Jinglong Zhao & Jingjing Zhang, 2023. "Estimating Effects of Long-Term Treatments," Papers 2308.08152, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Isaac Meza & Rahul Singh, 2021. "Nested Nonparametric Instrumental Variable Regression: Long Term, Mediated, and Time Varying Treatment Effects," Papers 2112.14249, arXiv.org, revised Mar 2024.
    2. Rahul Singh, 2022. "Generalized Kernel Ridge Regression for Long Term Causal Inference: Treatment Effects, Dose Responses, and Counterfactual Distributions," Papers 2201.05139, arXiv.org.
    3. Dmitry Arkhangelsky & Guido Imbens, 2023. "Causal Models for Longitudinal and Panel Data: A Survey," Papers 2311.15458, arXiv.org, revised Mar 2024.
    4. Gilbert Peter B. & Huang Ying & Gabriel Erin E. & Chan Ivan S.F., 2015. "Surrogate Endpoint Evaluation: Principal Stratification Criteria and the Prentice Definition," Journal of Causal Inference, De Gruyter, vol. 3(2), pages 157-175, September.
    5. Fatema Shafie Khorassani & Jeremy M. G. Taylor & Niko Kaciroti & Michael R. Elliott, 2023. "Incorporating Covariates into Measures of Surrogate Paradox Risk," Stats, MDPI, vol. 6(1), pages 1-23, February.
    6. Zhichao Jiang & Peng Ding & Zhi Geng, 2016. "Principal causal effect identification and surrogate end point evaluation by multiple trials," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(4), pages 829-848, September.
    7. Yechan Park & Yuya Sasaki, 2024. "A Bracketing Relationship for Long-Term Policy Evaluation with Combined Experimental and Observational Data," Papers 2401.12050, arXiv.org.
    8. Xuan Wang & Layla Parast & Larry Han & Lu Tian & Tianxi Cai, 2023. "Robust approach to combining multiple markers to improve surrogacy," Biometrics, The International Biometric Society, vol. 79(2), pages 788-798, June.
    9. Ying Huang & Shibasish Dasgupta, 2019. "Likelihood-Based Methods for Assessing Principal Surrogate Endpoints in Vaccine Trials," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 11(3), pages 504-523, December.
    10. Xiaohong Chen & Victor Chernozhukov & Sokbae Lee & Whitney K. Newey, 2014. "Local Identification of Nonparametric and Semiparametric Models," Econometrica, Econometric Society, vol. 82(2), pages 785-809, March.
    11. Masahiro Kato & Masaaki Imaizumi & Kenichiro McAlinn & Haruo Kakehi & Shota Yasui, 2021. "Learning Causal Models from Conditional Moment Restrictions by Importance Weighting," Papers 2108.01312, arXiv.org, revised Sep 2022.
    12. Yu, Ping & Phillips, Peter C.B., 2018. "Threshold regression with endogeneity," Journal of Econometrics, Elsevier, vol. 203(1), pages 50-68.
    13. Tyler J. VanderWeele, 2013. "Surrogate Measures and Consistent Surrogates," Biometrics, The International Biometric Society, vol. 69(3), pages 561-565, September.
    14. Xiaohong Chen & Yin Jia Jeff Qiu, 2016. "Methods for Nonparametric and Semiparametric Regressions with Endogeneity: A Gentle Guide," Annual Review of Economics, Annual Reviews, vol. 8(1), pages 259-290, October.
    15. VanderWeele Tyler J, 2011. "Principal Stratification -- Uses and Limitations," The International Journal of Biostatistics, De Gruyter, vol. 7(1), pages 1-14, July.
    16. Ai, Chunrong & Chen, Xiaohong, 2012. "The semiparametric efficiency bound for models of sequential moment restrictions containing unknown functions," Journal of Econometrics, Elsevier, vol. 170(2), pages 442-457.
    17. Yun Li & Jeremy M.G. Taylor & Michael R. Elliott, 2010. "A Bayesian Approach to Surrogacy Assessment Using Principal Stratification in Clinical Trials," Biometrics, The International Biometric Society, vol. 66(2), pages 523-531, June.
    18. Layla Parast & Tianxi Cai & Lu Tian, 2021. "Evaluating multiple surrogate markers with censored data," Biometrics, The International Biometric Society, vol. 77(4), pages 1315-1327, December.
    19. Andrew Bennett & Nathan Kallus, 2020. "The Variational Method of Moments," Papers 2012.09422, arXiv.org, revised Mar 2023.
    20. Andrew Bennett & Nathan Kallus & Xiaojie Mao & Whitney Newey & Vasilis Syrgkanis & Masatoshi Uehara, 2022. "Inference on Strongly Identified Functionals of Weakly Identified Functions," Papers 2208.08291, arXiv.org, revised Jun 2023.
