IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2601.17712.html

The Proximal Surrogate Index: Long-Term Treatment Effects under Unobserved Confounding

Author

Listed:
  • Ting-Chih Hung
  • Yu-Chang Chen

Abstract

We study the identification and estimation of long-term treatment effects under unobserved confounding by combining an experimental sample, where the long-term outcome is missing, with an observational sample, where the treatment assignment is unobserved. While standard surrogate index methods fail when unobserved confounders exist, we establish novel identification results by leveraging proxy variables for the unobserved confounders. We further develop multiply robust estimation and inference procedures based on these results. Applying our method to the Job Corps program, we demonstrate its ability to recover experimental benchmarks even when unobserved confounders bias standard surrogate index estimates.

Suggested Citation

  • Ting-Chih Hung & Yu-Chang Chen, 2026. "The Proximal Surrogate Index: Long-Term Treatment Effects under Unobserved Confounding," Papers 2601.17712, arXiv.org.
  • Handle: RePEc:arx:papers:2601.17712
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2601.17712
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Peter Z. Schochet & John Burghardt & Steven Glazerman, 2001. "National Job Corps Study: The Impacts of Job Corps on Participants' Employment and Related Outcomes," Mathematica Policy Research Reports db6c4204b8e1408bb0c6289ec, Mathematica Policy Research.
    2. Constantine E. Frangakis & Donald B. Rubin, 2002. "Principal Stratification in Causal Inference," Biometrics, The International Biometric Society, vol. 58(1), pages 21-29, March.
    3. Chen, Jiafeng & Ritzwoller, David M., 2023. "Semiparametric estimation of long-term treatment effects," Journal of Econometrics, Elsevier, vol. 237(2).
    4. Imbens, Guido W., 2014. "Instrumental Variables: An Econometrician's Perspective," IZA Discussion Papers 8048, IZA Network @ LISER.
    5. Yifan Cui & Hongming Pu & Xu Shi & Wang Miao & Eric Tchetgen Tchetgen, 2024. "Semiparametric Proximal Causal Inference," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 119(546), pages 1348-1359, April.
    6. Imbens, Guido W & Angrist, Joshua D, 1994. "Identification and Estimation of Local Average Treatment Effects," Econometrica, Econometric Society, vol. 62(2), pages 467-475, March.
    7. Tyler J. VanderWeele, 2013. "Surrogate Measures and Consistent Surrogates," Biometrics, The International Biometric Society, vol. 69(3), pages 561-565, September.
    8. Guido Imbens & Nathan Kallus & Xiaojie Mao, 2021. "Controlling for Unmeasured Confounding in Panel Data Using Minimal Bridge Functions: From Two-Way Fixed Effects to Factor Models," Papers 2108.03849, arXiv.org.
    9. Steffen L. Lauritzen, 2004. "Discussion on Causality," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 31(2), pages 189-193, June.
    10. Whitney K. Newey & James L. Powell, 2003. "Instrumental Variable Estimation of Nonparametric Models," Econometrica, Econometric Society, vol. 71(5), pages 1565-1578, September.
    11. Chunrong Ai & Xiaohong Chen, 2003. "Efficient Estimation of Models with Conditional Moment Restrictions Containing Unknown Functions," Econometrica, Econometric Society, vol. 71(6), pages 1795-1843, November.
    12. Susan Athey & Raj Chetty & Guido Imbens, 2025. "The Experimental Selection Correction Estimator: Using Experiments to Remove Biases in Observational Estimates," NBER Working Papers 33817, National Bureau of Economic Research, Inc.
    13. Imbens,Guido W. & Rubin,Donald B., 2015. "Causal Inference for Statistics, Social, and Biomedical Sciences," Cambridge Books, Cambridge University Press, number 9780521885881, November.
    14. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    15. Victor Chernozhukov & Christian Hansen, 2005. "An IV Model of Quantile Treatment Effects," Econometrica, Econometric Society, vol. 73(1), pages 245-261, January.
    16. Oliver Dukes & Ilya Shpitser & Eric J Tchetgen, 2023. "Proximal mediation analysis," Biometrika, Biometrika Trust, vol. 110(4), pages 973-987.
    17. Wang Miao & Zhi Geng & Eric J Tchetgen Tchetgen, 2018. "Identifying causal effects with proxy variables of an unmeasured confounder," Biometrika, Biometrika Trust, vol. 105(4), pages 987-993.
    18. An, Yonghong & Hu, Yingyao, 2012. "Well-posedness of measurement error models for self-reported data," Journal of Econometrics, Elsevier, vol. 168(2), pages 259-269.
    19. Marshall M. Joffe & Tom Greene, 2009. "Related Causal Frameworks for Surrogate Outcomes," Biometrics, The International Biometric Society, vol. 65(2), pages 530-538, June.
    20. Heckman, James & Pinto, Rodrigo, 2015. "Causal Analysis After Haavelmo," Econometric Theory, Cambridge University Press, vol. 31(1), pages 115-151, February.
    21. Wang Miao & Xu Shi & Yilin Li & Eric J. Tchetgen Tchetgen, 2024. "A confounding bridge approach for double negative control inference on causal effects," Statistical Theory and Related Fields, Taylor & Francis Journals, vol. 8(4), pages 262-273, October.
    22. Zhengling Qi & Rui Miao & Xiaoke Zhang, 2024. "Proximal Learning for Individualized Treatment Regimes Under Unmeasured Confounding," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 119(546), pages 915-928, April.
    23. Amiremad Ghassami & Alan Yang & Ilya Shpitser & Eric Tchetgen Tchetgen, 2025. "Causal inference with hidden mediators," Biometrika, Biometrika Trust, vol. 112(1), pages 5633-5751.
    24. repec:mpr:mprres:2951 is not listed on IDEAS
    25. Jiafeng Chen & David M. Ritzwoller, 2021. "Semiparametric Estimation of Long-Term Treatment Effects," Papers 2107.14405, arXiv.org, revised Aug 2023.
    26. Yingyao Hu & Susanne M. Schennach, 2008. "Instrumental Variable Treatment of Nonclassical Measurement Error Models," Econometrica, Econometric Society, vol. 76(1), pages 195-216, January.
    27. Hua Chen & Zhi Geng & Jinzhu Jia, 2007. "Criteria for surrogate end points," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(5), pages 919-932, November.
    28. Ben Deaner, 2018. "Proxy Controls and Panel Data," Papers 1810.00283, arXiv.org, revised Nov 2023.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Guido Imbens & Nathan Kallus & Xiaojie Mao & Yuhao Wang, 2022. "Long-term Causal Inference Under Persistent Confounding via Data Combination," Papers 2202.07234, arXiv.org, revised Aug 2024.
    2. Isaac Meza & Rahul Singh, 2021. "Nested Nonparametric Instrumental Variable Regression," Papers 2112.14249, arXiv.org, revised May 2025.
    3. Andrews, Donald W.K., 2017. "Examples of L2-complete and boundedly-complete distributions," Journal of Econometrics, Elsevier, vol. 199(2), pages 213-220.
    4. Fatema Shafie Khorassani & Jeremy M. G. Taylor & Niko Kaciroti & Michael R. Elliott, 2023. "Incorporating Covariates into Measures of Surrogate Paradox Risk," Stats, MDPI, vol. 6(1), pages 1-23, February.
    5. Hu, Yingyao & Schennach, Susanne M. & Shiu, Ji-Liang, 2017. "Injectivity of a class of integral operators with compactly supported kernels," Journal of Econometrics, Elsevier, vol. 200(1), pages 48-58.
    6. Yuta Ota & Takahiro Hoshino & Taisuke Otsu, 2024. "Causal Inference With Auxiliary Observations," Keio-IES Discussion Paper Series 2024-022, Institute for Economics Studies, Keio University.
    7. Ben Deaner, 2022. "Controlling for Latent Confounding with Triple Proxies," Papers 2204.13815, arXiv.org, revised May 2023.
    8. Jingwen Zhang & Yifang Chen & Amandeep Singh, 2022. "Causal Bandits: Online Decision-Making in Endogenous Settings," Papers 2211.08649, arXiv.org, revised Feb 2023.
    9. Schennach, Susanne M., 2020. "Mismeasured and unobserved variables," Handbook of Econometrics, in: Steven N. Durlauf & Lars Peter Hansen & James J. Heckman & Rosa L. Matzkin (ed.), Handbook of Econometrics, edition 1, volume 7, chapter 0, pages 487-565, Elsevier.
    10. Zhichao Jiang & Peng Ding & Zhi Geng, 2016. "Principal causal effect identification and surrogate end point evaluation by multiple trials," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(4), pages 829-848, September.
    11. Jin Li & Ye Luo & Zigan Wang & Xiaowei Zhang, 2021. "Asymptotic Theory for IV-Based Reinforcement Learning with Potential Endogeneity," Papers 2103.04021, arXiv.org, revised Dec 2024.
    12. Zhongren Chen & Siyu Chen & Zhengling Qi & Xiaohong Chen & Zhuoran Yang, 2025. "Quantile-Optimal Policy Learning under Unmeasured Confounding," Cowles Foundation Discussion Papers 2469, Cowles Foundation for Research in Economics, Yale University.
    13. Halbert White & Karim Chalak, 2013. "Identification and Identification Failure for Treatment Effects Using Structural Systems," Econometric Reviews, Taylor & Francis Journals, vol. 32(3), pages 273-317, November.
    14. David M. Ritzwoller & Vasilis Syrgkanis, 2024. "Simultaneous Inference for Local Structural Parameters with Random Forests," Papers 2405.07860, arXiv.org, revised Sep 2024.
    15. Dmitry Arkhangelsky & Guido Imbens, 2023. "Causal Models for Longitudinal and Panel Data: A Survey," Papers 2311.15458, arXiv.org, revised Jun 2024.
    16. Ivan A. Canay & Andres Santos & Azeem M. Shaikh, 2013. "On the Testability of Identification in Some Nonparametric Models With Endogeneity," Econometrica, Econometric Society, vol. 81(6), pages 2535-2559, November.
    17. Simon Calmar Andersen & Louise Beuchert & Phillip Heiler & Helena Skyt Nielsen, 2023. "A Guide to Impact Evaluation under Sample Selection and Missing Data: Teacher's Aides and Adolescent Mental Health," Papers 2308.04963, arXiv.org.
    18. Yanqin Fan & Carlos A. Manzanares & Hyeonseok Park & Yuan Qi, 2026. "A Sensitivity Analysis of the Surrogate Index Approach for Estimating Long-Term Treatment Effects," Papers 2603.00580, arXiv.org.
    19. Chuan Ju & Zhi Geng, 2010. "Criteria for surrogate end points based on causal distributions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 72(1), pages 129-142, January.
    20. Mogstad, Magne & Torgovitsky, Alexander, 2024. "Instrumental variables with unobserved heterogeneity in treatment effects," Handbook of Labor Economics,, Elsevier.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2601.17712. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.