IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2507.20550.html
   My bibliography  Save this paper

Policy Learning under Unobserved Confounding: A Robust and Efficient Approach

Author

Listed:
  • Zequn Jin
  • Gaoqian Xu
  • Xi Zheng
  • Yahong Zhou

Abstract

This paper develops a robust and efficient method for policy learning from observational data in the presence of unobserved confounding, complementing existing instrumental variable (IV) based approaches. We employ the marginal sensitivity model (MSM) to relax the commonly used yet restrictive unconfoundedness assumption by introducing a sensitivity parameter that captures the extent of selection bias induced by unobserved confounders. Building on this framework, we consider two distributionally robust welfare criteria, defined as the worst-case welfare and policy improvement functions, evaluated over an uncertainty set of counterfactual distributions characterized by the MSM. Closed-form expressions for both welfare criteria are derived. Leveraging these identification results, we construct doubly robust scores and estimate the robust policies by maximizing the proposed criteria. Our approach accommodates flexible machine learning methods for estimating nuisance components, even when these converge at moderately slow rate. We establish asymptotic regret bounds for the resulting policies, providing a robust guarantee against the most adversarial confounding scenario. The proposed method is evaluated through extensive simulation studies and empirical applications to the JTPA study and Head Start program.

Suggested Citation

  • Zequn Jin & Gaoqian Xu & Xi Zheng & Yahong Zhou, 2025. "Policy Learning under Unobserved Confounding: A Robust and Efficient Approach," Papers 2507.20550, arXiv.org.
  • Handle: RePEc:arx:papers:2507.20550
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2507.20550
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Zhengyuan Zhou & Susan Athey & Stefan Wager, 2023. "Offline Multi-Action Policy Learning: Generalization and Optimization," Operations Research, INFORMS, vol. 71(1), pages 148-183, January.
    2. Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2014. "High-Dimensional Methods and Inference on Structural and Treatment Effects," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 29-50, Spring.
    3. Currie, Janet & Thomas, Duncan, 1995. "Does Head Start Make a Difference?," American Economic Review, American Economic Association, vol. 85(3), pages 341-364, June.
    4. David Deming, 2009. "Early Childhood Intervention and Life-Cycle Skill Development: Evidence from Head Start," American Economic Journal: Applied Economics, American Economic Association, vol. 1(3), pages 111-134, July.
    5. Max H. Farrell & Tengyuan Liang & Sanjog Misra, 2021. "Deep Neural Networks for Estimation and Inference," Econometrica, Econometric Society, vol. 89(1), pages 181-213, January.
    6. Toru Kitagawa & Aleksey Tetenov, 2021. "Equality-Minded Treatment Choice," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 39(2), pages 561-574, March.
    7. Nathan Kallus & Angela Zhou, 2021. "Minimax-Optimal Policy Learning Under Unobserved Confounding," Management Science, INFORMS, vol. 67(5), pages 2870-2890, May.
    8. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    9. Jesse Y. Hsu & Dylan S. Small, 2013. "Calibrating Sensitivity Analyses to Observed Covariates in Observational Studies," Biometrics, The International Biometric Society, vol. 69(4), pages 803-811, December.
    10. Alberto Abadie & Joshua Angrist & Guido Imbens, 2002. "Instrumental Variables Estimates of the Effect of Subsidized Training on the Quantiles of Trainee Earnings," Econometrica, Econometric Society, vol. 70(1), pages 91-117, January.
    11. Tan, Zhiqiang, 2006. "A Distributional Approach for Causal Inference Using Propensity Scores," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1619-1637, December.
    12. Susan Athey & Guido W. Imbens, 2017. "The State of Applied Econometrics: Causality and Policy Evaluation," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 3-32, Spring.
    13. Christopher R. Walters, 2015. "Inputs in the Production of Early Childhood Human Capital: Evidence from Head Start," American Economic Journal: Applied Economics, American Economic Association, vol. 7(4), pages 76-102, October.
    14. Toru Kitagawa & Aleksey Tetenov, 2018. "Who Should Be Treated? Empirical Welfare Maximization Methods for Treatment Choice," Econometrica, Econometric Society, vol. 86(2), pages 591-616, March.
    15. Nian Si & Fan Zhang & Zhengyuan Zhou & Jose Blanchet, 2023. "Distributionally Robust Batch Contextual Bandits," Management Science, INFORMS, vol. 69(10), pages 5772-5793, October.
    16. Hongxiang Qiu & Marco Carone & Ekaterina Sadikova & Maria Petukhova & Ronald C. Kessler & Alex Luedtke, 2021. "Optimal Individualized Decision Rules Using Instrumental Variable Methods," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(533), pages 174-191, March.
    17. Newey, Whitney K, 1994. "The Asymptotic Variance of Semiparametric Estimators," Econometrica, Econometric Society, vol. 62(6), pages 1349-1382, November.
    18. Tomasz Olma, 2021. "Nonparametric Estimation of Truncated Conditional Expectation Functions," Papers 2109.06150, arXiv.org.
    19. Jacob Dorn & Kevin Guo & Nathan Kallus, 2025. "Doubly-Valid/Doubly-Sharp Sensitivity Analysis for Causal Inference with Unmeasured Confounding," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 120(549), pages 331-342, January.
    20. Jacob Dorn & Kevin Guo, 2023. "Sharp Sensitivity Analysis for Inverse Propensity Weighting via Quantile Balancing," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(544), pages 2645-2657, October.
    21. Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2022. "Locally Robust Semiparametric Estimation," Econometrica, Econometric Society, vol. 90(4), pages 1501-1535, July.
    22. Fissler, Tobias & Merz, Michael & Wüthrich, Mario V., 2023. "Deep quantile and deep composite triplet regression," Insurance: Mathematics and Economics, Elsevier, vol. 109(C), pages 94-112.
    23. Belloni, Alexandre & Chernozhukov, Victor & Chetverikov, Denis & Fernández-Val, Iván, 2019. "Conditional quantile processes based on series or many regressors," Journal of Econometrics, Elsevier, vol. 213(1), pages 4-29.
    24. Ethan X. Fang & Zhaoran Wang & Lan Wang, 2023. "Fairness-Oriented Learning for Optimal Individualized Treatment Rules," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(543), pages 1733-1746, July.
    25. A. Belloni & V. Chernozhukov & I. Fernández‐Val & C. Hansen, 2017. "Program Evaluation and Causal Inference With High‐Dimensional Data," Econometrica, Econometric Society, vol. 85, pages 233-298, January.
    26. Jens Ludwig & Douglas L. Miller, 2007. "Does Head Start Improve Children's Life Chances? Evidence from a Regression Discontinuity Design," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 122(1), pages 159-208.
    27. Howard S. Bloom & Larry L. Orr & Stephen H. Bell & George Cave & Fred Doolittle & Winston Lin & Johannes M. Bos, 1997. "The Benefits and Costs of JTPA Title II-A Programs: Key Findings from the National Job Training Partnership Act Study," Journal of Human Resources, University of Wisconsin Press, vol. 32(3), pages 549-576.
    28. Yifan Cui & Eric Tchetgen Tchetgen, 2021. "A Semiparametric Instrumental Variable Approach to Optimal Treatment Regimes Under Endogeneity," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(533), pages 162-173, January.
    29. Lan Wang & Yu Zhou & Rui Song & Ben Sherwood, 2018. "Quantile-Optimal Treatment Regimes," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1243-1254, July.
    30. Qingyuan Zhao & Dylan S. Small & Bhaswar B. Bhattacharya, 2019. "Sensitivity analysis for inverse probability weighting estimators via the percentile bootstrap," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 81(4), pages 735-761, September.
    31. Hongxiang Qiu & Marco Carone & Ekaterina Sadikova & Maria Petukhova & Ronald C. Kessler & Alex Luedtke, 2021. "Rejoinder: Optimal Individualized Decision Rules Using Instrumental Variable Methods," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(533), pages 207-209, March.
    32. Matthew A. Masten & Alexandre Poirier & Muyang Ren, 2025. "A General Approach to Relaxing Unconfoundedness," Papers 2501.15400, arXiv.org.
    33. Matthew S. Johnson & David I. Levine & Michael W. Toffel, 2023. "Improving Regulatory Effectiveness through Better Targeting: Evidence from OSHA," American Economic Journal: Applied Economics, American Economic Association, vol. 15(4), pages 30-67, October.
    34. Susan Athey & Stefan Wager, 2021. "Policy Learning With Observational Data," Econometrica, Econometric Society, vol. 89(1), pages 133-161, January.
    35. Davide Viviano & Jelena Bradic, 2024. "Fair Policy Targeting," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 119(545), pages 730-743, January.
    36. Matthew A. Masten & Alexandre Poirier, 2018. "Identification of Treatment Effects Under Conditional Partial Independence," Econometrica, Econometric Society, vol. 86(1), pages 317-351, January.
    37. Jacob Dorn & Kevin Guo, 2021. "Sharp Sensitivity Analysis for Inverse Propensity Weighting via Quantile Balancing," Papers 2102.04543, arXiv.org, revised Aug 2023.
    38. Imbens, Guido W & Angrist, Joshua D, 1994. "Identification and Estimation of Local Average Treatment Effects," Econometrica, Econometric Society, vol. 62(2), pages 467-475, March.
    39. Zhengling Qi & Jong-Shi Pang & Yufeng Liu, 2023. "On Robustness of Individualized Decision Rules," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(543), pages 2143-2157, July.
    40. Hongming Pu & Bo Zhang, 2021. "Estimating optimal treatment rules with an instrumental variable: A partial identification learning approach," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(2), pages 318-345, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yanqin Fan & Yuan Qi & Gaoqian Xu, 2025. "Policy Learning with $\alpha$-Expected Welfare," Papers 2505.00256, arXiv.org.
    2. Michael Lechner, 2023. "Causal Machine Learning and its use for public policy," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 159(1), pages 1-15, December.
    3. Huber, Martin, 2019. "An introduction to flexible methods for policy evaluation," FSES Working Papers 504, Faculty of Economics and Social Sciences, University of Freiburg/Fribourg Switzerland.
    4. Ashesh Rambachan & Amanda Coston & Edward Kennedy, 2022. "Robust Design and Evaluation of Predictive Algorithms under Unobserved Confounding," Papers 2212.09844, arXiv.org, revised Nov 2025.
    5. Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP72/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    6. Semenova, Vira, 2023. "Debiased machine learning of set-identified linear models," Journal of Econometrics, Elsevier, vol. 235(2), pages 1725-1746.
    7. Kyle Colangelo & Ying-Ying Lee, 2020. "Double Debiased Machine Learning Nonparametric Inference with Continuous Treatments," Papers 2004.03036, arXiv.org, revised Sep 2023.
    8. Susan Athey & Stefan Wager, 2021. "Policy Learning With Observational Data," Econometrica, Econometric Society, vol. 89(1), pages 133-161, January.
    9. Nathan Kallus, 2023. "Treatment Effect Risk: Bounds and Inference," Management Science, INFORMS, vol. 69(8), pages 4579-4590, August.
    10. Vira Semenova, 2020. "Generalized Lee Bounds," Papers 2008.12720, arXiv.org, revised May 2025.
    11. Ganesh Karapakula, 2023. "Stable Probability Weighting: Large-Sample and Finite-Sample Estimation and Inference Methods for Heterogeneous Causal Effects of Multivalued Treatments Under Limited Overlap," Papers 2301.05703, arXiv.org, revised Jan 2023.
    12. Marianne P. Bitler & Hilary W. Hoynes & Thurston Domina, 2014. "Experimental Evidence on Distributional Effects of Head Start," NBER Working Papers 20434, National Bureau of Economic Research, Inc.
    13. Davide Viviano, 2019. "Policy Targeting under Network Interference," Papers 1906.10258, arXiv.org, revised Apr 2024.
    14. Michael C Knaus, 2022. "Double machine learning-based programme evaluation under unconfoundedness [Econometric methods for program evaluation]," The Econometrics Journal, Royal Economic Society, vol. 25(3), pages 602-627.
    15. Aditya Ghosh & Dominik Rothenhausler, 2025. "Assumption-robust Causal Inference," Papers 2505.08729, arXiv.org, revised Jun 2025.
    16. Nora Bearth & Michael Lechner & Jana Mareckova & Fabian Muny, 2025. "Fairness-Aware and Interpretable Policy Learning," Papers 2509.12119, arXiv.org.
    17. Yu-Chang Chen & Haitian Xie, 2022. "Personalized Subsidy Rules," Papers 2202.13545, arXiv.org, revised Mar 2022.
    18. Vira Semenova, 2023. "Debiased Machine Learning of Aggregated Intersection Bounds and Other Causal Parameters," Papers 2303.00982, arXiv.org, revised May 2025.
    19. Alejandro Sanchez-Becerra, 2023. "Robust inference for the treatment effect variance in experiments using machine learning," Papers 2306.03363, arXiv.org.
    20. Goller, Daniel & Lechner, Michael & Pongratz, Tamara & Wolff, Joachim, 2025. "Active labor market policies for the long-term unemployed: New evidence from causal machine learning," Labour Economics, Elsevier, vol. 94(C).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2507.20550. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.