IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2512.19230.html

Semiparametric Efficiency in Policy Learning with General Treatments

Author

Listed:
  • Yue Fang
  • Geert Ridder
  • Haitian Xie

Abstract

Recent literature on policy learning has primarily focused on regret bounds of the learned policy. We provide a new perspective by developing a unified semiparametric efficiency framework for policy learning, allowing for general treatments that are discrete, continuous, or mixed. We provide a characterization of the failure of pathwise differentiability for parameters arising from deterministic policies. We then establish efficiency bounds for pathwise differentiable parameters in randomized policies, both when the propensity score is known and when it must be estimated. Building on the convolution theorem, we introduce a notion of efficiency for the asymptotic distribution of welfare regret, showing that inefficient policy estimators not only inflate the variance of the asymptotic regret but also shift its mean upward. We derive the asymptotic theory of several common policy estimators, with a key contribution being a policy-learning analogue of the Hirano-Imbens-Ridder (HIR) phenomenon: the inverse propensity weighting estimator with an estimated propensity is efficient, whereas the same estimator using the true propensity is not. We illustrate the theoretical results with an empirically calibrated simulation study based on data from a job training program and an empirical application to a commitment savings program.

Suggested Citation

  • Yue Fang & Geert Ridder & Haitian Xie, 2025. "Semiparametric Efficiency in Policy Learning with General Treatments," Papers 2512.19230, arXiv.org, revised Feb 2026.
  • Handle: RePEc:arx:papers:2512.19230
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2512.19230
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Sebastian Calonico & Matias D. Cattaneo & Max H. Farrell, 2018. "On the Effect of Bias Estimation on Coverage Accuracy in Nonparametric Inference," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(522), pages 767-779, April.
    2. Cattaneo, Matias D. & Jansson, Michael & Ma, Xinwei, 2024. "Local regression distribution estimators," Journal of Econometrics, Elsevier, vol. 240(2).
    3. Panle Jia Barwick & Shanjun Li & Andrew Waxman & Jing Wu & Tianli Xia, 2024. "Efficiency and Equity Impacts of Urban Transportation Policies with Equilibrium Sorting," American Economic Review, American Economic Association, vol. 114(10), pages 3161-3205, October.
    4. Stoye, Jörg, 2009. "Minimax regret treatment choice with finite samples," Journal of Econometrics, Elsevier, vol. 151(1), pages 70-81, July.
    5. Xiaohong Chen & Oliver Linton & Ingrid Van Keilegom, 2003. "Estimation of Semiparametric Models when the Criterion Function Is Not Smooth," Econometrica, Econometric Society, vol. 71(5), pages 1591-1608, September.
    6. Toru Kitagawa & Aleksey Tetenov, 2021. "Equality-Minded Treatment Choice," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 39(2), pages 561-574, March.
    7. Eric Mbakop & Max Tabord‐Meehan, 2021. "Model Selection for Treatment Choice: Penalized Welfare Maximization," Econometrica, Econometric Society, vol. 89(2), pages 825-848, March.
    8. Matias D. Cattaneo & Michael Jansson & Xinwei Ma, 2020. "Simple Local Polynomial Density Estimators," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(531), pages 1449-1455, July.
    9. Xiaohong Chen & Haitian Xie, 2025. "On Local Overidentification and Efficiency Gains in Modern Causal Inference and Data Combination," Papers 2510.16683, arXiv.org, revised Feb 2026.
    10. Jinyong Hahn, 1998. "On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects," Econometrica, Econometric Society, vol. 66(2), pages 315-332, March.
    11. Nava Ashraf & Dean Karlan & Wesley Yin, 2006. "Tying Odysseus to the Mast: Evidence From a Commitment Savings Product in the Philippines," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 121(2), pages 635-672.
    12. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    13. Xiaohong Chen & Zhenxiao Chen & Wayne Yuan Gao, 2025. "Inference on Welfare and Value Functionals under Optimal Treatment Assignment," Papers 2510.25607, arXiv.org.
    14. Crippa, Federico, 2025. "Regret analysis in threshold policy design," Journal of Econometrics, Elsevier, vol. 249(PB).
    15. Chen, Xiaohong, 2007. "Large Sample Sieve Estimation of Semi-Nonparametric Models," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 76, Elsevier.
    16. Keisuke Hirano & Guido W. Imbens & Geert Ridder, 2003. "Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score," Econometrica, Econometric Society, vol. 71(4), pages 1161-1189, July.
    17. Amy Finkelstein & Sarah Taubman & Bill Wright & Mira Bernstein & Jonathan Gruber & Joseph P. Newhouse & Heidi Allen & Katherine Baicker, 2012. "The Oregon Health Insurance Experiment: Evidence from the First Year," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 127(3), pages 1057-1106.
    18. Susan Athey & Stefan Wager, 2021. "Policy Learning With Observational Data," Econometrica, Econometric Society, vol. 89(1), pages 133-161, January.
    19. Mert Demirer & Vasilis Syrgkanis & Greg Lewis & Victor Chernozhukov, 2019. "Semi-Parametric Efficient Policy Learning with Continuous Actions," Papers 1905.10116, arXiv.org, revised Jul 2019.
    20. Bhattacharya, Debopam & Dupas, Pascaline, 2012. "Inferring welfare maximizing treatment assignment under budget constraints," Journal of Econometrics, Elsevier, vol. 167(1), pages 168-196.
    21. Yue Fang & Jin Xi & Haitian Xie, 2025. "Model Selection for Multivalued-Treatment Policy Learning in Observational Studies," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 43(4), pages 897-909, October.
    22. Xiaohong Chen & Haitian Xie, 2025. "Local Overidentification and Efficiency Gains in Modern Causal Inference and Data Combination," Cowles Foundation Discussion Papers 2467, Cowles Foundation for Research in Economics, Yale University.
    23. Charles F. Manski, 2004. "Statistical Treatment Rules for Heterogeneous Populations," Econometrica, Econometric Society, vol. 72(4), pages 1221-1246, July.
    24. Federico Crippa, 2024. "Regret Analysis in Threshold Policy Design," Papers 2404.11767, arXiv.org, revised Apr 2025.
    25. Chunrong Ai & Oliver Linton & Kaiji Motegi & Zheng Zhang, 2021. "A unified framework for efficient estimation of general treatment models," Quantitative Economics, Econometric Society, vol. 12(3), pages 779-816, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chunrong Ai & Yue Fang & Haitian Xie, 2024. "Data-Driven Policy Learning for Continuous Treatments," Papers 2402.02535, arXiv.org, revised Dec 2025.
    2. Nan Liu & Yanbo Liu & Yuya Sasaki & Yuanyuan Wan, 2025. "Nonparametric Uniform Inference in Binary Classification and Policy Values," Papers 2511.14700, arXiv.org, revised Dec 2025.
    3. Hirano, Keisuke & Porter, Jack R., 2020. "Asymptotic analysis of statistical decision rules in econometrics," Handbook of Econometrics, in: Steven N. Durlauf & Lars Peter Hansen & James J. Heckman & Rosa L. Matzkin (ed.), Handbook of Econometrics, edition 1, volume 7, chapter 0, pages 283-354, Elsevier.
    4. Susan Athey & Stefan Wager, 2021. "Policy Learning With Observational Data," Econometrica, Econometric Society, vol. 89(1), pages 133-161, January.
    5. Toru Kitagawa & Weining Wang & Mengshan Xu, 2022. "Policy Choice in Time Series by Empirical Welfare Maximization," Papers 2205.03970, arXiv.org, revised Nov 2025.
    6. Toru Kitagawa & Weining Wang & Mengshan Xu, 2024. "Policy choice in time series by empirical welfare maximization," CeMMAP working papers 27/24, Institute for Fiscal Studies.
    7. Davide Viviano & Jess Rudder, 2020. "Policy design in experiments with unknown interference," Papers 2011.08174, arXiv.org, revised May 2024.
    8. Anders Bredahl Kock & David Preinerstorfer, 2024. "Regularizing Fairness in Optimal Policy Learning with Distributional Targets," Papers 2401.17909, arXiv.org, revised May 2025.
    9. Timothy Armstrong & Martin Weidner & Andrei Zeleneev, 2024. "Robust estimation and inference in panels with interactive fixed effects," IFS Working Papers WCWP28/24, Institute for Fiscal Studies.
    10. Toru Kitagawa & Sokbae Lee & Chen Qiu, 2023. "Treatment choice, mean square regret and partial identification," The Japanese Economic Review, Springer, vol. 74(4), pages 573-602, October.
    11. Chen, Xiaohong & Liu, Ying & Ma, Shujie & Zhang, Zheng, 2024. "Causal inference of general treatment effects using neural networks with a diverging number of confounders," Journal of Econometrics, Elsevier, vol. 238(1).
    12. Davide Viviano, 2019. "Policy Targeting under Network Interference," Papers 1906.10258, arXiv.org, revised Apr 2024.
    13. Yuehao Bai & Azeem M. Shaikh & Max Tabord-Meehan, 2024. "A Primer on the Analysis of Randomized Experiments and a Survey of some Recent Advances," Papers 2405.03910, arXiv.org, revised Apr 2025.
    14. Toru Kitagawa & Sokbae Lee & Chen Qiu, 2022. "Treatment Choice with Nonlinear Regret," Papers 2205.08586, arXiv.org, revised Oct 2024.
    15. Kirill Ponomarev & Vira Semenova, 2024. "On the Lower Confidence Band for the Optimal Welfare in Policy Learning," Papers 2410.07443, arXiv.org, revised Sep 2025.
    16. Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP72/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    17. Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP54/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    18. Kyle Colangelo & Ying-Ying Lee, 2020. "Double Debiased Machine Learning Nonparametric Inference with Continuous Treatments," Papers 2004.03036, arXiv.org, revised Sep 2023.
    19. Shosei Sakaguchi, 2021. "Estimation of Optimal Dynamic Treatment Assignment Rules under Policy Constraints," Papers 2106.05031, arXiv.org, revised Aug 2024.
    20. Kock, Anders Bredahl & Preinerstorfer, David & Veliyev, Bezirgen, 2023. "Treatment recommendation with distributional targets," Journal of Econometrics, Elsevier, vol. 234(2), pages 624-646.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2512.19230. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.