IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2105.15197.html
   My bibliography  Save this paper

A Simple and General Debiased Machine Learning Theorem with Finite Sample Guarantees

Author

Listed:
  • Victor Chernozhukov
  • Whitney K. Newey
  • Rahul Singh

Abstract

Debiased machine learning is a meta algorithm based on bias correction and sample splitting to calculate confidence intervals for functionals, i.e. scalar summaries, of machine learning algorithms. For example, an analyst may desire the confidence interval for a treatment effect estimated with a neural network. We provide a nonasymptotic debiased machine learning theorem that encompasses any global or local functional of any machine learning algorithm that satisfies a few simple, interpretable conditions. Formally, we prove consistency, Gaussian approximation, and semiparametric efficiency by finite sample arguments. The rate of convergence is $n^{-1/2}$ for global functionals, and it degrades gracefully for local functionals. Our results culminate in a simple set of conditions that an analyst can use to translate modern learning theory rates into traditional statistical inference. The conditions reveal a general double robustness property for ill posed inverse problems.

Suggested Citation

  • Victor Chernozhukov & Whitney K. Newey & Rahul Singh, 2021. "A Simple and General Debiased Machine Learning Theorem with Finite Sample Guarantees," Papers 2105.15197, arXiv.org, revised Oct 2022.
  • Handle: RePEc:arx:papers:2105.15197
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2105.15197
    File Function: Latest version
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    2. Andrews, Donald W K, 1994. "Asymptotics for Semiparametric Econometric Models via Stochastic Equicontinuity," Econometrica, Econometric Society, vol. 62(1), pages 43-72, January.
    3. Robinson, Peter M, 1988. "Root- N-Consistent Semiparametric Regression," Econometrica, Econometric Society, vol. 56(4), pages 931-954, July.
    4. Severini, Thomas A. & Tripathi, Gautam, 2012. "Efficiency bounds for estimating linear functionals of nonparametric regression models with endogenous regressors," Journal of Econometrics, Elsevier, vol. 170(2), pages 491-498.
    5. Newey, Whitney K, 1994. "The Asymptotic Variance of Semiparametric Estimators," Econometrica, Econometric Society, vol. 62(6), pages 1349-1382, November.
    6. Rahul Singh, 2021. "Debiased Kernel Methods," Papers 2102.11076, arXiv.org, revised Mar 2021.
    7. Nishanth Dikkala & Greg Lewis & Lester Mackey & Vasilis Syrgkanis, 2020. "Minimax Estimation of Conditional Moment Models," Papers 2006.07201, arXiv.org.
    8. Jason Abrevaya & Yu-Chin Hsu & Robert P. Lieli, 2015. "Estimating Conditional Average Treatment Effects," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 33(4), pages 485-505, October.
    9. Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2022. "Locally Robust Semiparametric Estimation," Econometrica, Econometric Society, vol. 90(4), pages 1501-1535, July.
    10. Whitney K. Newey & Fushing Hsieh & James M. Robins, 2004. "Twicing Kernels and a Small Bias Property of Semiparametric Estimators," Econometrica, Econometric Society, vol. 72(3), pages 947-962, May.
    11. Yoici Arai & Taisuke Otsu & Myung Hwan Seo, 2019. "Causal inference on regression discontinuity designs by high-dimensional methods," STICERD - Econometrics Paper Series 601, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
    12. Newey, Whitney K., 1994. "Kernel Estimation of Partial Means and a General Variance Estimator," Econometric Theory, Cambridge University Press, vol. 10(2), pages 1-21, June.
    13. Victor Chernozhukov & Whitney Newey & Rahul Singh & Vasilis Syrgkanis, 2020. "Adversarial Estimation of Riesz Representers," Papers 2101.00009, arXiv.org, revised Apr 2024.
    14. Whitney K. Newey & James L. Powell, 2003. "Instrumental Variable Estimation of Nonparametric Models," Econometrica, Econometric Society, vol. 71(5), pages 1565-1578, September.
    15. Chunrong Ai & Xiaohong Chen, 2003. "Efficient Estimation of Models with Conditional Moment Restrictions Containing Unknown Functions," Econometrica, Econometric Society, vol. 71(6), pages 1795-1843, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Dmitry Arkhangelsky & Kazuharu Yanagimoto & Tom Zohar, 2024. "Flexible Analysis of Individual Heterogeneity in Event Studies: Application to the Child Penalty," Papers 2403.19563, arXiv.org.
    2. Jikai Jin & Vasilis Syrgkanis, 2024. "Structure-agnostic Optimality of Doubly Robust Learning for Treatment Effect Estimation," Papers 2402.14264, arXiv.org, revised Mar 2024.
    3. Andrew Bennett & Nathan Kallus & Xiaojie Mao & Whitney Newey & Vasilis Syrgkanis & Masatoshi Uehara, 2023. "Source Condition Double Robust Inference on Functionals of Inverse Problems," Papers 2307.13793, arXiv.org.
    4. Rahul Singh & Liyuan Xu & Arthur Gretton, 2021. "Sequential Kernel Embedding for Mediated and Time-Varying Dose Response Curves," Papers 2111.03950, arXiv.org, revised Jul 2023.
    5. Rahul Singh, 2021. "Generalized Kernel Ridge Regression for Causal Inference with Missing-at-Random Sample Selection," Papers 2111.05277, arXiv.org.
    6. David Bruns-Smith & Oliver Dukes & Avi Feller & Elizabeth L. Ogburn, 2023. "Augmented balancing weights as linear regression," Papers 2304.14545, arXiv.org, revised Aug 2023.
    7. Isaac Meza & Rahul Singh, 2021. "Nested Nonparametric Instrumental Variable Regression: Long Term, Mediated, and Time Varying Treatment Effects," Papers 2112.14249, arXiv.org, revised Mar 2024.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Isaac Meza & Rahul Singh, 2021. "Nested Nonparametric Instrumental Variable Regression: Long Term, Mediated, and Time Varying Treatment Effects," Papers 2112.14249, arXiv.org, revised Mar 2024.
    2. Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2022. "Locally Robust Semiparametric Estimation," Econometrica, Econometric Society, vol. 90(4), pages 1501-1535, July.
    3. Hidehiko Ichimura & Whitney K. Newey, 2022. "The influence function of semiparametric estimators," Quantitative Economics, Econometric Society, vol. 13(1), pages 29-61, January.
    4. Ganesh Karapakula, 2023. "Stable Probability Weighting: Large-Sample and Finite-Sample Estimation and Inference Methods for Heterogeneous Causal Effects of Multivalued Treatments Under Limited Overlap," Papers 2301.05703, arXiv.org, revised Jan 2023.
    5. Victor Chernozhukov & Whitney Newey & Rahul Singh & Vasilis Syrgkanis, 2020. "Adversarial Estimation of Riesz Representers," Papers 2101.00009, arXiv.org, revised Apr 2024.
    6. Rahul Singh, 2021. "Debiased Kernel Methods," Papers 2102.11076, arXiv.org, revised Mar 2021.
    7. Anish Agarwal & Rahul Singh, 2021. "Causal Inference with Corrupted Data: Measurement Error, Missing Values, Discretization, and Differential Privacy," Papers 2107.02780, arXiv.org, revised Feb 2024.
    8. Qizhao Chen & Vasilis Syrgkanis & Morgane Austern, 2022. "Debiased Machine Learning without Sample-Splitting for Stable Estimators," Papers 2206.01825, arXiv.org, revised Nov 2022.
    9. Dong, Chaohua & Gao, Jiti & Linton, Oliver, 2023. "High dimensional semiparametric moment restriction models," Journal of Econometrics, Elsevier, vol. 232(2), pages 320-345.
    10. Taisuke Otsu & Mengshan Xu, 2022. "Isotonic propensity score matching," STICERD - Econometrics Paper Series 623, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
    11. Mengshan Xu & Taisuke Otsu, 2022. "Isotonic propensity score matching," Papers 2207.08868, arXiv.org.
    12. Qi Li & Jeffrey Scott Racine, 2006. "Nonparametric Econometrics: Theory and Practice," Economics Books, Princeton University Press, edition 1, volume 1, number 8355.
    13. Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP72/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    14. Gayle, Wayne-Roy & Namoro, Soiliou Daw, 2013. "Estimation of a nonlinear panel data model with semiparametric individual effects," Journal of Econometrics, Elsevier, vol. 175(1), pages 46-59.
    15. Michael Jansson & Demian Pouzo, 2017. "Towards a General Large Sample Theory for Regularized Estimators," Papers 1712.07248, arXiv.org, revised Jul 2020.
    16. Gayle, George-Levi & Viauroux, Christelle, 2007. "Root-N consistent semiparametric estimators of a dynamic panel-sample-selection model," Journal of Econometrics, Elsevier, vol. 141(1), pages 179-212, November.
    17. Agboola, Oluwagbenga David & Yu, Han, 2023. "Neighborhood-based cross fitting approach to treatment effects with high-dimensional data," Computational Statistics & Data Analysis, Elsevier, vol. 186(C).
    18. Kyle Colangelo & Ying-Ying Lee, 2020. "Double Debiased Machine Learning Nonparametric Inference with Continuous Treatments," Papers 2004.03036, arXiv.org, revised Sep 2023.
    19. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    20. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2016. "Double/Debiased Machine Learning for Treatment and Causal Parameters," Papers 1608.00060, arXiv.org, revised Dec 2017.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2105.15197. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.