Inference on treatment effects after selection amongst high-dimensional controls

Inference on treatment effects after selection amongst high-dimensional controls

Author

Listed:

Alexandre Belloni
(Institute for Fiscal Studies)
Victor Chernozhukov
(Institute for Fiscal Studies and MIT)
Christian Hansen
(Institute for Fiscal Studies and Chicago GSB)

Abstract

We propose robust methods for inference on the effect of a treatment variable on a scalar outcome in the presence of very many controls. Our setting is a partially linear model with possibly non-Gaussian and heteroscedastic disturbances where the number of controls may be much larger than the sample size. To make informative inference feasible, we require the model to be approximately sparse; that is, we require that the effect of confounding factors can be controlled for up to a small approximation error by conditioning on a relatively small number of controls whose identities are unknown. The latter condition makes it possible to estimate the treatment effect by selecting approximately the right set of controls. We develop a novel estimation and uniformly valid inference method for the treatment effect in this setting, called the 'post-double-selection' method. Our results apply to Lasso-type methods used for covariate selection as well as to any other model selection method that is able to find a sparse model with good approximation properties. The main attractive feature of our method is that it allows for imperfect selection of the controls and provides confidence intervals that are valid uniformly across a large class of models. In contrast, standard post-model selection estimators fail to provide uniform inference even in simple cases with a small, fixed number of controls. Thus our method resolves the problem of uniform inference after model selection for a large, interesting class of models. We also present a simple generalisation of our method to a fully heterogeneous model with a binary treatment variable. We illustrate the use of the developed methods with numerical simulations and an application that considers the effect of abortion crime rates.

Suggested Citation

Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2013. "Inference on treatment effects after selection amongst high-dimensional controls," CeMMAP working papers CWP26/13, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.

Handle: RePEc:ifs:cemmap:26/13

Download full text from publisher

Other versions of this item:

Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2011. "Inference on Treatment Effects After Selection Amongst High-Dimensional Controls," Papers 1201.0224, arXiv.org, revised May 2012.
Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2013. "Inference on treatment effects after selection amongst high-dimensional controls," CeMMAP working papers 26/13, Institute for Fiscal Studies.
Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2012. "Inference on treatment effects after selection amongst high-dimensional controls," CeMMAP working papers 10/12, Institute for Fiscal Studies.
Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2012. "Inference on treatment effects after selection amongst high-dimensional controls," CeMMAP working papers CWP10/12, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.

References listed on IDEAS

Hardle, Wolfgang & LIang, Hua & Gao, Jiti, 2000. "Partially linear models," MPRA Paper 39562, University Library of Munich, Germany, revised 01 Sep 2000.
Robinson, Peter M, 1988. "Root- N-Consistent Semiparametric Regression," Econometrica, Econometric Society, vol. 56(4), pages 931-954, July.
Andrews, Donald W.K. & Cheng, Xu, 2013. "Maximum likelihood estimation and uniform inference with sporadic identification failure," Journal of Econometrics, Elsevier, vol. 173(1), pages 36-56.
- Donald W. K. Andrews & Xu Cheng, 2011. "Maximum Likelihood Estimation and Uniform Inference with Sporadic Identification Failure," Cowles Foundation Discussion Papers 1824R, Cowles Foundation for Research in Economics, Yale University, revised Oct 2012.
- Donald W. K. Andrews & Xu Cheng, 2011. "Maximum Likelihood Estimation and Uniform Inference with Sporadic Identification Failure," Cowles Foundation Discussion Papers 1824, Cowles Foundation for Research in Economics, Yale University.
Hansen, Bruce E., 2005. "Challenges For Econometric Model Selection," Econometric Theory, Cambridge University Press, vol. 21(1), pages 60-68, February.
Kerkyacharian, G. & Picard, D., 1992. "Density estimation in Besov spaces," Statistics & Probability Letters, Elsevier, vol. 13(1), pages 15-24, January.
Alberto Abadie & Guido W. Imbens, 2011. "Bias-Corrected Matching Estimators for Average Treatment Effects," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 29(1), pages 1-11, January.
- Abadie, Alberto & Imbens, Guido W., 2011. "Bias-Corrected Matching Estimators for Average Treatment Effects," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(1), pages 1-11.
MacKinnon, James G. & White, Halbert, 1985. "Some heteroskedasticity-consistent covariance matrix estimators with improved finite sample properties," Journal of Econometrics, Elsevier, vol. 29(3), pages 305-325, September.
- James G. MacKinnon & Halbert White, 1983. "Some Heteroskedasticity Consistent Covariance Matrix Estimators with Improved Finite Sample Properties," Working Paper 537, Economics Department, Queen's University.
A. Belloni & D. Chen & V. Chernozhukov & C. Hansen, 2012. "Sparse Models and Methods for Optimal Instruments With an Application to Eminent Domain," Econometrica, Econometric Society, vol. 80(6), pages 2369-2429, November.
- Alexandre Belloni & Daniel Chen & Victor Chernozhukov & Christian Hansen, 2010. "Sparse Models and Methods for Optimal Instruments with an Application to Eminent Domain," Papers 1010.4345, arXiv.org, revised Apr 2015.
- Alexandre Belloni & D. Chen & Victor Chernozhukov & Christian Hansen, 2010. "Sparse models and methods for optimal instruments with an application to eminent domain," CeMMAP working papers CWP31/10, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Koenker, Roger, 1988. "Asymptotic Theory and Econometric Practice," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 3(2), pages 139-147, April.
Leeb, Hannes & Pötscher, Benedikt M., 2008. "Can One Estimate The Unconditional Distribution Of Post-Model-Selection Estimators?," Econometric Theory, Cambridge University Press, vol. 24(2), pages 338-376, April.
- Hannes Leeb & Benedikt M. Potscher, 2003. "Can One Estimate the Conditional Distribution of Post-Model-Selection Estimators?," Cowles Foundation Discussion Papers 1444, Cowles Foundation for Research in Economics, Yale University.
- Leeb, Hannes & Pötscher, Benedikt M., 2005. "Can One Estimate the Unconditional Distribution of Post-Model-Selection Estimators ?," MPRA Paper 72, University Library of Munich, Germany.
Heckman, James J. & Lalonde, Robert J. & Smith, Jeffrey A., 1999. "The economics and econometrics of active labor market programs," Handbook of Labor Economics, in: O. Ashenfelter & D. Card (ed.), Handbook of Labor Economics, edition 1, volume 3, chapter 31, pages 1865-2097, Elsevier.
Guido W. Imbens, 2004. "Nonparametric Estimation of Average Treatment Effects Under Exogeneity: A Review," The Review of Economics and Statistics, MIT Press, vol. 86(1), pages 4-29, February.
- Guido W. Imbens, 2003. "Nonparametric Estimation of Average Treatment Effects under Exogeneity: A Review," NBER Technical Working Papers 0294, National Bureau of Economic Research, Inc.
Jinyong Hahn, 1998. "On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects," Econometrica, Econometric Society, vol. 66(2), pages 315-332, March.
James J. Heckman & Hidehiko Ichimura & Petra Todd, 1998. "Matching As An Econometric Evaluation Estimator," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 65(2), pages 261-294.
Christopher L. Foote & Christopher F. Goetz, 2008. "The Impact of Legalized Abortion on Crime: Comment," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 123(1), pages 407-423.
Eric Gautier & Alexandre Tsybakov, 2011. "High-Dimensional Instrumental Variables Regression and Confidence Sets," Working Papers 2011-13, Center for Research in Economics and Statistics.
- Eric Gautier & Christiern Rose, 2021. "High-dimensional instrumental variables regression and confidence sets," Working Papers hal-00591732, HAL.
- Gautier, Eric & Rose, Christiern & Tsybakov, Alexandre, 2018. "High-dimensional instrumental variables regression and confidence sets," TSE Working Papers 18-930, Toulouse School of Economics (TSE), revised Nov 2019.
Andrews, Donald W.K. & Cheng, Xu & Guggenberger, Patrik, 2020. "Generic results for establishing the asymptotic size of confidence sets and tests," Journal of Econometrics, Elsevier, vol. 218(2), pages 496-531.
- Donald W.K. Andrews & Xu Cheng & Patrik Guggenberger, 2011. "Generic Results for Establishing the Asymptotic Size of Confidence Sets and Tests," Cowles Foundation Discussion Papers 1813, Cowles Foundation for Research in Economics, Yale University.
Donald, S. G. & Newey, W. K., 1994. "Series Estimation of Semilinear Models," Journal of Multivariate Analysis, Elsevier, vol. 50(1), pages 30-40, July.
Keisuke Hirano & Guido W. Imbens & Geert Ridder, 2003. "Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score," Econometrica, Econometric Society, vol. 71(4), pages 1161-1189, July.
- Guido Imbens, 2000. "Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score," Econometric Society World Congress 2000 Contributed Papers 1166, Econometric Society.
- Keisuke Hirano & Guido W. Imbens & Geert Ridder, 2000. "Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score," NBER Technical Working Papers 0251, National Bureau of Economic Research, Inc.
Newey, Whitney K., 1997. "Convergence rates and asymptotic normality for series estimators," Journal of Econometrics, Elsevier, vol. 79(1), pages 147-168, July.
Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Guido W. Imbens & Jeffrey M. Wooldridge, 2009. "Recent Developments in the Econometrics of Program Evaluation," Journal of Economic Literature, American Economic Association, vol. 47(1), pages 5-86, March.
- Guido Imbens & Jeffrey M. Wooldridge, 2008. "Recent developments in the econometrics of program evaluation," CeMMAP working papers CWP24/08, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Wooldridge, Jeffrey M. & Imbens, Guido, 2009. "Recent Developments in the Econometrics of Program Evaluation," Scholarly Articles 3043416, Harvard University Department of Economics.
- Guido M. Imbens & Jeffrey M. Wooldridge, 2008. "Recent Developments in the Econometrics of Program Evaluation," NBER Working Papers 14251, National Bureau of Economic Research, Inc.
- Imbens, Guido W. & Wooldridge, Jeffrey M., 2008. "Recent Developments in the Econometrics of Program Evaluation," IZA Discussion Papers 3640, IZA Network @ LISER.
Qi Li & Jeffrey Scott Racine, 2006. "Nonparametric Econometrics: Theory and Practice," Economics Books, Princeton University Press, edition 1, volume 1, number 8355, December.
Lin, Zhexiao & Han, Fang, 2025. "On regression-adjusted imputation estimators of average treatment effects," Journal of Econometrics, Elsevier, vol. 251(C).
repec:hum:wpaper:sfb649dp2014-043 is not listed on IDEAS
Farrell, Max H., 2015. "Robust inference on average treatment effects with possibly more covariates than observations," Journal of Econometrics, Elsevier, vol. 189(1), pages 1-23.
- Max H. Farrell, 2013. "Robust Inference on Average Treatment Effects with Possibly More Covariates than Observations," Papers 1309.4686, arXiv.org, revised Feb 2018.
Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2011. "Inference for high-dimensional sparse econometric models," CeMMAP working papers CWP41/11, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2011. "Inference for High-Dimensional Sparse Econometric Models," Papers 1201.0220, arXiv.org.
Taisuke Otsu & Mengshan Xu, 2022. "Isotonic propensity score matching," STICERD - Econometrics Paper Series 623, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
Hsiao, Cheng & Zhou, Qiankun, 2025. "Statistical inference for the low dimensional parameters of linear regression models in the presence of high-dimensional data: An orthogonal projection approach," Journal of Econometrics, Elsevier, vol. 252(PB).
Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2014. "High-Dimensional Methods and Inference on Structural and Treatment Effects," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 29-50, Spring.
- Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2013. "High dimensional methods and inference on structural and treatment effects," CeMMAP working papers 59/13, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2013. "High dimensional methods and inference on structural and treatment effects," CeMMAP working papers CWP59/13, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Mengshan Xu & Taisuke Otsu, 2022. "Isotonic propensity score matching," Papers 2207.08868, arXiv.org, revised Jan 2025.
Richard K. Crump & V. Joseph Hotz & Guido W. Imbens & Oscar A. Mitnik, 2006. "Moving the Goalposts: Addressing Limited Overlap in the Estimation of Average Treatment Effects by Changing the Estimand," NBER Technical Working Papers 0330, National Bureau of Economic Research, Inc.
- Richard K. Crump & V. Joseph Hotz & Guido W. Imbens & Oscar A. Mitnik, 2006. "Moving the Goalposts: Addressing Limited Overlap in Estimation of Average Treatment Effects by Changing the Estimand," Working Papers 0608, University of Miami, Department of Economics.
- Crump, Richard K. & Hotz, V. Joseph & Imbens, Guido W. & Mitnik, Oscar A., 2006. "Moving the Goalposts: Addressing Limited Overlap in Estimation of Average Treatment Effects by Changing the Estimand," IZA Discussion Papers 2347, IZA Network @ LISER.
Mammen, Enno & Rothe, Christoph & Schienle, Melanie, 2016. "Semiparametric Estimation With Generated Covariates," Econometric Theory, Cambridge University Press, vol. 32(5), pages 1140-1177, October.
- Mammen, Enno & Rothe, Christoph & Schienle, Melanie, 2011. "Semiparametric estimation with generated covariates," SFB 649 Discussion Papers 2011-064, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.
- Mammen, Enno & Rothe, Christoph & Schienle, Melanie, 2011. "Semiparametric Estimation with Generated Covariates," IZA Discussion Papers 6084, IZA Network @ LISER.
- Mammen, Enno & Rothe, Christoph & Schienle, Melanie, 2016. "Semiparametric estimation with generated covariates," Working Paper Series in Economics 81, Karlsruhe Institute of Technology (KIT), Department of Economics and Management.
- Mammen, Enno & Rothe, Christoph & Schienle, Melanie, 2014. "Semiparametric Estimation with Generated Covariates," SFB 649 Discussion Papers 2014-043, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.
Jochen Kluve & Boris Augurzky, 2007. "Assessing the performance of matching algorithms when selection into treatment is strong," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 22(3), pages 533-557.
- Augurzky, Boris & Kluve, Jochen, 2004. "Assessing the Performance of Matching Algorithms When Selection into Treatment Is Strong," IZA Discussion Papers 1301, IZA Network @ LISER.
Frölich, Markus & Huber, Martin & Wiesenfarth, Manuel, 2017. "The finite sample performance of semi- and non-parametric estimators for treatment effects and policy evaluation," Computational Statistics & Data Analysis, Elsevier, vol. 115(C), pages 91-102.
- Frölich, Markus & Huber, Martin & Wiesenfarth, Manuel, 2015. "The Finite Sample Performance of Semi- and Nonparametric Estimators for Treatment Effects and Policy Evaluation," IZA Discussion Papers 8756, IZA Network @ LISER.
- Frölich, Markus & Huber, Martin & Wiesenfarth, Manuel, 2015. "The finite sample performance of semi- and nonparametric estimators for treatment effects and policy evaluation," FSES Working Papers 454, Faculty of Economics and Social Sciences, University of Freiburg/Fribourg Switzerland.
Difang Huang & Jiti Gao & Tatsushi Oka, 2025. "Semiparametric single-index estimation for average treatment effects," Econometric Reviews, Taylor & Francis Journals, vol. 44(6), pages 843-885, July.
- Difang Huang & Jiti Gao & Tatsushi Oka, 2022. "Semiparametric Single-Index Estimation for Average Treatment Effects," Papers 2206.08503, arXiv.org, revised Jan 2025.
- Difang Huang & Jiti Gao & Tatsushi Oka, 2022. "Semiparametric Single-Index Estimation for Average Treatment Effects," Monash Econometrics and Business Statistics Working Papers 10/22, Monash University, Department of Econometrics and Business Statistics.
Donald, Stephen G. & Hsu, Yu-Chin, 2014. "Estimation and inference for distribution functions and quantile functions in treatment effect models," Journal of Econometrics, Elsevier, vol. 178(P3), pages 383-397.
- Stephen G. Donald & Yu-Chin Hsu, 2012. "Estimation and Inference for Distribution Functions and Quantile Functions in Treatment Effect Models," IEAS Working Paper : academic research 12-A016, Institute of Economics, Academia Sinica, Taipei, Taiwan.
John C. Ham & Xianghong Li & Patricia B. Reagan, 2004. "Propensity Score Matching, a Distance-Based Measure of Migration, and the Wage Growth of Young Men," Working Papers 2004_3, York University, Department of Economics.
- John C. Ham & Xianghong Li & Patricia Reagan, 2005. "Propensity score matching, a distance-based measure of migration, and the wage growth of young men," Staff Reports 212, Federal Reserve Bank of New York.
- John C. Ham & Xianghong Li & Patricia B. Reagan, 2004. "Propensity Score Matching, a Distance-Based Measure of Migration, and the Wage Growth of Young Men," IEPR Working Papers 05.13, Institute of Economic Policy Research (IEPR).
Dong, Chaohua & Gao, Jiti & Linton, Oliver, 2023. "High dimensional semiparametric moment restriction models," Journal of Econometrics, Elsevier, vol. 232(2), pages 320-345.
- Chaohua Dong & Jiti Gao & Oliver Linton, 2017. "High dimensional semiparametric moment restriction models," Monash Econometrics and Business Statistics Working Papers 17/17, Monash University, Department of Econometrics and Business Statistics.
- Chaohua Dong & Jiti Gao & Oliver Linton, 2018. "High dimensional semiparametric moment restriction models," CeMMAP working papers CWP69/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Chaohua Dong & Jiti Gao & Oliver Linton, 2018. "High dimensional semiparametric moment restriction models," CeMMAP working papers CWP04/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Chaohua Dong & Jiti Gao & Oliver Linton, 2018. "High dimensional semiparametric moment restriction models," Monash Econometrics and Business Statistics Working Papers 23/18, Monash University, Department of Econometrics and Business Statistics.
- Dong, C. & Gao, J. & Linton, O., 2018. "High Dimensional Semiparametric Moment Restriction Models," Cambridge Working Papers in Economics 1881, Faculty of Economics, University of Cambridge.
Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2017. "Double/Debiased Machine Learning for Treatment and Structural Parameters," NBER Working Papers 23564, National Bureau of Economic Research, Inc.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers CWP28/17, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers 28/17, Institute for Fiscal Studies.
Ham, John C. & Li, Xianghong & Reagan, Patricia B., 2011. "Matching and semi-parametric IV estimation, a distance-based measure of migration, and the wages of young men," Journal of Econometrics, Elsevier, vol. 161(2), pages 208-227, April.
Jones A.M & Rice N, 2009. "Econometric Evaluation of Health Policies," Health, Econometrics and Data Group (HEDG) Working Papers 09/09, HEDG, c/o Department of Economics, University of York.

More about this item

Keywords

; ; ; ; ; ; ;

NEP fields

This paper has been announced in the following NEP Reports:

NEP-ECM-2013-06-16 (Econometrics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ifs:cemmap:26/13. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Emma Hyman (email available below). General contact details of provider: https://edirc.repec.org/data/cmifsuk.html .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Inference on treatment effects after selection amongst high-dimensional controls

Author

Abstract

Suggested Citation

Download full text from publisher

Other versions of this item:

References listed on IDEAS

Most related items

More about this item

Keywords

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data