IDEAS home Printed from https://ideas.repec.org/p/ifs/cemmap/45-12.html
   My bibliography  Save this paper

Central limit theorems and multiplier bootstrap when p is much larger than n

Author

Listed:
  • Victor Chernozhukov

    (Institute for Fiscal Studies and MIT)

  • Denis Chetverikov

    (Institute for Fiscal Studies and UCLA)

  • Kengo Kato

    (Institute for Fiscal Studies)

Abstract

We derive a central limit theorem for the maximum of a sum of high dimensional random vectors. More precisely, we establish conditions under which the distribution of the maximum is approximated by the maximum of a sum of the Gaussian random vectors with the same covariance matrices as the original vectors. The key innovation of our result is that it applies even if the dimension of random vectors (p) is much larger than the sample size (n). In fact, the growth of p could be exponential in some fractional power of n. We also show that the distribution of the maximum of a sum of the Gaussian random vectors with unknown covariance matrices can be estimated by the distribution of the maximum of the (conditional) Gaussian process obtained by multiplying the original vectors with i.i.d. Gaussian multipliers. We call this procedure the “multiplier bootstrap”. Here too, the growth of p could be exponential in some fractional power of n. We prove that our distributional approximations, either Gaussian or conditional Gaussian, yield a high-quality approximation for the distribution of the original maximum, often with at most a polynomial approximation error. These results are of interest in numerous econometric and statistical applications. In particular, we demonstrate how our central limit theorem and the multiplier bootstrap can be used for high dimensional estimation, multiple hypothesis testing, and adaptive specification testing. All of our results contain non-asymptotic bounds on approximation errors.

Suggested Citation

  • Victor Chernozhukov & Denis Chetverikov & Kengo Kato, 2012. "Central limit theorems and multiplier bootstrap when p is much larger than n," CeMMAP working papers CWP45/12, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
  • Handle: RePEc:ifs:cemmap:45/12
    as

    Download full text from publisher

    File URL: http://www.cemmap.ac.uk/wps/cwp451212.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Eric Gautier & Alexandre Tsybakov, 2011. "High-Dimensional Instrumental Variables Regression and Confidence Sets," Working Papers 2011-13, Center for Research in Economics and Statistics.
    2. Victor Chernozhukov & Denis Chetverikov & Kengo Kato, 2012. "Gaussian approximation of suprema of empirical processes," CeMMAP working papers CWP44/12, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    3. Joseph P. Romano & Michael Wolf, 2005. "Exact and Approximate Stepdown Methods for Multiple Hypothesis Testing," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 94-108, March.
    4. Horowitz, Joel L & Spokoiny, Vladimir G, 2001. "An Adaptive, Rate-Optimal Test of a Parametric Mean-Regression Model against a Nonparametric Alternative," Econometrica, Econometric Society, vol. 69(3), pages 599-631, May.
    5. Fan, Jianqing & Hall, Peter & Yao, Qiwei, 2007. "To How Many Simultaneous Hypothesis Tests Can Normal, Student's t or Bootstrap Calibration Be Applied?," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 1282-1288, December.
    6. A. Belloni & V. Chernozhukov & L. Wang, 2011. "Square-root lasso: pivotal recovery of sparse signals via conic programming," Biometrika, Biometrika Trust, vol. 98(4), pages 791-806.
    7. Alquier, Pierre & Hebiri, Mohamed, 2011. "Generalization of ℓ1 constraints for high dimensional regression problems," Statistics & Probability Letters, Elsevier, vol. 81(12), pages 1760-1765.
    8. Emmanuel Guerre & Pascal Lavergne, 2004. "Data-Driven Rate-Optimal Specification Testing In Regression Models," Econometrics 0411008, University Library of Munich, Germany.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Victor Chernozhukov & Denis Chetverikov & Kengo Kato, 2013. "Comparison and anti-concentration bounds for maxima of Gaussian random vectors," CeMMAP working papers CWP71/13, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-dimensional econometrics and regularized GMM," CeMMAP working papers CWP35/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    2. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    3. Victor Chernozhukov & Denis Chetverikov & Kengo Kato, 2012. "Gaussian approximations and multiplier bootstrap for maxima of sums of high-dimensional random vectors," Papers 1212.6906, arXiv.org, revised Jan 2018.
    4. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2016. "Double/Debiased Machine Learning for Treatment and Causal Parameters," Papers 1608.00060, arXiv.org, revised Dec 2017.
    5. Alexandre Belloni & Mingli Chen & Victor Chernozhukov, 2016. "Quantile Graphical Models: Prediction and Conditional Independence with Applications to Systemic Risk," Papers 1607.00286, arXiv.org, revised Oct 2019.
    6. Alexandre Belloni & Victor Chernozhukov & Kengo Kato, 2019. "Valid Post-Selection Inference in High-Dimensional Approximately Sparse Quantile Regression Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(526), pages 749-758, April.
    7. Long Feng & Changliang Zou & Zhaojun Wang & Lixing Zhu, 2015. "Robust comparison of regression curves," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 24(1), pages 185-204, March.
    8. Victor Chernozhukov & Ivan Fernandez-Val & Christian Hansen, 2013. "Program evaluation with high-dimensional data," CeMMAP working papers CWP57/13, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    9. Heckman, James J. & Schmierer, Daniel & Urzua, Sergio, 2010. "Testing the correlated random coefficient model," Journal of Econometrics, Elsevier, vol. 158(2), pages 177-203, October.
    10. Andrea Vaona, 2008. "The sensitivity of nonparametric misspecification tests to disturbance autocorrelation," Quaderni della facoltà di Scienze economiche dell'Università di Lugano 0803, USI Università della Svizzera italiana.
    11. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey, 2016. "Double machine learning for treatment and causal parameters," CeMMAP working papers 49/16, Institute for Fiscal Studies.
    12. Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015. "Valid Post-Selection and Post-Regularization Inference: An Elementary, General Approach," Annual Review of Economics, Annual Reviews, vol. 7(1), pages 649-688, August.
    13. Heckman, James J. & Schmierer, Daniel, 2010. "Tests of hypotheses arising in the correlated random coefficient model," Economic Modelling, Elsevier, vol. 27(6), pages 1355-1367, November.
    14. Alexandre Belloni & Victor Chernozhukov & Lie Wang, 2013. "Pivotal estimation via square-root lasso in nonparametric regression," CeMMAP working papers CWP62/13, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    15. Masamune Iwasawa, 2015. "A Joint Specification Test for Response Probabilities in Unordered Multinomial Choice Models," Econometrics, MDPI, vol. 3(3), pages 1-31, September.
    16. Belloni, Alexandre & Chen, Mingli & Chernozhukov, Victor, 2016. "Quantile Graphical Models : Prediction and Conditional Independence with Applications to Financial Risk Management," Economic Research Papers 269321, University of Warwick - Department of Economics.
    17. A. Belloni & V. Chernozhukov & I. Fernández‐Val & C. Hansen, 2017. "Program Evaluation and Causal Inference With High‐Dimensional Data," Econometrica, Econometric Society, vol. 85, pages 233-298, January.
    18. Achim Ahrens & Arnab Bhattacharjee, 2015. "Two-Step Lasso Estimation of the Spatial Weights Matrix," Econometrics, MDPI, vol. 3(1), pages 1-28, March.
    19. Guo, Zijian & Kang, Hyunseung & Cai, T. Tony & Small, Dylan S., 2018. "Testing endogeneity with high dimensional covariates," Journal of Econometrics, Elsevier, vol. 207(1), pages 175-187.
    20. Zhu, Ying, 2013. "Sparse Linear Models and Two-Stage Estimation in High-Dimensional Settings with Possibly Many Endogenous Regressors," MPRA Paper 49846, University Library of Munich, Germany.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ifs:cemmap:45/12. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Emma Hyman (email available below). General contact details of provider: https://edirc.repec.org/data/cmifsuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.