IDEAS home Printed from https://ideas.repec.org/a/spr/testjl/v26y2017i4d10.1007_s11749-017-0554-2.html
   My bibliography  Save this article

High-dimensional simultaneous inference with the bootstrap

Author

Listed:
  • Ruben Dezeure

    (ETH Zürich)

  • Peter Bühlmann

    (ETH Zürich)

  • Cun-Hui Zhang

    (Rutgers University)

Abstract

We propose a residual and wild bootstrap methodology for individual and simultaneous inference in high-dimensional linear models with possibly non-Gaussian and heteroscedastic errors. We establish asymptotic consistency for simultaneous inference for parameters in groups G, where $$p \gg n$$ p ≫ n , $$s_0 = o(n^{1/2}/\{\log (p) \log (|G|)^{1/2}\})$$ s 0 = o ( n 1 / 2 / { log ( p ) log ( | G | ) 1 / 2 } ) and $$\log (|G|) = o(n^{1/7})$$ log ( | G | ) = o ( n 1 / 7 ) , with p the number of variables, n the sample size and $$s_0$$ s 0 the sparsity. The theory is complemented by many empirical results. Our proposed procedures are implemented in the R-package hdi (Meier et al. hdi: high-dimensional inference. R package version 0.1-6, 2016).

Suggested Citation

  • Ruben Dezeure & Peter Bühlmann & Cun-Hui Zhang, 2017. "High-dimensional simultaneous inference with the bootstrap," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 26(4), pages 685-719, December.
  • Handle: RePEc:spr:testjl:v:26:y:2017:i:4:d:10.1007_s11749-017-0554-2
    DOI: 10.1007/s11749-017-0554-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11749-017-0554-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11749-017-0554-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Victor Chernozhukov & Christian Hansen & Martin Spindler, 2016. "hdm: High-Dimensional Metrics," CeMMAP working papers 37/16, Institute for Fiscal Studies.
    2. Victor Chernozhukov & Denis Chetverikov & Kengo Kato, 2014. "Central limit theorems and bootstrap in high dimensions," CeMMAP working papers 49/14, Institute for Fiscal Studies.
    3. White, Halbert, 1980. "A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity," Econometrica, Econometric Society, vol. 48(4), pages 817-838, May.
    4. Chatterjee, A. & Lahiri, S. N., 2011. "Bootstrapping Lasso Estimators," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 608-625.
    5. Meinshausen, Nicolai & Meier, Lukas & Bühlmann, Peter, 2009. "p-Values for High-Dimensional Regression," Journal of the American Statistical Association, American Statistical Association, vol. 104(488), pages 1671-1681.
    6. Ian W. McKeague & Min Qian, 2015. "An Adaptive Resampling Test for Detecting the Presence of Significant Predictors," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1422-1433, December.
    7. Victor Chernozhukov & Denis Chetverikov & Kengo Kato, 2012. "Gaussian approximations and multiplier bootstrap for maxima of sums of high-dimensional random vectors," Papers 1212.6906, arXiv.org, revised Jan 2018.
    8. Victor Chernozhukov & Chris Hansen & Martin Spindler, 2016. "High-Dimensional Metrics in R," Papers 1603.01700, arXiv.org, revised Aug 2016.
    9. Rajen D. Shah & Richard J. Samworth, 2013. "Variable selection with error control: another look at stability selection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(1), pages 55-80, January.
    10. Qing Zhou, 2014. "Monte Carlo Simulation for Lasso-Type Problems by Estimator Augmentation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(508), pages 1495-1516, December.
    11. Nicolai Meinshausen, 2015. "Group bound: confidence intervals for groups of variables in sparse high dimensional regression without assumptions on the design," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 77(5), pages 923-945, November.
    12. Cun-Hui Zhang & Stephanie S. Zhang, 2014. "Confidence intervals for low dimensional parameters in high dimensional linear models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(1), pages 217-242, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Victor Chernozhukov & Wolfgang Härdle & Chen Huang & Weining Wang, 2018. "LASSO-driven inference in time and space," CeMMAP working papers CWP36/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    2. Xiaorui Zhu & Yichen Qin & Peng Wang, 2023. "Sparsified Simultaneous Confidence Intervals for High-Dimensional Linear Models," Papers 2307.07574, arXiv.org.
    3. Heiss, Florian & Hetzenecker, Stephan & Osterhaus, Maximilian, 2022. "Nonparametric estimation of the random coefficients model: An elastic net approach," Journal of Econometrics, Elsevier, vol. 229(2), pages 299-321.
    4. Jeng, X. Jessie & Chen, Xiongzhi, 2019. "Predictor ranking and false discovery proportion control in high-dimensional regression," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 163-175.
    5. Adamek, Robert & Smeekes, Stephan & Wilms, Ines, 2023. "Lasso inference for high-dimensional time series," Journal of Econometrics, Elsevier, vol. 235(2), pages 1114-1143.
    6. Rui Wang & Xingzhong Xu, 2021. "A Bayesian-motivated test for high-dimensional linear regression models with fixed design matrix," Statistical Papers, Springer, vol. 62(4), pages 1821-1852, August.
    7. Hanzhong Liu & Bin Yu, 2017. "Comments on: High-dimensional simultaneous inference with the bootstrap," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 26(4), pages 740-750, December.
    8. Matthias Löffler & Richard Nickl, 2017. "Comments on: High-dimensional simultaneous inference with the bootstrap," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 26(4), pages 731-733, December.
    9. Byol Kim & Song Liu & Mladen Kolar, 2021. "Two‐sample inference for high‐dimensional Markov networks," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(5), pages 939-962, November.
    10. Victor Chernozhukov & Denis Chetverikov & Kengo Kato & Yuta Koike, 2022. "High-dimensional Data Bootstrap," Papers 2205.09691, arXiv.org.
    11. Zhu, Ke & Liu, Hanzhong, 2022. "Confidence intervals for parameters in high-dimensional sparse vector autoregression," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    12. Capucine Nobletz, 2021. "Return spillovers between green energy indexes and financial markets: a first sectoral approach," EconomiX Working Papers 2021-24, University of Paris Nanterre, EconomiX.
    13. He, Yi & Jaidee, Sombut & Gao, Jiti, 2023. "Most powerful test against a sequence of high dimensional local alternatives," Journal of Econometrics, Elsevier, vol. 234(1), pages 151-177.
    14. Ruben Dezeure & Peter Bühlmann & Cun-Hui Zhang, 2017. "Rejoinder on: High-dimensional simultaneous inference with the bootstrap," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 26(4), pages 751-758, December.
    15. Belloni, Alexandre & Hansen, Christian & Newey, Whitney, 2022. "High-dimensional linear models with many endogenous variables," Journal of Econometrics, Elsevier, vol. 228(1), pages 4-26.
    16. Claude Renaux & Laura Buzdugan & Markus Kalisch & Peter Bühlmann, 2020. "Hierarchical inference for genome-wide association studies: a view on methodology with software," Computational Statistics, Springer, vol. 35(1), pages 1-40, March.
    17. Tanin Sirimongkolkasem & Reza Drikvandi, 2019. "On Regularisation Methods for Analysis of High Dimensional Data," Annals of Data Science, Springer, vol. 6(4), pages 737-763, December.
    18. Ciuperca, Gabriela, 2021. "Variable selection in high-dimensional linear model with possibly asymmetric errors," Computational Statistics & Data Analysis, Elsevier, vol. 155(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qing Zhou & Seunghyun Min, 2017. "Uncertainty quantification under group sparsity," Biometrika, Biometrika Trust, vol. 104(3), pages 613-632.
    2. Philipp Bach & Victor Chernozhukov & Malte S. Kurz & Martin Spindler & Sven Klaassen, 2021. "DoubleML -- An Object-Oriented Implementation of Double Machine Learning in R," Papers 2103.09603, arXiv.org, revised Feb 2024.
    3. Harold D. Chiang, 2018. "Many Average Partial Effects: with An Application to Text Regression," Papers 1812.09397, arXiv.org, revised Jan 2022.
    4. Claude Renaux & Laura Buzdugan & Markus Kalisch & Peter Bühlmann, 2020. "Hierarchical inference for genome-wide association studies: a view on methodology with software," Computational Statistics, Springer, vol. 35(1), pages 1-40, March.
    5. Hansen, Christian & Liao, Yuan, 2019. "The Factor-Lasso And K-Step Bootstrap Approach For Inference In High-Dimensional Economic Applications," Econometric Theory, Cambridge University Press, vol. 35(3), pages 465-509, June.
    6. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey, 2016. "Double machine learning for treatment and causal parameters," CeMMAP working papers 49/16, Institute for Fiscal Studies.
    7. Philipp Bach & Victor Chernozhukov & Martin Spindler, 2018. "Valid Simultaneous Inference in High-Dimensional Settings (with the hdm package for R)," Papers 1809.04951, arXiv.org.
    8. Stefan Seifert & Marica Valente, 2018. "An Offer that you Can't Refuse? Agrimafias and Migrant Labor on Vineyards in Southern Italy," Discussion Papers of DIW Berlin 1735, DIW Berlin, German Institute for Economic Research.
    9. Joel L. Horowitz, 2018. "Bootstrap methods in econometrics," CeMMAP working papers CWP53/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    10. Christian Hansen & Damian Kozbur & Sanjog Misra, 2016. "Targeted undersmoothing," ECON - Working Papers 282, Department of Economics - University of Zurich, revised Apr 2018.
    11. Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-Dimensional Econometrics and Regularized GMM," Papers 1806.01888, arXiv.org, revised Jun 2018.
    12. Jana Janková & Rajen D. Shah & Peter Bühlmann & Richard J. Samworth, 2020. "Goodness‐of‐fit testing in high dimensional generalized linear models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(3), pages 773-795, July.
    13. Zhu, Ke & Liu, Hanzhong, 2022. "Confidence intervals for parameters in high-dimensional sparse vector autoregression," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    14. Joel L. Horowitz, 2018. "Bootstrap Methods in Econometrics," Papers 1809.04016, arXiv.org.
    15. Zhou, Jia & Zheng, Zemin & Zhou, Huiting & Dong, Ruipeng, 2021. "Innovated scalable efficient inference for ultra-large graphical models," Statistics & Probability Letters, Elsevier, vol. 173(C).
    16. Shi, Chengchun & Li, Lexin, 2022. "Testing mediation effects using logic of Boolean matrices," LSE Research Online Documents on Economics 108881, London School of Economics and Political Science, LSE Library.
    17. Chenchuan (Mark) Li & Ulrich K. Müller, 2021. "Linear regression with many controls of limited explanatory power," Quantitative Economics, Econometric Society, vol. 12(2), pages 405-442, May.
    18. Alexandre Belloni & Victor Chernozhukov & Kengo Kato, 2013. "Uniform post selection inference for LAD regression and other z-estimation problems," CeMMAP working papers CWP74/13, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    19. Ian W. McKeague & Min Qian, 2015. "An Adaptive Resampling Test for Detecting the Presence of Significant Predictors," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1422-1433, December.
    20. Huber, Martin & Imhof, David, 2019. "Machine learning with screens for detecting bid-rigging cartels," International Journal of Industrial Organization, Elsevier, vol. 65(C), pages 277-301.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:testjl:v:26:y:2017:i:4:d:10.1007_s11749-017-0554-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.