IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i5p735-d1349007.html
   My bibliography  Save this article

Distributed Bootstrap Simultaneous Inference for High-Dimensional Quantile Regression

Author

Listed:
  • Xingcai Zhou

    (School of Statistics and Data Science, Nanjing Audit University, Nanjing 211815, China)

  • Zhaoyang Jing

    (School of Statistics and Data Science, Nanjing Audit University, Nanjing 211815, China)

  • Chao Huang

    (Department of Statistics, Florida State University, Tallahassee, FL 32306, USA)

Abstract

Modern massive data with enormous sample size and tremendous dimensionality are usually impossible to process with a single machine. They are typically stored and processed in a distributed manner. In this paper, we propose a distributed bootstrap simultaneous inference for a high-dimensional quantile regression model using massive data. Meanwhile, a communication-efficient (CE) distributed learning algorithm is developed via the CE surrogate likelihood framework and ADMM procedure, which can handle the non-smoothness of the quantile regression loss and the Lasso penalty. We theoretically prove the convergence of the algorithm and establish a lower bound on the number of communication rounds ι min that warrant statistical accuracy and efficiency. The distributed bootstrap validity and efficiency are corroborated by an extensive simulation study.

Suggested Citation

  • Xingcai Zhou & Zhaoyang Jing & Chao Huang, 2024. "Distributed Bootstrap Simultaneous Inference for High-Dimensional Quantile Regression," Mathematics, MDPI, vol. 12(5), pages 1-54, February.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:5:p:735-:d:1349007
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/5/735/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/5/735/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Chatterjee, A. & Lahiri, S. N., 2011. "Bootstrapping Lasso Estimators," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 608-625.
    2. Xianyang Zhang & Guang Cheng, 2017. "Simultaneous Inference for High-Dimensional Linear Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 757-768, April.
    3. Ariel Kleiner & Ameet Talwalkar & Purnamrita Sarkar & Michael I. Jordan, 2014. "A scalable bootstrap for massive data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(4), pages 795-816, September.
    4. Ruben Dezeure & Peter Bühlmann & Cun-Hui Zhang, 2017. "High-dimensional simultaneous inference with the bootstrap," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 26(4), pages 685-719, December.
    5. Victor Chernozhukov & Denis Chetverikov & Kengo Kato, 2012. "Gaussian approximations and multiplier bootstrap for maxima of sums of high-dimensional random vectors," Papers 1212.6906, arXiv.org, revised Jan 2018.
    6. Michael I. Jordan & Jason D. Lee & Yun Yang, 2019. "Communication-Efficient Distributed Statistical Inference," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(526), pages 668-681, April.
    7. Lan Wang & Yichao Wu & Runze Li, 2012. "Quantile Regression for Analyzing Heterogeneity in Ultra-High Dimension," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(497), pages 214-222, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Victor Chernozhukov & Wolfgang Härdle & Chen Huang & Weining Wang, 2018. "LASSO-driven inference in time and space," CeMMAP working papers CWP36/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    2. Xiaorui Zhu & Yichen Qin & Peng Wang, 2023. "Sparsified Simultaneous Confidence Intervals for High-Dimensional Linear Models," Papers 2307.07574, arXiv.org.
    3. Victor Chernozhukov & Denis Chetverikov & Kengo Kato & Yuta Koike, 2022. "High-dimensional Data Bootstrap," Papers 2205.09691, arXiv.org.
    4. Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-dimensional econometrics and regularized GMM," CeMMAP working papers CWP35/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    5. Hansen, Christian & Liao, Yuan, 2019. "The Factor-Lasso And K-Step Bootstrap Approach For Inference In High-Dimensional Economic Applications," Econometric Theory, Cambridge University Press, vol. 35(3), pages 465-509, June.
    6. Adamek, Robert & Smeekes, Stephan & Wilms, Ines, 2023. "Lasso inference for high-dimensional time series," Journal of Econometrics, Elsevier, vol. 235(2), pages 1114-1143.
    7. Claude Renaux & Laura Buzdugan & Markus Kalisch & Peter Bühlmann, 2020. "Hierarchical inference for genome-wide association studies: a view on methodology with software," Computational Statistics, Springer, vol. 35(1), pages 1-40, March.
    8. Shi, Chengchun & Lu, Wenbin & Song, Rui, 2018. "A massive data framework for M-estimators with cubic-rate," LSE Research Online Documents on Economics 102111, London School of Economics and Political Science, LSE Library.
    9. Rui Wang & Xingzhong Xu, 2021. "A Bayesian-motivated test for high-dimensional linear regression models with fixed design matrix," Statistical Papers, Springer, vol. 62(4), pages 1821-1852, August.
    10. Hanzhong Liu & Bin Yu, 2017. "Comments on: High-dimensional simultaneous inference with the bootstrap," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 26(4), pages 740-750, December.
    11. Matthias Löffler & Richard Nickl, 2017. "Comments on: High-dimensional simultaneous inference with the bootstrap," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 26(4), pages 731-733, December.
    12. Alexander Giessing & Jianqing Fan, 2020. "Bootstrapping $\ell_p$-Statistics in High Dimensions," Papers 2006.13099, arXiv.org, revised Aug 2020.
    13. Zhu, Ke & Liu, Hanzhong, 2022. "Confidence intervals for parameters in high-dimensional sparse vector autoregression," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    14. Joel L. Horowitz, 2018. "Bootstrap Methods in Econometrics," Papers 1809.04016, arXiv.org.
    15. Wang, Xiaoqian & Kang, Yanfei & Hyndman, Rob J. & Li, Feng, 2023. "Distributed ARIMA models for ultra-long time series," International Journal of Forecasting, Elsevier, vol. 39(3), pages 1163-1184.
    16. He, Yi & Jaidee, Sombut & Gao, Jiti, 2023. "Most powerful test against a sequence of high dimensional local alternatives," Journal of Econometrics, Elsevier, vol. 234(1), pages 151-177.
    17. Ruben Dezeure & Peter Bühlmann & Cun-Hui Zhang, 2017. "High-dimensional simultaneous inference with the bootstrap," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 26(4), pages 685-719, December.
    18. Joel L. Horowitz, 2018. "Bootstrap methods in econometrics," CeMMAP working papers CWP53/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    19. Belloni, Alexandre & Hansen, Christian & Newey, Whitney, 2022. "High-dimensional linear models with many endogenous variables," Journal of Econometrics, Elsevier, vol. 228(1), pages 4-26.
    20. Changgee Chang & Zhiqi Bu & Qi Long, 2023. "CEDAR: communication efficient distributed analysis for regressions," Biometrics, The International Biometric Society, vol. 79(3), pages 2357-2369, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:5:p:735-:d:1349007. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.