IDEAS home Printed from https://ideas.repec.org/a/spr/stpapr/v63y2022i4d10.1007_s00362-021-01271-y.html
   My bibliography  Save this article

Optimal subsampling for composite quantile regression model in massive data

Author

Listed:
  • Yujing Shao

    (Nankai University)

  • Lei Wang

    (Nankai University)

Abstract

The composite quantile regression (CQR) estimator is a robust and efficient alternative to the ordinary least squares estimator and single quantile regression estimator in linear models. For massive data, two different optimal subsampling probabilities through minimizing the trace of the asymptotic variance–covariance matrix for a linearly transformed parameter estimator and the asymptotic mean squared error of the resultant estimator are proposed to downsize the data volume and reduce the computational burden. Furthermore, to improve the efficiency of the ordinary CQR, the optimal subsampling for the weighted CQR estimator is also studied. We also propose iterative subsampling procedures based on two optimal subsampling probabilities to estimate the variance–covariance matrix. The finite-sample performance of the proposed estimators is studied through simulations and an application to household electric power consumption data is also presented.

Suggested Citation

  • Yujing Shao & Lei Wang, 2022. "Optimal subsampling for composite quantile regression model in massive data," Statistical Papers, Springer, vol. 63(4), pages 1139-1161, August.
  • Handle: RePEc:spr:stpapr:v:63:y:2022:i:4:d:10.1007_s00362-021-01271-y
    DOI: 10.1007/s00362-021-01271-y
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00362-021-01271-y
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00362-021-01271-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Bo Kai & Runze Li & Hui Zou, 2010. "Local composite quantile regression smoothing: an efficient and safe alternative to local polynomial regression," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 72(1), pages 49-69, January.
    2. Koenker,Roger, 2005. "Quantile Regression," Cambridge Books, Cambridge University Press, number 9780521845731, January.
    3. Jiang, Xuejun & Li, Jingzhi & Xia, Tian & Yan, Wanfeng, 2016. "Robust and efficient estimation with weighted composite quantile regression," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 457(C), pages 413-423.
    4. Jiang, Rong & Qian, Wei-Min & Zhou, Zhan-Gong, 2016. "Weighted composite quantile regression for single-index models," Journal of Multivariate Analysis, Elsevier, vol. 148(C), pages 34-48.
    5. Haiying Wang & Yanyuan Ma, 2021. "Optimal subsampling for quantile regression in big data," Biometrika, Biometrika Trust, vol. 108(1), pages 99-112.
    6. HaiYing Wang & Rong Zhu & Ping Ma, 2018. "Optimal Subsampling for Large Sample Logistic Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(522), pages 829-844, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sottile, Gianluca & Frumento, Paolo, 2022. "Robust estimation and regression with parametric quantile functions," Computational Statistics & Data Analysis, Elsevier, vol. 171(C).
    2. Fengrui Di & Lei Wang, 2022. "Multi-round smoothed composite quantile regression for distributed data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(5), pages 869-893, October.
    3. Zhen Yu & Keming Yu & Wolfgang K. Härdle & Xueliang Zhang & Kai Wang & Maozai Tian, 2022. "Bayesian spatio‐temporal modeling for the inpatient hospital costs of alcohol‐related disorders," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(S2), pages 644-667, December.
    4. Rong Jiang & Mengxian Sun, 2022. "Single-index composite quantile regression for ultra-high-dimensional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(2), pages 443-460, June.
    5. Xiaohui Yuan & Yong Li & Xiaogang Dong & Tianqing Liu, 2022. "Optimal subsampling for composite quantile regression in big data," Statistical Papers, Springer, vol. 63(5), pages 1649-1676, October.
    6. Hong-Xia Xu & Guo-Liang Fan & Zhen-Long Chen & Jiang-Feng Wang, 2018. "Weighted quantile regression and testing for varying-coefficient models with randomly truncated data," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 102(4), pages 565-588, October.
    7. Jiang, Rong & Qian, Weimin & Zhou, Zhangong, 2012. "Variable selection and coefficient estimation via composite quantile regression with randomly censored data," Statistics & Probability Letters, Elsevier, vol. 82(2), pages 308-317.
    8. Yang, Jing & Tian, Guoliang & Lu, Fang & Lu, Xuewen, 2020. "Single-index modal regression via outer product gradients," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    9. Jiang, Depeng & Zhao, Puying & Tang, Niansheng, 2016. "A propensity score adjustment method for regression models with nonignorable missing covariates," Computational Statistics & Data Analysis, Elsevier, vol. 94(C), pages 98-119.
    10. Yu, Dengdeng & Zhang, Li & Mizera, Ivan & Jiang, Bei & Kong, Linglong, 2019. "Sparse wavelet estimation in quantile regression with multiple functional predictors," Computational Statistics & Data Analysis, Elsevier, vol. 136(C), pages 12-29.
    11. Jun Yu & Jiaqi Liu & HaiYing Wang, 2023. "Information-based optimal subdata selection for non-linear models," Statistical Papers, Springer, vol. 64(4), pages 1069-1093, August.
    12. Park, Seyoung & Lee, Eun Ryung, 2021. "Hypothesis testing of varying coefficients for regional quantiles," Computational Statistics & Data Analysis, Elsevier, vol. 159(C).
    13. Jiang, Rong & Zhou, Zhan-Gong & Qian, Wei-Min & Chen, Yong, 2013. "Two step composite quantile regression for single-index models," Computational Statistics & Data Analysis, Elsevier, vol. 64(C), pages 180-191.
    14. Jiang, Rong & Yu, Keming, 2020. "Single-index composite quantile regression for massive data," Journal of Multivariate Analysis, Elsevier, vol. 180(C).
    15. Zhao, Weihua & Lian, Heng & Song, Xinyuan, 2017. "Composite quantile regression for correlated data," Computational Statistics & Data Analysis, Elsevier, vol. 109(C), pages 15-33.
    16. Tianzhen Wang & Haixiang Zhang, 2022. "Optimal subsampling for multiplicative regression with massive data," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 76(4), pages 418-449, November.
    17. Tian, Yuzhu & Song, Xinyuan, 2020. "Bayesian bridge-randomized penalized quantile regression," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    18. Li, Degui & Li, Runze, 2016. "Local composite quantile regression smoothing for Harris recurrent Markov processes," Journal of Econometrics, Elsevier, vol. 194(1), pages 44-56.
    19. Ziyang Wang & HaiYing Wang & Nalini Ravishanker, 2023. "Subsampling in Longitudinal Models," Methodology and Computing in Applied Probability, Springer, vol. 25(1), pages 1-29, March.
    20. Catania, Leopoldo & Luati, Alessandra, 2020. "Robust estimation of a location parameter with the integrated Hogg function," Statistics & Probability Letters, Elsevier, vol. 164(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stpapr:v:63:y:2022:i:4:d:10.1007_s00362-021-01271-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.