IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v40y2025i5d10.1007_s00180-024-01562-6.html
   My bibliography  Save this article

A fast divide-and-conquer strategy for single-index model with massive data

Author

Listed:
  • Na Li

    (Hunan Normal University)

  • Jing Yang

    (Hunan Normal University)

Abstract

With the rapid development of modern technology, massive data has received widespread attention. Constrained by computer performance, traditional statistical analysis methods are difficult to obtain quantitative analysis results of massive data. Currently, the most popular analytical method for massive data is the divide-and-conquer(DC) strategy, but relevant studies rarely mention the fitting of semiparametric models under massive data. In this paper, we combine the ideas of DC and refined outer product gradient (rOPG) to propose DC-lsrOPG and DC-qrOPG methods for analyzing massive data for single-index models, based on least squares regression and quantile regression, respectively. The newly developed method significantly reduces the amount of main memory required and running time. The asymptotic normality of the proposed method has been established under some mild conditions. The resulting estimators are theoretically as efficient as the traditional rOPG estimators on the entire data. Some simulation studies and a real data analysis are conducted to illustrate the finite sample performance of the proposed methods.

Suggested Citation

  • Na Li & Jing Yang, 2025. "A fast divide-and-conquer strategy for single-index model with massive data," Computational Statistics, Springer, vol. 40(5), pages 2367-2387, June.
  • Handle: RePEc:spr:compst:v:40:y:2025:i:5:d:10.1007_s00180-024-01562-6
    DOI: 10.1007/s00180-024-01562-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-024-01562-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-024-01562-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Yingcun Xia & Howell Tong & W. K. Li & Li‐Xing Zhu, 2002. "An adaptive estimation of dimension reduction space," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(3), pages 363-410, August.
    2. Shujie Ma, 2016. "Estimation and inference in functional single-index models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 68(1), pages 181-208, February.
    3. Yanlin Tang & Huixia Judy Wang & Hua Liang, 2018. "Composite Estimation for Single‐Index Models with Responses Subject to Detection Limits," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 45(3), pages 444-464, September.
    4. Qifa Xu & Chao Cai & Cuixia Jiang & Fang Sun & Xue Huang, 2020. "Block average quantile regression for massive dataset," Statistical Papers, Springer, vol. 61(1), pages 141-165, February.
    5. Powell, James L & Stock, James H & Stoker, Thomas M, 1989. "Semiparametric Estimation of Index Coefficients," Econometrica, Econometric Society, vol. 57(6), pages 1403-1430, November.
    6. Wang, Qin & Yin, Xiangrong, 2008. "A nonlinear multi-dimensional variable selection method for high dimensional data: Sparse MAVE," Computational Statistics & Data Analysis, Elsevier, vol. 52(9), pages 4512-4520, May.
    7. Klein, Roger W & Spady, Richard H, 1993. "An Efficient Semiparametric Estimator for Binary Response Models," Econometrica, Econometric Society, vol. 61(2), pages 387-421, March.
    8. Ariel Kleiner & Ameet Talwalkar & Purnamrita Sarkar & Michael I. Jordan, 2014. "A scalable bootstrap for massive data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(4), pages 795-816, September.
    9. Yang, Jing & Tian, Guoliang & Lu, Fang & Lu, Xuewen, 2020. "Single-index modal regression via outer product gradients," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    10. Xia, Yingcun, 2006. "Asymptotic Distributions For Two Estimators Of The Single-Index Model," Econometric Theory, Cambridge University Press, vol. 22(6), pages 1112-1137, December.
    11. Rong Jiang & Wei-wei Chen & Xin Liu, 2021. "Adaptive quantile regressions for massive datasets," Statistical Papers, Springer, vol. 62(4), pages 1981-1995, August.
    12. Wu, Tracy Z. & Yu, Keming & Yu, Yan, 2010. "Single-index quantile regression," Journal of Multivariate Analysis, Elsevier, vol. 101(7), pages 1607-1621, August.
    13. Chen, Lanjue & Zhou, Yong, 2020. "Quantile regression in big data: A divide and conquer based strategy," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yang, Jing & Tian, Guoliang & Lu, Fang & Lu, Xuewen, 2020. "Single-index modal regression via outer product gradients," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    2. Huybrechts F. Bindele & Ash Abebe & Karlene N. Meyer, 2018. "General rank-based estimation for regression single index models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 70(5), pages 1115-1146, October.
    3. Jing Sun, 2016. "Composite quantile regression for single-index models with asymmetric errors," Computational Statistics, Springer, vol. 31(1), pages 329-351, March.
    4. Rothe, Christoph, 2009. "Semiparametric estimation of binary response models with endogenous regressors," Journal of Econometrics, Elsevier, vol. 153(1), pages 51-64, November.
    5. Sadikoglu, Serhan, 2019. "Essays in econometric theory," Other publications TiSEM 99d83644-f9dc-49e3-a4e1-5, Tilburg University, School of Economics and Management.
    6. Han, Zhong-Cheng & Lin, Jin-Guan & Zhao, Yan-Yong, 2020. "Adaptive semiparametric estimation for single index models with jumps," Computational Statistics & Data Analysis, Elsevier, vol. 151(C).
    7. Bucher, Axel & El Ghouch, Anouar & Van Keilegom, Ingrid, 2014. "Single-index quantile regression models for censored data," LIDAM Discussion Papers ISBA 2014001, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    8. Lu, Xuewen, 2010. "Asymptotic distributions of two "synthetic data" estimators for censored single-index models," Journal of Multivariate Analysis, Elsevier, vol. 101(4), pages 999-1015, April.
    9. Xia, Yingcun & Härdle, Wolfgang Karl & Linton, Oliver, 2009. "Optimal smoothing for a computationally and statistically efficient single index estimator," SFB 649 Discussion Papers 2009-028, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.
    10. Yunquan Song & Zitong Li & Minglu Fang, 2022. "Robust Variable Selection Based on Penalized Composite Quantile Regression for High-Dimensional Single-Index Models," Mathematics, MDPI, vol. 10(12), pages 1-17, June.
    11. Ash Abebe & Huybrechts F. Bindele & Masego Otlaadisa & Boikanyo Makubate, 2021. "Robust estimation of single index models with responses missing at random," Statistical Papers, Springer, vol. 62(5), pages 2195-2225, October.
    12. Ichimura, Hidehiko & Lee, Sokbae, 2010. "Characterization of the asymptotic distribution of semiparametric M-estimators," Journal of Econometrics, Elsevier, vol. 159(2), pages 252-266, December.
    13. Qingming Zou & Zhongyi Zhu, 2014. "M-estimators for single-index model using B-spline," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 77(2), pages 225-246, February.
    14. repec:hum:wpaper:sfb649dp2009-028 is not listed on IDEAS
    15. Feng, Long & Zou, Changliang & Wang, Zhaojun, 2012. "Rank-based inference for the single-index model," Statistics & Probability Letters, Elsevier, vol. 82(3), pages 535-541.
    16. Feng, Sanying & Kong, Kaidi & Kong, Yinfei & Li, Gaorong & Wang, Zhaoliang, 2022. "Statistical inference of heterogeneous treatment effect based on single-index model," Computational Statistics & Data Analysis, Elsevier, vol. 175(C).
    17. Zhang, Hong-Fan, 2021. "Minimum Average Variance Estimation with group Lasso for the multivariate response Central Mean Subspace," Journal of Multivariate Analysis, Elsevier, vol. 184(C).
    18. Zhu, Xuehu & Guo, Xu & Lin, Lu & Zhu, Lixing, 2015. "Heteroscedasticity checks for single index models," Journal of Multivariate Analysis, Elsevier, vol. 136(C), pages 41-55.
    19. D. Wang & C. S. McMahan & C. M. Gallagher & K. B. Kulasekera, 2014. "Semiparametric group testing regression models," Biometrika, Biometrika Trust, vol. 101(3), pages 587-598.
    20. Ming-Yueh Huang & Chin-Tsang Chiang, 2017. "Estimation and Inference Procedures for Semiparametric Distribution Models with Varying Linear-Index," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 44(2), pages 396-424, June.
    21. Yang, Qing & Zhang, Yi, 2022. "Change-point detection for the link function in a single-index model," Statistics & Probability Letters, Elsevier, vol. 186(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:40:y:2025:i:5:d:10.1007_s00180-024-01562-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.