IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v190y2022ics0047259x2200029x.html
   My bibliography  Save this article

Distributed estimation in heterogeneous reduced rank regression: With application to order determination in sufficient dimension reduction

Author

Listed:
  • Chen, Canyi
  • Xu, Wangli
  • Zhu, Liping

Abstract

We are concerned with massive data which are possibly heterogeneous and scattered at different locations. We introduce a communication-efficient distributed algorithm to estimate the rank-deficient loading matrix in reduced rank regressions. The distributed algorithm, which proceeds iteratively, reduces the computational complexity substantially. During each iteration, it yields a closed-form solution and refines the previous estimators gradually. After a finite number of iterations, the final solution estimates the rank consistently, and more importantly, achieves the oracle rate. We recast sufficient dimension reduction methods under the framework of reduced rank regressions, which enables us to recover the central subspace and simultaneously estimate its structural dimension. We demonstrate the efficiency of our proposed distributed algorithm through simulations and an application to the airline on-line performance dataset consisting of 118,914,458 observations.

Suggested Citation

  • Chen, Canyi & Xu, Wangli & Zhu, Liping, 2022. "Distributed estimation in heterogeneous reduced rank regression: With application to order determination in sufficient dimension reduction," Journal of Multivariate Analysis, Elsevier, vol. 190(C).
  • Handle: RePEc:eee:jmvana:v:190:y:2022:i:c:s0047259x2200029x
    DOI: 10.1016/j.jmva.2022.104991
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X2200029X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2022.104991?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    2. Yanyuan Ma & Xinyu Zhang, 2015. "A validated information criterion to determine the structural dimension in dimension reduction models," Biometrika, Biometrika Trust, vol. 102(2), pages 409-420.
    3. Yanyuan Ma & Liping Zhu, 2012. "A Semiparametric Approach to Dimension Reduction," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(497), pages 168-179, March.
    4. Yingcun Xia & Howell Tong & W. K. Li & Li‐Xing Zhu, 2002. "An adaptive estimation of dimension reduction space," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(3), pages 363-410, August.
    5. Xiangrong Yin & R. Dennis Cook, 2002. "Dimension reduction for the conditional kth moment in regression," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(2), pages 159-175, May.
    6. Li‐Ping Zhu & Li‐Xing Zhu, 2009. "On distribution‐weighted partial least squares with diverging number of highly correlated predictors," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(2), pages 525-548, April.
    7. Chen, Songxi, 2012. "Two Sample Tests for High Dimensional Covariance Matrices," MPRA Paper 46026, University Library of Munich, Germany.
    8. Yanyuan Ma & Liping Zhu, 2014. "On estimation efficiency of the central mean subspace," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(5), pages 885-901, November.
    9. Roman Vershynin, 2012. "How Close is the Sample Covariance Matrix to the Actual Covariance Matrix?," Journal of Theoretical Probability, Springer, vol. 25(3), pages 655-686, September.
    10. Michael I. Jordan & Jason D. Lee & Yun Yang, 2019. "Communication-Efficient Distributed Statistical Inference," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(526), pages 668-681, April.
    11. Zhu, Lixing & Miao, Baiqi & Peng, Heng, 2006. "On Sliced Inverse Regression With High-Dimensional Covariates," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 630-643, June.
    12. Chao Zhang & Zhidong Bai & Jiang Hu & Chen Wang, 2018. "Multi-sample test for high-dimensional covariance matrices," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 47(13), pages 3161-3177, July.
    13. Shi, Chengchun & Lu, Wenbin & Song, Rui, 2018. "A massive data framework for M-estimators with cubic-rate," LSE Research Online Documents on Economics 102111, London School of Economics and Political Science, LSE Library.
    14. Bura, E. & Yang, J., 2011. "Dimension estimation in sufficient dimension reduction: A unifying approach," Journal of Multivariate Analysis, Elsevier, vol. 102(1), pages 130-142, January.
    15. Masashi Hyodo & Takahiro Nishiyama, 2021. "Simultaneous testing of the mean vector and covariance matrix among k populations for high-dimensional data," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 50(3), pages 663-684, February.
    16. Ming Yuan & Ali Ekici & Zhaosong Lu & Renato Monteiro, 2007. "Dimension reduction and coefficient estimation in multivariate linear regression," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(3), pages 329-346, June.
    17. Zhu, Li-Ping & Yu, Zhou & Zhu, Li-Xing, 2010. "A sparse eigen-decomposition estimation in semiparametric regression," Computational Statistics & Data Analysis, Elsevier, vol. 54(4), pages 976-986, April.
    18. Izenman, Alan Julian, 1975. "Reduced-rank regression for the multivariate linear model," Journal of Multivariate Analysis, Elsevier, vol. 5(2), pages 248-264, June.
    19. Chengchun Shi & Wenbin Lu & Rui Song, 2018. "A Massive Data Framework for M-Estimators with Cubic-Rate," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(524), pages 1698-1709, October.
    20. Cook, R. Dennis & Ni, Liqiang, 2005. "Sufficient Dimension Reduction via Inverse Regression: A Minimum Discrepancy Approach," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 410-428, June.
    21. Zhu, Li-Ping & Zhu, Li-Xing & Feng, Zheng-Hui, 2010. "Dimension Reduction in Regressions Through Cumulative Slicing Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 105(492), pages 1455-1466.
    22. Kun Chen & Hongbo Dong & Kung-Sik Chan, 2013. "Reduced rank regression via adaptive nuclear norm penalization," Biometrika, Biometrika Trust, vol. 100(4), pages 901-920.
    23. Wei Luo & Bing Li, 2016. "Combining eigenvalues and variation of eigenvectors for order determination," Biometrika, Biometrika Trust, vol. 103(4), pages 875-887.
    24. Yanyuan Ma & Liping Zhu, 2013. "A Review on Dimension Reduction," International Statistical Review, International Statistical Institute, vol. 81(1), pages 134-150, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lu Li & Kai Tan & Xuerong Meggie Wen & Zhou Yu, 2023. "Variable-dependent partial dimension reduction," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(2), pages 521-541, June.
    2. Qin Wang & Yuan Xue, 2023. "A structured covariance ensemble for sufficient dimension reduction," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 17(3), pages 777-800, September.
    3. Deng, Jianqiu & Yang, Xiaojie & Wang, Qihua, 2022. "Surrogate space based dimension reduction for nonignorable nonresponse," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    4. Ming-Yueh Huang & Chin-Tsang Chiang, 2017. "An Effective Semiparametric Estimation Approach for the Sufficient Dimension Reduction Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1296-1310, July.
    5. Hilafu, Haileab & Yin, Xiangrong, 2013. "Sufficient dimension reduction in multivariate regressions with categorical predictors," Computational Statistics & Data Analysis, Elsevier, vol. 63(C), pages 139-147.
    6. Pircalabelu, Eugen & Artemiou, Andreas, 2021. "Graph informed sliced inverse regression," Computational Statistics & Data Analysis, Elsevier, vol. 164(C).
    7. Zhang, Xin & Wang, Chong & Wu, Yichao, 2018. "Functional envelope for model-free sufficient dimension reduction," Journal of Multivariate Analysis, Elsevier, vol. 163(C), pages 37-50.
    8. Shih‐Hao Huang & Kerby Shedden & Hsin‐wen Chang, 2023. "Inference for the dimension of a regression relationship using pseudo‐covariates," Biometrics, The International Biometric Society, vol. 79(3), pages 2394-2403, September.
    9. Wei Luo, 2022. "On efficient dimension reduction with respect to the interaction between two response variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(2), pages 269-294, April.
    10. Kim, Kyongwon, 2022. "On principal graphical models with application to gene network," Computational Statistics & Data Analysis, Elsevier, vol. 166(C).
    11. Wang, Pei & Yin, Xiangrong & Yuan, Qingcong & Kryscio, Richard, 2021. "Feature filter for estimating central mean subspace and its sparse solution," Computational Statistics & Data Analysis, Elsevier, vol. 163(C).
    12. Wang, Qin & Xue, Yuan, 2021. "An ensemble of inverse moment estimators for sufficient dimension reduction," Computational Statistics & Data Analysis, Elsevier, vol. 161(C).
    13. Fan, Guo-Liang & Xu, Hong-Xia & Liang, Han-Ying, 2019. "Dimension reduction estimation for central mean subspace with missing multivariate response," Journal of Multivariate Analysis, Elsevier, vol. 174(C).
    14. Jared D. Huling & Menggang Yu, 2022. "Sufficient dimension reduction for populations with structured heterogeneity," Biometrics, The International Biometric Society, vol. 78(4), pages 1626-1638, December.
    15. Portier, François & Delyon, Bernard, 2013. "Optimal transformation: A new approach for covering the central subspace," Journal of Multivariate Analysis, Elsevier, vol. 115(C), pages 84-107.
    16. François Portier, 2016. "An Empirical Process View of Inverse Regression," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(3), pages 827-844, September.
    17. Eliana Christou, 2020. "Robust dimension reduction using sliced inverse median regression," Statistical Papers, Springer, vol. 61(5), pages 1799-1818, October.
    18. Zhang, Hong-Fan, 2021. "Minimum Average Variance Estimation with group Lasso for the multivariate response Central Mean Subspace," Journal of Multivariate Analysis, Elsevier, vol. 184(C).
    19. Nordhausen, Klaus & Oja, Hannu & Tyler, David E., 2022. "Asymptotic and bootstrap tests for subspace dimension," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    20. Kohei Yoshikawa & Shuichi Kawano, 2023. "Sparse reduced-rank regression for simultaneous rank and variable selection via manifold optimization," Computational Statistics, Springer, vol. 38(1), pages 53-75, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:190:y:2022:i:c:s0047259x2200029x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.