Online regularized matrix regression with streaming data

Author

Listed:
  • Yang, Yaohong
  • Zhao, Weihua
  • Wang, Lei

Abstract

As extensions of vector data with ultrahigh dimensionality and complex structures, matrix data are fast emerging in a wide variety of scientific applications. In this paper, we consider matrix regression with streaming data and propose two-stage online regularized estimators with nuclear norm (NN) and adaptive nuclear norm (ANN) penalties, respectively. In the first stage, an equivalent form of the offline matrix regression loss function is established using the current raw data and summary statistics from historical data. In the second stage, a gradient descent algorithm and soft-thresholding methods are implemented iteratively to obtain the proposed online NN and ANN estimators. We establish the asymptotic properties of the resulting online regularized estimators and show rank selection consistency for the online ANN estimator. The finite-sample performance of the proposed estimators is studied through simulations and an application to the Beijing Air Quality data set.
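
The two-stage idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions that the abstract does not state: a squared-error loss for the model y_i = <X_i, B> + error, a Gram matrix and a cross-product vector as the retained summary statistics, and a fixed step size of 1/L. The class name OnlineNuclearNormRegression, the helper svt, and all parameter names are hypothetical. Stage one folds each new batch into the summaries and then discards it; stage two alternates a gradient step with soft thresholding of the singular values.

```python
import numpy as np


def svt(M, tau):
    """Soft-threshold the singular values of M by tau (prox of tau * nuclear norm)."""
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    return (U * np.maximum(s - tau, 0.0)) @ Vt


class OnlineNuclearNormRegression:
    """Hypothetical sketch: nuclear-norm-penalized least squares on streaming batches."""

    def __init__(self, p, q, lam):
        self.p, self.q, self.lam = p, q, lam
        d = p * q
        self.S = np.zeros((d, d))   # running sum of vec(X_i) vec(X_i)^T  (assumed summary)
        self.u = np.zeros(d)        # running sum of vec(X_i) * y_i       (assumed summary)
        self.n = 0                  # number of observations seen so far
        self.B = np.zeros((p, q))   # current estimate, used as a warm start

    def update(self, X_batch, y_batch, n_iter=200):
        """Absorb one batch (X_batch: (m, p, q), y_batch: (m,)) and refit."""
        m = len(y_batch)
        Z = X_batch.reshape(m, -1)  # row-major vectorization of each X_i

        # Stage 1: update the summaries; the raw batch is not needed afterwards.
        self.S += Z.T @ Z
        self.u += Z.T @ y_batch
        self.n += m

        # Stage 2: proximal gradient on (1/2n) * sum (y_i - <X_i, B>)^2 + lam * ||B||_*.
        L = np.linalg.eigvalsh(self.S)[-1] / self.n  # Lipschitz constant of the gradient
        step = 1.0 / L
        b = self.B.ravel()
        for _ in range(n_iter):
            grad = (self.S @ b - self.u) / self.n
            b = svt((b - step * grad).reshape(self.p, self.q), step * self.lam).ravel()
        self.B = b.reshape(self.p, self.q)
        return self.B
```

Because the squared-error loss depends on the data only through S, u, and n in this sketch, the objective minimized after each batch is the same one an offline fit on all data seen so far would use. An adaptive variant would replace the single threshold in svt with one threshold per singular value, for example weights derived from an initial estimate; the weighting scheme used in the paper is not given in the abstract.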

Suggested Citation

  • Yang, Yaohong & Zhao, Weihua & Wang, Lei, 2023. "Online regularized matrix regression with streaming data," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
  • Handle: RePEc:eee:csdana:v:187:y:2023:i:c:s0167947323001202
    DOI: 10.1016/j.csda.2023.107809

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947323001202
    Download Restriction: Full text for ScienceDirect subscribers only.

    References listed on IDEAS

    1. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    2. Fan, Jianqing & Gong, Wenyan & Zhu, Ziwei, 2019. "Generalized high-dimensional trace regression via nuclear norm regularization," Journal of Econometrics, Elsevier, vol. 212(1), pages 177-202.
    3. Hua Zhou & Lexin Li, 2014. "Regularized matrix regression," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(2), pages 463-483, March.
    4. Sydney C. Ludvigson & Serena Ng, 2009. "Macro Factors in Bond Risk Premia," The Review of Financial Studies, Society for Financial Studies, vol. 22(12), pages 5027-5067, December.
    5. Stock J.H. & Watson M.W., 2002. "Forecasting Using Principal Components From a Large Number of Predictors," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 1167-1179, December.
    6. Kun Chen & Hongbo Dong & Kung-Sik Chan, 2013. "Reduced rank regression via adaptive nuclear norm penalization," Biometrika, Biometrika Trust, vol. 100(4), pages 901-920.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Borup, Daniel & Christensen, Bent Jesper & Mühlbach, Nicolaj Søndergaard & Nielsen, Mikkel Slot, 2023. "Targeting predictors in random forest regression," International Journal of Forecasting, Elsevier, vol. 39(2), pages 841-868.
    2. Luo, Chongliang & Liang, Jian & Li, Gen & Wang, Fei & Zhang, Changshui & Dey, Dipak K. & Chen, Kun, 2018. "Leveraging mixed and incomplete outcomes via reduced-rank modeling," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 378-394.
    3. Buncic, Daniel & Tischhauser, Martin, 2017. "Macroeconomic factors and equity premium predictability," International Review of Economics & Finance, Elsevier, vol. 51(C), pages 621-644.
    4. Sermpinis, Georgios & Tsoukas, Serafeim & Zhang, Ping, 2018. "Modelling market implied ratings using LASSO variable selection techniques," Journal of Empirical Finance, Elsevier, vol. 48(C), pages 19-35.
    5. Kock, Anders Bredahl & Callot, Laurent, 2015. "Oracle inequalities for high dimensional vector autoregressions," Journal of Econometrics, Elsevier, vol. 186(2), pages 325-344.
    6. Jing-Zhi Huang & Zhan Shi, 2023. "Machine-Learning-Based Return Predictors and the Spanning Controversy in Macro-Finance," Management Science, INFORMS, vol. 69(3), pages 1780-1804, March.
    7. Fan, Jianqing & Gong, Wenyan & Zhu, Ziwei, 2019. "Generalized high-dimensional trace regression via nuclear norm regularization," Journal of Econometrics, Elsevier, vol. 212(1), pages 177-202.
    8. Fan, Jianqing & Jiang, Bai & Sun, Qiang, 2022. "Bayesian factor-adjusted sparse regression," Journal of Econometrics, Elsevier, vol. 230(1), pages 3-19.
    9. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    10. Chen, Canyi & Xu, Wangli & Zhu, Liping, 2022. "Distributed estimation in heterogeneous reduced rank regression: With application to order determination in sufficient dimension reduction," Journal of Multivariate Analysis, Elsevier, vol. 190(C).
    11. Trucíos, Carlos & Mazzeu, João H.G. & Hotta, Luiz K. & Valls Pereira, Pedro L. & Hallin, Marc, 2021. "Robustness and the general dynamic factor model with infinite-dimensional space: Identification, estimation, and forecasting," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1520-1534.
    12. Georges Bresson & Jean-Michel Etienne & Pierre Mohnen, 2011. "How important is innovation? A Bayesian factor-augmented productivity model on panel data," TEPP Working Paper 2011-06, TEPP.
    13. P. Byrne, Joseph & Cao, Shuo & Korobilis, Dimitris, 2015. "Term Structure Dynamics, Macro-Finance Factors and Model Uncertainty," SIRE Discussion Papers 2015-71, Scottish Institute for Research in Economics (SIRE).
    14. Massimo Guidolin & Manuela Pedio, 2018. "Forecasting Commodity Futures Returns: An Economic Value Analysis of Macroeconomic vs. Specific Factors," BAFFI CAREFIN Working Papers 1886, BAFFI CAREFIN, Centre for Applied Research on International Markets Banking Finance and Regulation, Universita' Bocconi, Milano, Italy.
    15. Xu Cheng & Zhipeng Liao & Frank Schorfheide, 2016. "Shrinkage Estimation of High-Dimensional Factor Models with Structural Instabilities," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 83(4), pages 1511-1543.
    16. Aslanidis, Nektarios & Christiansen, Charlotte, 2014. "Quantiles of the realized stock–bond correlation and links to the macroeconomy," Journal of Empirical Finance, Elsevier, vol. 28(C), pages 321-331.
    17. Ralf Brüggemann & Christian Kascha, 2017. "Directed Graphs and Variable Selection in Large Vector Autoregressive Models," Working Paper Series of the Department of Economics, University of Konstanz 2017-06, Department of Economics, University of Konstanz.
    18. Costa, Alexandre Bonnet R. & Ferreira, Pedro Cavalcanti G. & Gaglianone, Wagner P. & Guillén, Osmani Teixeira C. & Issler, João Victor & Lin, Yihao, 2021. "Machine learning and oil price point and density forecasting," Energy Economics, Elsevier, vol. 102(C).
    19. Roberto Casarin & Stefano Grassi & Francesco Ravazzolo & Herman K. van Dijk, 2020. "A Bayesian Dynamic Compositional Model for Large Density Combinations in Finance," Working Paper series 20-27, Rimini Centre for Economic Analysis.
    20. Li, Xinjue & Zboňáková, Lenka & Wang, Weining & Härdle, Wolfgang Karl, 2019. "Combining Penalization and Adaption in High Dimension with Application in Bond Risk Premia Forecasting," IRTG 1792 Discussion Papers 2019-030, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
