IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2506.09575.html
   My bibliography  Save this paper

Diffusion index forecasts under weaker loadings: PCA, ridge regression, and random projections

Author

Listed:
  • Tom Boot
  • Bart Keijsers

Abstract

We study the accuracy of forecasts in the diffusion index forecast model with possibly weak loadings. The default option to construct forecasts is to estimate the factors through principal component analysis (PCA) on the available predictor matrix, and use the estimated factors to forecast the outcome variable. Alternatively, we can directly relate the outcome variable to the predictors through either ridge regression or random projections. We establish that forecasts based on PCA, ridge regression and random projections are consistent for the conditional mean under the same assumptions on the strength of the loadings. However, under weaker loadings the convergence rate is lower for ridge and random projections if the time dimension is small relative to the cross-section dimension. We assess the relevance of these findings in an empirical setting by comparing relative forecast accuracy for monthly macroeconomic and financial variables using different window sizes. The findings support the theoretical results, and at the same time show that regularization-based procedures may be more robust in settings not covered by the developed theory.

Suggested Citation

  • Tom Boot & Bart Keijsers, 2025. "Diffusion index forecasts under weaker loadings: PCA, ridge regression, and random projections," Papers 2506.09575, arXiv.org.
  • Handle: RePEc:arx:papers:2506.09575
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2506.09575
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Johnstone, Iain M. & Lu, Arthur Yu, 2009. "On Consistency and Sparsity for Principal Components Analysis in High Dimensions," Journal of the American Statistical Association, American Statistical Association, vol. 104(486), pages 682-693.
    2. Marine Carrasco & Barbara Rossi, 2016. "In-Sample Inference and Forecasting in Misspecified Factor Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(3), pages 313-338, July.
    3. Koop, Gary & Korobilis, Dimitris & Pettenuzzo, Davide, 2019. "Bayesian compressed vector autoregressions," Journal of Econometrics, Elsevier, vol. 210(1), pages 135-154.
    4. Schneider, Matthew J. & Gupta, Sachin, 2016. "Forecasting sales of new and existing products using consumer reviews: A random projections approach," International Journal of Forecasting, Elsevier, vol. 32(2), pages 243-256.
    5. Boot, Tom & Nibbering, Didier, 2019. "Forecasting using random subspace methods," Journal of Econometrics, Elsevier, vol. 209(2), pages 391-406.
    6. Michael W. McCracken & Serena Ng, 2016. "FRED-MD: A Monthly Database for Macroeconomic Research," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 574-589, October.
    7. De Mol, Christine & Giannone, Domenico & Reichlin, Lucrezia, 2008. "Forecasting using a large number of predictors: Is Bayesian shrinkage a valid alternative to principal components?," Journal of Econometrics, Elsevier, vol. 146(2), pages 318-328, October.
    8. Bai, Jushan & Ng, Serena, 2023. "Approximate factor models with weaker loadings," Journal of Econometrics, Elsevier, vol. 235(2), pages 1893-1916.
    9. Diebold, Francis X & Mariano, Roberto S, 2002. "Comparing Predictive Accuracy," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(1), pages 134-144, January.
    10. Yoshimasa Uematsu & Takashi Yamagata, 2022. "Inference in Sparsity-Induced Weak Factor Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(1), pages 126-139, December.
    11. Stock, James H & Watson, Mark W, 2002. "Macroeconomic Forecasting Using Diffusion Indexes," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(2), pages 147-162, April.
    12. Alexei Onatski, 2010. "Determining the Number of Factors from Empirical Distribution of Eigenvalues," The Review of Economics and Statistics, MIT Press, vol. 92(4), pages 1004-1016, November.
    13. Yoshimasa Uematsu & Takashi Yamagata, 2022. "Estimation of Sparsity-Induced Weak Factor Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(1), pages 213-227, December.
    14. Seung C. Ahn & Alex R. Horenstein, 2013. "Eigenvalue Ratio Test for the Number of Factors," Econometrica, Econometric Society, vol. 81(3), pages 1203-1227, May.
    15. Onatski, Alexei, 2012. "Asymptotics of the principal components estimator of large factor models with weakly influential factors," Journal of Econometrics, Elsevier, vol. 168(2), pages 244-258.
    16. Jianqing Fan & Yuan Liao, 2022. "Learning Latent Factors From Diversified Projections and Its Applications to Over-Estimated and Weak Factors," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 117(538), pages 909-924, April.
    17. Hande Karabiyik & Joakim Westerlund, 2021. "Forecasting using cross-section average–augmented time series regressions," The Econometrics Journal, Royal Economic Society, vol. 24(2), pages 315-333.
    18. Brinkhuis, J. & Luo, Z-Q. & Zhang, S., 2005. "Matrix convex functions with applications to weighted centers for semidefinite programming," Econometric Institute Research Papers EI 2005-38, Erasmus University Rotterdam, Erasmus School of Economics (ESE), Econometric Institute.
    19. Stock J.H. & Watson M.W., 2002. "Forecasting Using Principal Components From a Large Number of Predictors," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 1167-1179, December.
    20. Jushan Bai & Serena Ng, 2006. "Confidence Intervals for Diffusion Index Forecasts and Inference for Factor-Augmented Regressions," Econometrica, Econometric Society, vol. 74(4), pages 1133-1150, July.
    21. Khai Xiang Chiong & Matthew Shum, 2019. "Random Projection Estimation of Discrete-Choice Models with Large Choice Sets," Management Science, INFORMS, vol. 65(1), pages 256-271, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yoshimasa Uematsu & Takashi Yamagata, 2019. "Estimation of Weak Factor Models," ISER Discussion Paper 1053r, Institute of Social and Economic Research, The University of Osaka, revised Mar 2020.
    2. Smeekes, Stephan & Wijler, Etienne, 2018. "Macroeconomic forecasting using penalized regression methods," International Journal of Forecasting, Elsevier, vol. 34(3), pages 408-430.
    3. Chen, Qitong & Hong, Yongmiao & Li, Haiqi, 2024. "Time-varying forecast combination for factor-augmented regressions with smooth structural changes," Journal of Econometrics, Elsevier, vol. 240(1).
    4. Stock, J.H. & Watson, M.W., 2016. "Dynamic Factor Models, Factor-Augmented Vector Autoregressions, and Structural Vector Autoregressions in Macroeconomics," Handbook of Macroeconomics, in: J. B. Taylor & Harald Uhlig (ed.), Handbook of Macroeconomics, edition 1, volume 2, chapter 0, pages 415-525, Elsevier.
    5. Boot, Tom & Nibbering, Didier, 2019. "Forecasting using random subspace methods," Journal of Econometrics, Elsevier, vol. 209(2), pages 391-406.
    6. Bae, Juhee, 2024. "Factor-augmented forecasting in big data," International Journal of Forecasting, Elsevier, vol. 40(4), pages 1660-1688.
    7. Barbarino, Alessandro & Bura, Efstathia, 2024. "Forecasting Near-equivalence of Linear Dimension Reduction Methods in Large Panels of Macro-variables," Econometrics and Statistics, Elsevier, vol. 31(C), pages 1-18.
    8. Christian Brownlees & Gu{dh}mundur Stef'an Gu{dh}mundsson & Yaping Wang, 2024. "Performance of Empirical Risk Minimization For Principal Component Regression," Papers 2409.03606, arXiv.org, revised Sep 2024.
    9. Shu, Lei & Hao, Yifan & Chen, Yu & Yang, Qing, 2025. "SFQRA: Scaled factor-augmented quantile regression with aggregation in conditional mean forecasting," Journal of Multivariate Analysis, Elsevier, vol. 207(C).
    10. Jianqing Fan & Kunpeng Li & Yuan Liao, 2020. "Recent Developments on Factor Models and its Applications in Econometric Learning," Papers 2009.10103, arXiv.org.
    11. Fan, Jianqing & Jiang, Bai & Sun, Qiang, 2022. "Bayesian factor-adjusted sparse regression," Journal of Econometrics, Elsevier, vol. 230(1), pages 3-19.
    12. Barigozzi, Matteo & Trapani, Lorenzo, 2020. "Sequential testing for structural stability in approximate factor models," Stochastic Processes and their Applications, Elsevier, vol. 130(8), pages 5149-5187.
    13. Rua, António, 2017. "A wavelet-based multivariate multiscale approach for forecasting," International Journal of Forecasting, Elsevier, vol. 33(3), pages 581-590.
    14. Bai, Jushan & Ng, Serena, 2023. "Approximate factor models with weaker loadings," Journal of Econometrics, Elsevier, vol. 235(2), pages 1893-1916.
    15. Kelly, Bryan & Pruitt, Seth, 2015. "The three-pass regression filter: A new approach to forecasting using many predictors," Journal of Econometrics, Elsevier, vol. 186(2), pages 294-316.
    16. Wanbo Lu & Guanglin Huang & Kris Boudt, 2024. "Estimation of Non-Gaussian Factors Using Higher-order Multi-cumulants in Weak Factor Models," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 24/1085, Ghent University, Faculty of Economics and Business Administration.
    17. Olivier Fortin‐Gagnon & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "A large Canadian database for macroeconomic analysis," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 55(4), pages 1799-1833, November.
    18. Matteo Barigozzi & Marc Hallin, 2024. "The Dynamic, the Static, and the Weak Factor Models and the Analysis of High-Dimensional Time Series," Working Papers ECARES 2024-14, ULB -- Universite Libre de Bruxelles.
    19. Philipp Gersing & Matteo Barigozzi & Christoph Rust & Manfred Deistler, 2023. "The Canonical Decomposition of Factor Models: Weak Factors are Everywhere," Papers 2307.10067, arXiv.org, revised Feb 2025.
    20. Jie Wei & Yonghui Zhang, 2023. "Does Principal Component Analysis Preserve the Sparsity in Sparse Weak Factor Models?," Papers 2305.05934, arXiv.org, revised Nov 2024.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2506.09575. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.