IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2601.04087.html

Mean Square Errors of factors extracted using principal components, linear projections, and Kalman filter

Author

Listed:
  • Matteo Barigozzi
  • Diego Fresoli
  • Esther Ruiz

Abstract

Factor extraction from systems of variables with a large cross-sectional dimension, $N$, is often based on either Principal Components (PC)-based procedures, or Kalman filter (KF)-based procedures. Measuring the uncertainty of the extracted factors is important when, for example, they have a direct interpretation and/or they are used to summarized the information in a large number of potential predictors. In this paper, we compare the finite $N$ mean square errors (MSEs) of PC and KF factors extracted under different structures of the idiosyncratic cross-correlations. We show that the MSEs of PC-based factors, implicitly based on treating the true underlying factors as deterministic, are larger than the corresponding MSEs of KF factors, obtained by treating the true factors as either serially independent or autocorrelated random variables. We also study and compare the MSEs of PC and KF factors estimated when the idiosyncratic components are wrongly considered as if they were cross-sectionally homoscedastic and/or uncorrelated. The relevance of the results for the construction of confidence intervals for the factors are illustrated with simulated data.

Suggested Citation

  • Matteo Barigozzi & Diego Fresoli & Esther Ruiz, 2026. "Mean Square Errors of factors extracted using principal components, linear projections, and Kalman filter," Papers 2601.04087, arXiv.org.
  • Handle: RePEc:arx:papers:2601.04087
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2601.04087
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Choi, In, 2012. "Efficient Estimation Of Factor Models," Econometric Theory, Cambridge University Press, vol. 28(2), pages 274-308, April.
    2. Tomohiro Ando & Ruey S. Tsay, 2011. "Quantile regression models with factor‐augmented predictors and information criterion," Econometrics Journal, Royal Economic Society, vol. 14, pages 1-24, February.
    3. Catherine Doz & Domenico Giannone & Lucrezia Reichlin, 2012. "A Quasi–Maximum Likelihood Approach for Large, Approximate Dynamic Factor Models," The Review of Economics and Statistics, MIT Press, vol. 94(4), pages 1014-1024, November.
    4. Breitung, Jörg & Tenhofen, Jörn, 2011. "GLS Estimation of Dynamic Factor Models," Journal of the American Statistical Association, American Statistical Association, vol. 106(495), pages 1150-1166.
    5. Freyaldenhoven, Simon, 2022. "Factor models with local factors — Determining the number of relevant factors," Journal of Econometrics, Elsevier, vol. 229(1), pages 80-102.
    6. Bai, Jushan & Ng, Serena, 2013. "Principal components estimation and identification of static factors," Journal of Econometrics, Elsevier, vol. 176(1), pages 18-29.
    7. Danny Quah & Thomas J. Sargent, 1993. "A Dynamic Index Model for Large Cross Sections," NBER Chapters, in: Business Cycles, Indicators, and Forecasting, pages 285-310, National Bureau of Economic Research, Inc.
    8. Stock, James H. & Watson, Mark W., 2006. "Forecasting with Many Predictors," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 1, chapter 10, pages 515-554, Elsevier.
    9. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    10. Daniel J. Lewis & Karel Mertens & James H. Stock & Mihir Trivedi, 2022. "Measuring real activity using a weekly economic index," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(4), pages 667-687, June.
    11. Esther Ruiz & Pilar Poncela, 2022. "Factor Extraction in Dynamic Factor Models: Kalman Filter Versus Principal Components," Foundations and Trends(R) in Econometrics, now publishers, vol. 12(2), pages 121-231, November.
    12. Ryan Greenaway‐McGrevy & Nelson C. Mark & Donggyu Sul & Jyh‐Lin Wu, 2018. "Identifying Exchange Rate Common Factors," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 59(4), pages 2193-2218, November.
    13. Onatski, Alexei, 2012. "Asymptotics of the principal components estimator of large factor models with weakly influential factors," Journal of Econometrics, Elsevier, vol. 168(2), pages 244-258.
    14. Bai, Jushan & Liao, Yuan, 2016. "Efficient estimation of approximate factor models via penalized maximum likelihood," Journal of Econometrics, Elsevier, vol. 191(1), pages 1-18.
    15. Steffen R. Henzel & Malte Rengel, 2017. "Dimensions Of Macroeconomic Uncertainty: A Common Factor Analysis," Economic Inquiry, Western Economic Association International, vol. 55(2), pages 843-877, April.
    16. Bai, Jushan & Ng, Serena, 2006. "Evaluating latent and observed factors in macroeconomics and finance," Journal of Econometrics, Elsevier, vol. 131(1-2), pages 507-537.
    17. Jushan Bai & Kunpeng Li, 2016. "Maximum Likelihood Estimation and Inference for Approximate Factor Models of High Dimension," The Review of Economics and Statistics, MIT Press, vol. 98(2), pages 298-309, May.
    18. Boivin, Jean & Ng, Serena, 2006. "Are more data always better for factor analysis?," Journal of Econometrics, Elsevier, vol. 132(1), pages 169-194, May.
    19. Harvey, Andrew C. & Delle Monache, Davide, 2009. "Computing the mean square error of unobserved components extracted by misspecified time series models," Journal of Economic Dynamics and Control, Elsevier, vol. 33(2), pages 283-295, February.
    20. Thomas J. Sargent & Christopher A. Sims, 1977. "Business cycle modeling without pretending to have too much a priori economic theory," Working Papers 55, Federal Reserve Bank of Minneapolis.
    21. Jushan Bai, 2003. "Inferential Theory for Factor Models of Large Dimensions," Econometrica, Econometric Society, vol. 71(1), pages 135-171, January.
    22. Jushan Bai & Serena Ng, 2006. "Confidence Intervals for Diffusion Index Forecasts and Inference for Factor-Augmented Regressions," Econometrica, Econometric Society, vol. 74(4), pages 1133-1150, July.
    23. James H. Stock & Mark W. Watson, 2017. "Twenty Years of Time Series Econometrics in Ten Pictures," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 59-86, Spring.
    24. Gloria González‐Rivera & C. Vladimir Rodríguez‐Caballero & Esther Ruiz, 2024. "Expecting the unexpected: Stressed scenarios for economic growth," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(5), pages 926-942, August.
    25. R. H. Shumway & D. S. Stoffer, 1982. "An Approach To Time Series Smoothing And Forecasting Using The Em Algorithm," Journal of Time Series Analysis, Wiley Blackwell, vol. 3(4), pages 253-264, July.
    26. Luciani, Matteo, 2014. "Forecasting with approximate dynamic factor models: The role of non-pervasive shocks," International Journal of Forecasting, Elsevier, vol. 30(1), pages 20-29.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Bellocca, Gian Pietro Enzo & Garrón Vedia, Ignacio & Rodríguez Caballero, Carlos Vladimir & Ruiz Ortega, Esther, 2026. "The empirical distribution of sequential LS factors in Multi-level Dynamic Factor Models," DES - Working Papers. Statistics and Econometrics. WS 49336, Universidad Carlos III de Madrid. Departamento de Estadística.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matteo Barigozzi, 2023. "Quasi Maximum Likelihood Estimation of High-Dimensional Factor Models: A Critical Review," Papers 2303.11777, arXiv.org, revised May 2024.
    2. Stock, J.H. & Watson, M.W., 2016. "Dynamic Factor Models, Factor-Augmented Vector Autoregressions, and Structural Vector Autoregressions in Macroeconomics," Handbook of Macroeconomics, in: J. B. Taylor & Harald Uhlig (ed.), Handbook of Macroeconomics, edition 1, volume 2, chapter 0, pages 415-525, Elsevier.
    3. Fresoli, Diego & Poncela, Pilar & Ruiz, Esther, 2023. "Ignoring cross-correlated idiosyncratic components when extracting factors in dynamic factor models," Economics Letters, Elsevier, vol. 230(C).
    4. Catherine Doz & Peter Fuleky, 2019. "Dynamic Factor Models," Working Papers halshs-02262202, HAL.
    5. Matteo Barigozzi, 2023. "Asymptotic equivalence of Principal Components and Quasi Maximum Likelihood estimators in Large Approximate Factor Models," Papers 2307.09864, arXiv.org, revised Jun 2024.
    6. Poncela, Pilar & Ruiz, Esther & Miranda, Karen, 2021. "Factor extraction using Kalman filter and smoothing: This is not just another survey," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1399-1425.
    7. Matteo Barigozzi & Matteo Luciani, 2019. "Quasi Maximum Likelihood Estimation and Inference of Large Approximate Dynamic Factor Models via the EM algorithm," Papers 1910.03821, arXiv.org, revised Sep 2024.
    8. Pilar Poncela & Esther Ruiz, 2016. "Small- Versus Big-Data Factor Extraction in Dynamic Factor Models: An Empirical Assessment," Advances in Econometrics, in: Dynamic Factor Models, volume 35, pages 401-434, Emerald Group Publishing Limited.
    9. Matteo Barigozzi & Marc Hallin, 2026. "The Dynamic, the Static, and the Weak: Factor Models and the Analysis of High‐Dimensional Time Series," Journal of Time Series Analysis, Wiley Blackwell, vol. 47(1), pages 201-219, January.
    10. Karen Miranda & Pilar Poncela & Esther Ruiz, 2022. "Dynamic factor models: Does the specification matter?," SERIEs: Journal of the Spanish Economic Association, Springer;Spanish Economic Association, vol. 13(1), pages 397-428, May.
    11. Diego Fresoli & Pilar Poncela & Esther Ruiz, 2024. "Dealing with idiosyncratic cross-correlation when constructing confidence regions for PC factors," Papers 2407.06883, arXiv.org.
    12. Rachida Ouysse, 2017. "Constrained principal components estimation of large approximate factor models," Discussion Papers 2017-12, School of Economics, The University of New South Wales.
    13. Cheng, Xu & Hansen, Bruce E., 2015. "Forecasting with factor-augmented regression: A frequentist model averaging approach," Journal of Econometrics, Elsevier, vol. 186(2), pages 280-293.
    14. Bai, Jushan & Liao, Yuan, 2016. "Efficient estimation of approximate factor models via penalized maximum likelihood," Journal of Econometrics, Elsevier, vol. 191(1), pages 1-18.
    15. Barigozzi, Matteo & Hallin, Marc & Luciani, Matteo & Zaffaroni, Paolo, 2024. "Inferential theory for generalized dynamic factor models," Journal of Econometrics, Elsevier, vol. 239(2).
    16. Helmut Lütkepohl, 2014. "Structural Vector Autoregressive Analysis in a Data Rich Environment: A Survey," Discussion Papers of DIW Berlin 1351, DIW Berlin, German Institute for Economic Research.
    17. Bellocca, Gian Pietro Enzo & Garrón Vedia, Ignacio & Rodríguez Caballero, Carlos Vladimir & Ruiz Ortega, Esther, 2026. "The empirical distribution of sequential LS factors in Multi-level Dynamic Factor Models," DES - Working Papers. Statistics and Econometrics. WS 49336, Universidad Carlos III de Madrid. Departamento de Estadística.
    18. Shaoxin Wang & Hu Yang & Chaoli Yao, 2019. "On the penalized maximum likelihood estimation of high-dimensional approximate factor model," Computational Statistics, Springer, vol. 34(2), pages 819-846, June.
    19. Francisco Corona & Pilar Poncela & Esther Ruiz, 2020. "Estimating Non-stationary Common Factors: Implications for Risk Sharing," Computational Economics, Springer;Society for Computational Economics, vol. 55(1), pages 37-60, January.
    20. Jianqing Fan & Kunpeng Li & Yuan Liao, 2020. "Recent Developments on Factor Models and its Applications in Econometric Learning," Papers 2009.10103, arXiv.org.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2601.04087. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.