IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2506.05116.html
   My bibliography  Save this paper

The Spurious Factor Dilemma: Robust Inference in Heavy-Tailed Elliptical Factor Models

Author

Listed:
  • Jiang Hu
  • Jiahui Xie
  • Yangchun Zhang
  • Wang Zhou

Abstract

Factor models are essential tools for analyzing high-dimensional data, particularly in economics and finance. However, standard methods for determining the number of factors often overestimate the true number when data exhibit heavy-tailed randomness, misinterpreting noise-induced outliers as genuine factors. This paper addresses this challenge within the framework of Elliptical Factor Models (EFM), which accommodate both heavy tails and potential non-linear dependencies common in real-world data. We demonstrate theoretically and empirically that heavy-tailed noise generates spurious eigenvalues that mimic true factor signals. To distinguish these, we propose a novel methodology based on a fluctuation magnification algorithm. We show that under magnifying perturbations, the eigenvalues associated with real factors exhibit significantly less fluctuation (stabilizing asymptotically) compared to spurious eigenvalues arising from heavy-tailed effects. This differential behavior allows the identification and detection of the true and spurious factors. We develop a formal testing procedure based on this principle and apply it to the problem of accurately selecting the number of common factors in heavy-tailed EFMs. Simulation studies and real data analysis confirm the effectiveness of our approach compared to existing methods, particularly in scenarios with pronounced heavy-tailedness.

Suggested Citation

  • Jiang Hu & Jiahui Xie & Yangchun Zhang & Wang Zhou, 2025. "The Spurious Factor Dilemma: Robust Inference in Heavy-Tailed Elliptical Factor Models," Papers 2506.05116, arXiv.org.
  • Handle: RePEc:arx:papers:2506.05116
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2506.05116
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Lorenzo Trapani, 2018. "A Randomized Sequential Procedure to Determine the Number of Factors," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1341-1349, July.
    2. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    3. Chamberlain, Gary & Rothschild, Michael, 1983. "Arbitrage, Factor Structure, and Mean-Variance Analysis on Large Asset Markets," Econometrica, Econometric Society, vol. 51(5), pages 1281-1304, September.
    4. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    5. Xinbing Kong, 2020. "A random-perturbation-based rank estimator of the number of factors," Biometrika, Biometrika Trust, vol. 107(2), pages 505-511.
    6. Alexei Onatski, 2010. "Determining the Number of Factors from Empirical Distribution of Eigenvalues," The Review of Economics and Statistics, MIT Press, vol. 92(4), pages 1004-1016, November.
    7. Seung C. Ahn & Alex R. Horenstein, 2013. "Eigenvalue Ratio Test for the Number of Factors," Econometrica, Econometric Society, vol. 81(3), pages 1203-1227, May.
    8. Alexei Onatski, 2009. "Testing Hypotheses About the Number of Factors in Large Factor Models," Econometrica, Econometric Society, vol. 77(5), pages 1447-1479, September.
    9. Jianqing Fan & Kunpeng Li & Yuan Liao, 2021. "Recent Developments in Factor Models and Applications in Econometric Learning," Annual Review of Financial Economics, Annual Reviews, vol. 13(1), pages 401-430, November.
    10. Zheng Tracy Ke & Yucong Ma & Xihong Lin, 2023. "Estimation of the Number of Spiked Eigenvalues in a Covariance Matrix by Bulk Eigenvalue Matching Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(541), pages 374-392, January.
    11. Xia, Qiang & Liang, Rubing & Wu, Jianhong, 2017. "Transformed contribution ratio test for the number of factors in static approximate factor models," Computational Statistics & Data Analysis, Elsevier, vol. 112(C), pages 235-241.
    12. Paul, Debashis & Silverstein, Jack W., 2009. "No eigenvalues outside the support of the limiting empirical spectral distribution of a separable covariance matrix," Journal of Multivariate Analysis, Elsevier, vol. 100(1), pages 37-57, January.
    13. Lam, Clifford & Yao, Qiwei, 2012. "Factor modeling for high-dimensional time series: inference for the number of factors," LSE Research Online Documents on Economics 45684, London School of Economics and Political Science, LSE Library.
    14. Stock J.H. & Watson M.W., 2002. "Forecasting Using Principal Components From a Large Number of Predictors," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 1167-1179, December.
    15. Alessi, Lucia & Barigozzi, Matteo & Capasso, Marco, 2010. "Improved penalization for determining the number of factors in approximate factor models," Statistics & Probability Letters, Elsevier, vol. 80(23-24), pages 1806-1813, December.
    16. Baltagi, Badi H. & Kao, Chihwa & Wang, Fa, 2017. "Identification and estimation of a large factor model with structural instability," Journal of Econometrics, Elsevier, vol. 197(1), pages 87-100.
    17. Long Yu & Peng Zhao & Wang Zhou, 2025. "Testing the Number of Common Factors by Bootstrapped Sample Covariance Matrix in High-Dimensional Factor Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 120(549), pages 448-459, January.
    18. Jushan Bai, 2003. "Inferential Theory for Factor Models of Large Dimensions," Econometrica, Econometric Society, vol. 71(1), pages 135-171, January.
    19. Yu, Long & He, Yong & Zhang, Xinsheng, 2019. "Robust factor number specification for large-dimensional elliptical factor model," Journal of Multivariate Analysis, Elsevier, vol. 174(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matteo Barigozzi & Marc Hallin, 2023. "Dynamic Factor Models: a Genealogy," Papers 2310.17278, arXiv.org, revised Jan 2024.
    2. Barigozzi, Matteo & Trapani, Lorenzo, 2020. "Sequential testing for structural stability in approximate factor models," Stochastic Processes and their Applications, Elsevier, vol. 130(8), pages 5149-5187.
    3. Gagliardini, Patrick & Ossola, Elisa & Scaillet, Olivier, 2020. "Estimation of large dimensional conditional factor models in finance," Handbook of Econometrics, in: Steven N. Durlauf & Lars Peter Hansen & James J. Heckman & Rosa L. Matzkin (ed.), Handbook of Econometrics, edition 1, volume 7, chapter 0, pages 219-282, Elsevier.
    4. Shuquan Yang & Nengxiang Ling & Yulin Gong, 2022. "Robust estimation of the number of factors for the pair-elliptical factor models," Computational Statistics, Springer, vol. 37(3), pages 1495-1522, July.
    5. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    6. Gagliardini, Patrick & Ossola, Elisa & Scaillet, Olivier, 2019. "A diagnostic criterion for approximate factor structure," Journal of Econometrics, Elsevier, vol. 212(2), pages 503-521.
    7. Catherine Doz & Peter Fuleky, 2019. "Dynamic Factor Models," Working Papers 2019-4, University of Hawaii Economic Research Organization, University of Hawaii at Manoa.
    8. Poncela, Pilar & Ruiz, Esther & Miranda, Karen, 2021. "Factor extraction using Kalman filter and smoothing: This is not just another survey," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1399-1425.
    9. Choi, Sung Hoon & Kim, Donggyu, 2023. "Large volatility matrix analysis using global and national factor models," Journal of Econometrics, Elsevier, vol. 235(2), pages 1917-1933.
    10. Yu, Long & He, Yong & Zhang, Xinsheng, 2019. "Robust factor number specification for large-dimensional elliptical factor model," Journal of Multivariate Analysis, Elsevier, vol. 174(C).
    11. Yuefeng Han & Rong Chen & Cun-Hui Zhang, 2020. "Rank Determination in Tensor Factor Model," Papers 2011.07131, arXiv.org, revised May 2022.
    12. Ruan Weihua & Hou Qian, 2021. "Determining the Number of Factors in Static Approximate Factor Models Using Discrete Fourier Transforms and Pseudo-Eigenvalues," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 241(1), pages 71-117, February.
    13. Barigozzi, Matteo & Massacci, Daniele, 2025. "Modelling large dimensional datasets with Markov switching factor models," Journal of Econometrics, Elsevier, vol. 247(C).
    14. Aït-Sahalia, Yacine & Xiu, Dacheng, 2017. "Using principal component analysis to estimate a high dimensional factor model with high-frequency data," Journal of Econometrics, Elsevier, vol. 201(2), pages 384-399.
    15. Barigozzi, Matteo & Cho, Haeran & Fryzlewicz, Piotr, 2018. "Simultaneous multiple change-point and factor analysis for high-dimensional time series," Journal of Econometrics, Elsevier, vol. 206(1), pages 187-225.
    16. Matteo Barigozzi & Antonio M. Conti & Matteo Luciani, 2014. "Do Euro Area Countries Respond Asymmetrically to the Common Monetary Policy?," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 76(5), pages 693-714, October.
    17. Fan, Jianqing & Liao, Yuan & Shi, Xiaofeng, 2015. "Risks of large portfolios," Journal of Econometrics, Elsevier, vol. 186(2), pages 367-387.
    18. Zhou, Ruichao & Wu, Jianhong, 2023. "Determining the number of change-points in high-dimensional factor models by cross-validation with matrix completion," Economics Letters, Elsevier, vol. 232(C).
    19. Bai, Jushan & Duan, Jiangtao & Han, Xu, 2024. "The likelihood ratio test for structural changes in factor models," Journal of Econometrics, Elsevier, vol. 238(2).
    20. Gu, Shihao & Kelly, Bryan & Xiu, Dacheng, 2021. "Autoencoder asset pricing models," Journal of Econometrics, Elsevier, vol. 222(1), pages 429-450.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2506.05116. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.