IDEAS home Printed from https://ideas.repec.org/p/msh/ebswps/2019-31.html
   My bibliography  Save this paper

Spiked Eigenvalues of High-Dimensional Separable Sample Covariance Matrices

Author

Listed:
  • Bo Zhang
  • Jiti Gao
  • Guangming Pan
  • Yanrong Yang

Abstract

This paper establishes asymptotic properties for spiked empirical eigenvalues of sample covariance matrices for high-dimensional data with both cross-sectional dependence and a dependent sample structure. A new finding from the established theoretical results is that spiked empirical eigenvalues will reflect the dependent sample structure instead of the cross-sectional structure under some scenarios, which indicates that principal component analysis (PCA) may provide inaccurate inference for cross-sectional structures. An illustrated example is provided to show that some commonly used statistics based on spiked empirical eigenvalues misestimate the true number of common factors. As an application of high-dimensional time series, we propose a test statistic to distinguish the unit root from the factor structure and demonstrate its effective finite sample performance on simulated data. Our results are then applied to analyze OECD healthcare expenditure data and U.S. mortality data, both of which possess cross-sectional dependence as well as non-stationary temporal dependence. It is worth mentioning that we contribute to statistical justification for the benchmark paper by Lee and Carter [25] in mortality forecasting.

Suggested Citation

  • Bo Zhang & Jiti Gao & Guangming Pan & Yanrong Yang, 2019. "Spiked Eigenvalues of High-Dimensional Separable Sample Covariance Matrices," Monash Econometrics and Business Statistics Working Papers 31/19, Monash University, Department of Econometrics and Business Statistics.
  • Handle: RePEc:msh:ebswps:2019-31
    as

    Download full text from publisher

    File URL: https://www.monash.edu/business/ebs/research/publications/ebs/wp31-2019.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    2. Seung C. Ahn & Alex R. Horenstein, 2013. "Eigenvalue Ratio Test for the Number of Factors," Econometrica, Econometric Society, vol. 81(3), pages 1203-1227, May.
    3. Alexei Onatski, 2009. "Testing Hypotheses About the Number of Factors in Large Factor Models," Econometrica, Econometric Society, vol. 77(5), pages 1447-1479, September.
    4. Baltagi, Badi H. & Moscone, Francesco, 2010. "Health care expenditure and income in the OECD reconsidered: Evidence from panel data," Economic Modelling, Elsevier, vol. 27(4), pages 804-811, July.
    5. Jiti Gao & Kai Xia, 2017. "Heterogeneous panel data models with cross-sectional dependence," Monash Econometrics and Business Statistics Working Papers 16/17, Monash University, Department of Econometrics and Business Statistics.
    6. Fang Han & Han Liu, 2018. "ECA: High-Dimensional Elliptical Component Analysis in Non-Gaussian Distributions," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(521), pages 252-268, January.
    7. Alexei Onatski, 2010. "Determining the Number of Factors from Empirical Distribution of Eigenvalues," The Review of Economics and Statistics, MIT Press, vol. 92(4), pages 1004-1016, November.
    8. Baik, Jinho & Silverstein, Jack W., 2006. "Eigenvalues of large sample covariance matrices of spiked population models," Journal of Multivariate Analysis, Elsevier, vol. 97(6), pages 1382-1408, July.
    9. Bai, Jushan, 2004. "Estimating cross-section common stochastic trends in nonstationary panel data," Journal of Econometrics, Elsevier, vol. 122(1), pages 137-183, September.
    10. Guangming Pan & Jiti Gao & Yanrong Yang, 2014. "Testing Independence Among a Large Number of High-Dimensional Random Vectors," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(506), pages 600-612, June.
    11. Bo Zhang & Jiti Gao & Guangming Pan, 2019. "A Near Unit Root Test for High-Dimensional Nonstationary Time Series," Monash Econometrics and Business Statistics Working Papers 10/19, Monash University, Department of Econometrics and Business Statistics.
    12. Carter, Lawrence R. & Lee, Ronald D., 1992. "Modeling and forecasting US sex differentials in mortality," International Journal of Forecasting, Elsevier, vol. 8(3), pages 393-411, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matteo Barigozzi & Marco Lippi & Matteo Luciani, 2016. "Non-Stationary Dynamic Factor Models for Large Datasets," Finance and Economics Discussion Series 2016-024, Board of Governors of the Federal Reserve System (U.S.).
    2. Barigozzi, Matteo & Lippi, Marco & Luciani, Matteo, 2021. "Large-dimensional Dynamic Factor Models: Estimation of Impulse–Response Functions with I(1) cointegrated factors," Journal of Econometrics, Elsevier, vol. 221(2), pages 455-482.
    3. Matteo Barigozzi & Marc Hallin, 2023. "Dynamic Factor Models: a Genealogy," Papers 2310.17278, arXiv.org, revised Jan 2024.
    4. Barigozzi, Matteo & Trapani, Lorenzo, 2020. "Sequential testing for structural stability in approximate factor models," Stochastic Processes and their Applications, Elsevier, vol. 130(8), pages 5149-5187.
    5. Xia, Qiang & Liang, Rubing & Wu, Jianhong, 2017. "Transformed contribution ratio test for the number of factors in static approximate factor models," Computational Statistics & Data Analysis, Elsevier, vol. 112(C), pages 235-241.
    6. Poncela, Pilar & Ruiz, Esther & Miranda, Karen, 2021. "Factor extraction using Kalman filter and smoothing: This is not just another survey," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1399-1425.
    7. Matteo Barigozzi & Lorenzo Trapani, 2018. "Determining the dimension of factor structures in non-stationary large datasets," Papers 1806.03647, arXiv.org.
    8. Yu, Long & He, Yong & Zhang, Xinsheng, 2019. "Robust factor number specification for large-dimensional elliptical factor model," Journal of Multivariate Analysis, Elsevier, vol. 174(C).
    9. Matteo Barigozzi & Marco Lippi & Matteo Luciani, 2014. "Dynamic Factor Models, Cointegration and Error Correction Mechanisms," Working Papers ECARES ECARES 2014-14, ULB -- Universite Libre de Bruxelles.
    10. Jin, Sainan & Miao, Ke & Su, Liangjun, 2021. "On factor models with random missing: EM estimation, inference, and cross validation," Journal of Econometrics, Elsevier, vol. 222(1), pages 745-777.
    11. Catherine Doz & Peter Fuleky, 2019. "Dynamic Factor Models," Working Papers 2019-4, University of Hawaii Economic Research Organization, University of Hawaii at Manoa.
    12. Jörg Breitung & In Choi, 2013. "Factor models," Chapters, in: Nigar Hashimzade & Michael A. Thornton (ed.), Handbook of Research Methods and Applications in Empirical Macroeconomics, chapter 11, pages 249-265, Edward Elgar Publishing.
      • In Choi & Jorg Breitung, 2011. "Factor models," Working Papers 1121, Nam Duck-Woo Economic Research Institute, Sogang University (Former Research Institute for Market Economy), revised Dec 2011.
    13. Li, Hongjun & Li, Qi & Shi, Yutang, 2017. "Determining the number of factors when the number of factors can increase with sample size," Journal of Econometrics, Elsevier, vol. 197(1), pages 76-86.
    14. Bo Zhang & Jiti Gao & Guangming Pan & Yanrong Yang, 2023. "Eigen-Analysis for High-Dimensional Time Series Clustering," Monash Econometrics and Business Statistics Working Papers 22/23, Monash University, Department of Econometrics and Business Statistics.
    15. Catherine Doz & Peter Fuleky, 2019. "Dynamic Factor Models," PSE Working Papers halshs-02262202, HAL.
    16. Catherine Doz & Peter Fuleky, 2019. "Dynamic Factor Models," Working Papers halshs-02262202, HAL.
    17. GUO-FITOUSSI, Liang, 2013. "A Comparison of the Finite Sample Properties of Selection Rules of Factor Numbers in Large Datasets," MPRA Paper 50005, University Library of Munich, Germany.
    18. Alain-Philippe Fortin & Patrick Gagliardini & O. Scaillet, 2022. "Eigenvalue tests for the number of latent factors in short panels," Swiss Finance Institute Research Paper Series 22-81, Swiss Finance Institute.
    19. Fan, Jianqing & Liao, Yuan & Shi, Xiaofeng, 2015. "Risks of large portfolios," Journal of Econometrics, Elsevier, vol. 186(2), pages 367-387.
    20. Jushan Bai & Serena Ng, 2020. "Simpler Proofs for Approximate Factor Models of Large Dimensions," Papers 2008.00254, arXiv.org.

    More about this item

    Keywords

    factor model; high-dimensional data; principal component analysis; spiked empirical eigenvalue.;
    All these keywords.

    JEL classification:

    • C21 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models
    • C32 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Time-Series Models; Dynamic Quantile Regressions; Dynamic Treatment Effect Models; Diffusion Processes; State Space Models
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:msh:ebswps:2019-31. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Professor Xibin Zhang (email available below). General contact details of provider: https://edirc.repec.org/data/dxmonau.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.