IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2306.16393.html
   My bibliography  Save this paper

High-Dimensional Canonical Correlation Analysis

Author

Listed:
  • Anna Bykhovskaya
  • Vadim Gorin

Abstract

This paper studies high-dimensional canonical correlation analysis (CCA) with an emphasis on the vectors that define canonical variables. The paper shows that when two dimensions of data grow to infinity jointly and proportionally, the classical CCA procedure for estimating those vectors fails to deliver a consistent estimate. This provides the first result on the impossibility of identification of canonical variables in the CCA procedure when all dimensions are large. As a countermeasure, the paper derives the magnitude of the estimation error, which can be used in practice to assess the precision of CCA estimates. Applications of the results to cyclical vs. non-cyclical stocks and to a limestone grassland data set are provided.

Suggested Citation

  • Anna Bykhovskaya & Vadim Gorin, 2023. "High-Dimensional Canonical Correlation Analysis," Papers 2306.16393, arXiv.org, revised Aug 2023.
  • Handle: RePEc:arx:papers:2306.16393
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2306.16393
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Onatski, Alexei & Wang, Chen, 2019. "Extreme canonical correlations and high-dimensional cointegration analysis," Journal of Econometrics, Elsevier, vol. 212(1), pages 307-322.
    2. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    3. Seung C. Ahn & Alex R. Horenstein, 2013. "Eigenvalue Ratio Test for the Number of Factors," Econometrica, Econometric Society, vol. 81(3), pages 1203-1227, May.
    4. Alexei Onatski, 2009. "Testing Hypotheses About the Number of Factors in Large Factor Models," Econometrica, Econometric Society, vol. 77(5), pages 1447-1479, September.
    5. Flavio Cunha & James J. Heckman & Susanne M. Schennach, 2010. "Estimating the Technology of Cognitive and Noncognitive Skill Formation," Econometrica, Econometric Society, vol. 78(3), pages 883-931, May.
    6. J.-P. Bouchaud & L. Laloux & M. A. Miceli & M. Potters, 2007. "Large dimension forecasting models and random singular value spectra," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 55(2), pages 201-207, January.
    7. Bai, Jushan & Ng, Serena, 2007. "Determining the Number of Primitive Shocks in Factor Models," Journal of Business & Economic Statistics, American Statistical Association, vol. 25, pages 52-60, January.
    8. Onatski, Alexei, 2012. "Asymptotics of the principal components estimator of large factor models with weakly influential factors," Journal of Econometrics, Elsevier, vol. 168(2), pages 244-258.
    9. Johansen, Soren, 1988. "Statistical analysis of cointegration vectors," Journal of Economic Dynamics and Control, Elsevier, vol. 12(2-3), pages 231-254.
    10. Benaych-Georges, Florent & Nadakuditi, Raj Rao, 2012. "The singular values and vectors of low rank perturbations of large rectangular random matrices," Journal of Multivariate Analysis, Elsevier, vol. 111(C), pages 120-135.
    11. Johnstone, Iain M. & Lu, Arthur Yu, 2009. "On Consistency and Sparsity for Principal Components Analysis in High Dimensions," Journal of the American Statistical Association, American Statistical Association, vol. 104(486), pages 682-693.
    12. Hallin, Marc & Liska, Roman, 2007. "Determining the Number of Factors in the General Dynamic Factor Model," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 603-617, June.
    13. Jianqing Fan & Yuan Liao & Han Liu, 2016. "An overview of the estimation of large covariance and precision matrices," Econometrics Journal, Royal Economic Society, vol. 19(1), pages 1-32, February.
    14. Jianqing Fan & Jinchi Lv & Lei Qi, 2011. "Sparse High-Dimensional Models in Economics," Annual Review of Economics, Annual Reviews, vol. 3(1), pages 291-317, September.
    15. Baik, Jinho & Silverstein, Jack W., 2006. "Eigenvalues of large sample covariance matrices of spiked population models," Journal of Multivariate Analysis, Elsevier, vol. 97(6), pages 1382-1408, July.
    16. Bai, Jushan & Ng, Serena, 2008. "Large Dimensional Factor Analysis," Foundations and Trends(R) in Econometrics, now publishers, vol. 3(2), pages 89-163, June.
    17. Anderson, T. W., 1999. "Asymptotic Theory for Canonical Correlation Analysis," Journal of Multivariate Analysis, Elsevier, vol. 70(1), pages 1-29, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Paulo M. M. Rodrigues & Mirjam Salish & Nazarii Salish, 2024. "Saving for sunny days: The impact of climate (change) on consumer prices in the euro area," Papers 2401.03740, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mao Takongmo, Charles Olivier & Stevanovic, Dalibor, 2015. "Selection Of The Number Of Factors In Presence Of Structural Instability: A Monte Carlo Study," L'Actualité Economique, Société Canadienne de Science Economique, vol. 91(1-2), pages 177-233, Mars-Juin.
    2. Stock, J.H. & Watson, M.W., 2016. "Dynamic Factor Models, Factor-Augmented Vector Autoregressions, and Structural Vector Autoregressions in Macroeconomics," Handbook of Macroeconomics, in: J. B. Taylor & Harald Uhlig (ed.), Handbook of Macroeconomics, edition 1, volume 2, chapter 0, pages 415-525, Elsevier.
    3. Poncela, Pilar & Ruiz, Esther & Miranda, Karen, 2021. "Factor extraction using Kalman filter and smoothing: This is not just another survey," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1399-1425.
    4. Alexander Chudik & M. Hashem Pesaran, 2013. "Large panel data models with cross-sectional dependence: a survey," Globalization Institute Working Papers 153, Federal Reserve Bank of Dallas.
    5. Matteo Barigozzi & Marco Lippi & Matteo Luciani, 2014. "Dynamic Factor Models, Cointegration and Error Correction Mechanisms," Working Papers ECARES ECARES 2014-14, ULB -- Universite Libre de Bruxelles.
    6. Francisco Corona & Pilar Poncela & Esther Ruiz, 2017. "Determining the number of factors after stationary univariate transformations," Empirical Economics, Springer, vol. 53(1), pages 351-372, August.
    7. Pilar Poncela & Esther Ruiz, 2016. "Small- Versus Big-Data Factor Extraction in Dynamic Factor Models: An Empirical Assessment," Advances in Econometrics, in: Dynamic Factor Models, volume 35, pages 401-434, Emerald Group Publishing Limited.
    8. Francisco Corona & Pedro Orraca, 2019. "Remittances in Mexico and their unobserved components," The Journal of International Trade & Economic Development, Taylor & Francis Journals, vol. 28(8), pages 1047-1066, November.
    9. Lu, Xun & Su, Liangjun, 2016. "Shrinkage estimation of dynamic panel data models with interactive fixed effects," Journal of Econometrics, Elsevier, vol. 190(1), pages 148-175.
    10. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    11. Yinchu Zhu, 2019. "How well can we learn large factor models without assuming strong factors?," Papers 1910.10382, arXiv.org, revised Nov 2019.
    12. Zhao Zhao & Guowei Cui & Shaoping Wang, 2017. "A Monte Carlo comparison of estimating the number of dynamic factors," Empirical Economics, Springer, vol. 53(3), pages 1217-1241, November.
    13. Nathan Bedock & Dalibor Stevanović, 2017. "An empirical study of credit shock transmission in a small open economy," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 50(2), pages 541-570, May.
    14. Philipp Gersing & Christoph Rust & Manfred Deistler, 2023. "Weak Factors are Everywhere," Papers 2307.10067, arXiv.org, revised Jan 2024.
    15. Francisco Corona & Graciela González-Farías & Pedro Orraca, 2017. "A dynamic factor model for the Mexican economy: are common trends useful when predicting economic activity?," Latin American Economic Review, Springer;Centro de Investigaciòn y Docencia Económica (CIDE), vol. 26(1), pages 1-35, December.
    16. GUO-FITOUSSI, Liang, 2013. "A Comparison of the Finite Sample Properties of Selection Rules of Factor Numbers in Large Datasets," MPRA Paper 50005, University Library of Munich, Germany.
    17. Gagliardini, Patrick & Ossola, Elisa & Scaillet, Olivier, 2019. "A diagnostic criterion for approximate factor structure," Journal of Econometrics, Elsevier, vol. 212(2), pages 503-521.
    18. Barigozzi, Matteo & Trapani, Lorenzo, 2020. "Sequential testing for structural stability in approximate factor models," Stochastic Processes and their Applications, Elsevier, vol. 130(8), pages 5149-5187.
    19. Matteo Barigozzi & Marco Lippi & Matteo Luciani, 2016. "Non-Stationary Dynamic Factor Models for Large Datasets," Finance and Economics Discussion Series 2016-024, Board of Governors of the Federal Reserve System (U.S.).
    20. Steffen R. Henzel & Malte Rengel, 2017. "Dimensions Of Macroeconomic Uncertainty: A Common Factor Analysis," Economic Inquiry, Western Economic Association International, vol. 55(2), pages 843-877, April.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2306.16393. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.