Printed from https://ideas.repec.org/a/eee/jmvana/v183y2021ics0047259x20302967.html

Canonical correlation analysis for elliptical copulas

Author

Listed:
  • Langworthy, Benjamin W.
  • Stephens, Rebecca L.
  • Gilmore, John H.
  • Fine, Jason P.

Abstract

Canonical correlation analysis (CCA) is a common method used to estimate the associations between two different sets of variables by maximizing the Pearson correlation between linear combinations of the two sets of variables. We propose a version of CCA for transelliptical distributions with an elliptical copula using pairwise Kendall’s tau to estimate a latent scatter matrix. Because Kendall’s tau relies only on the ranks of the data, this method does not make any assumptions about the marginal distributions of the variables and is valid when moments do not exist. We establish consistency and asymptotic normality for canonical directions and correlations estimated using Kendall’s tau. Simulations indicate that this estimator outperforms standard CCA for data generated from heavy-tailed elliptical distributions. Our method also identifies more meaningful relationships when the marginal distributions are skewed. We also propose a method for testing for non-zero canonical correlations using bootstrap methods. This testing procedure does not require any assumptions on the joint distribution of the variables and works for all elliptical copulas. This is in contrast to permutation tests, which are only valid when data are generated from a distribution with a Gaussian copula. This method’s practical utility is shown in an analysis of the association between radial diffusivity in white matter tracts and cognitive test scores for six-year-old children from the Early Brain Development Study at UNC-Chapel Hill. An R package implementing this method is available at github.com/blangworthy/transCCA.
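
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the transCCA package or its API; the function names (kendall_scatter, inv_sqrt, kendall_cca) are hypothetical. It assumes the latent scatter matrix is estimated with the standard relation r = sin(pi * tau / 2) between Kendall's tau and the correlation parameter of an elliptical copula, and it floors small eigenvalues because a matrix assembled pairwise is not guaranteed to be positive definite; both choices are assumptions of this sketch and may differ from the paper's estimator.

```python
import numpy as np
from scipy.stats import kendalltau


def kendall_scatter(data):
    """Latent correlation estimate from pairwise Kendall's tau.

    Assumption: uses the sine transform r = sin(pi * tau / 2), which
    holds for elliptical copulas.
    """
    d = data.shape[1]
    r = np.eye(d)
    for i in range(d):
        for j in range(i + 1, d):
            tau, _ = kendalltau(data[:, i], data[:, j])
            r[i, j] = r[j, i] = np.sin(np.pi * tau / 2)
    return r


def inv_sqrt(m, floor=1e-8):
    """Inverse symmetric square root; eigenvalues are floored because the
    pairwise estimate may not be positive definite (illustrative fix)."""
    vals, vecs = np.linalg.eigh(m)
    vals = np.clip(vals, floor, None)
    return vecs @ np.diag(vals ** -0.5) @ vecs.T


def kendall_cca(x, y):
    """Canonical correlations and directions from the rank-based scatter."""
    p = x.shape[1]
    r = kendall_scatter(np.hstack([x, y]))
    rxx, rxy, ryy = r[:p, :p], r[:p, p:], r[p:, p:]
    kx, ky = inv_sqrt(rxx), inv_sqrt(ryy)
    # Singular values of Rxx^{-1/2} Rxy Ryy^{-1/2} are the canonical correlations.
    u, s, vt = np.linalg.svd(kx @ rxy @ ky)
    k = len(s)
    return s, kx @ u[:, :k], ky @ vt.T[:, :k]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    z = rng.standard_normal((300, 1))           # shared latent signal
    x = z + 0.5 * rng.standard_normal((300, 2))
    y = z + 0.5 * rng.standard_normal((300, 2))
    corr, a, b = kendall_cca(x, y)
    # Strictly increasing transforms of the margins leave the ranks,
    # and therefore the estimate, unchanged.
    corr_t, _, _ = kendall_cca(np.exp(x), y ** 3)
    print(corr, np.allclose(corr, corr_t))
```

Because only ranks enter the estimate, applying a strictly increasing transformation to any margin (here exp and cubing) leaves the canonical correlations unchanged, which is the property the abstract relies on for skewed or heavy-tailed margins.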

Suggested Citation

  • Langworthy, Benjamin W. & Stephens, Rebecca L. & Gilmore, John H. & Fine, Jason P., 2021. "Canonical correlation analysis for elliptical copulas," Journal of Multivariate Analysis, Elsevier, vol. 183(C).
  • Handle: RePEc:eee:jmvana:v:183:y:2021:i:c:s0047259x20302967
    DOI: 10.1016/j.jmva.2020.104715

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X20302967
    Download Restriction: Full text for ScienceDirect subscribers only


    References listed on IDEAS

    1. González, Ignacio & Déjean, Sébastien & Martin, Pascal G. P. & Baccini, Alain, 2008. "CCA: An R Package to Extend Canonical Correlation Analysis," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 23(i12).
    2. Andreas Alfons & Christophe Croux & Peter Filzmoser, 2017. "Robust Maximum Association Estimators," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(517), pages 436-445, January.
    3. Fang Han & Han Liu, 2014. "Scale-Invariant Sparse PCA on High-Dimensional Meta-Elliptical Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 275-287, March.
    4. Witten Daniela M & Tibshirani Robert J., 2009. "Extensions of Sparse Canonical Correlation Analysis with Applications to Genomic Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-27, June.
    5. Cambanis, Stamatis & Huang, Steel & Simons, Gordon, 1981. "On the theory of elliptically contoured distributions," Journal of Multivariate Analysis, Elsevier, vol. 11(3), pages 368-385, September.
    6. Taskinen, Sara & Croux, Christophe & Kankainen, Annaliisa & Ollila, Esa & Oja, Hannu, 2006. "Influence functions and efficiencies of the canonical correlation and vector estimates based on scatter and shape matrices," Journal of Multivariate Analysis, Elsevier, vol. 97(2), pages 359-384, February.
    7. Fang, Hong-Bin & Fang, Kai-Tai & Kotz, Samuel, 2002. "The Meta-elliptical Distributions with Given Marginals," Journal of Multivariate Analysis, Elsevier, vol. 82(1), pages 1-16, July.
    8. Owen, Joel & Rabinovitch, Ramon, 1983. "On the Class of Elliptical Distributions and Their Applications to the Theory of Portfolio Choice," Journal of Finance, American Finance Association, vol. 38(3), pages 745-752, June.
    9. Claudia Klüppelberg & Gabriel Kuhn, 2009. "Copula structure analysis," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(3), pages 737-753, June.
    10. Anderson, T. W., 1999. "Asymptotic Theory for Canonical Correlation Analysis," Journal of Multivariate Analysis, Elsevier, vol. 70(1), pages 1-29, July.
    11. Todorov, Valentin & Filzmoser, Peter, 2009. "An Object-Oriented Framework for Robust Multivariate Analysis," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 32(i03).
    12. Friedrich Schmid & Rafael Schmidt, 2007. "Nonparametric inference on multivariate versions of Blomqvist’s beta and related measures of tail dependence," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 66(3), pages 323-354, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yu, Long & He, Yong & Zhang, Xinsheng, 2019. "Robust factor number specification for large-dimensional elliptical factor model," Journal of Multivariate Analysis, Elsevier, vol. 174(C).
    2. Quessy, Jean-François & Durocher, Martin, 2019. "The class of copulas arising from squared distributions: Properties and inference," Econometrics and Statistics, Elsevier, vol. 12(C), pages 148-166.
    3. Jorge G. Adrover & Stella M. Donato, 2023. "Aspects of robust canonical correlation analysis, principal components and association," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(2), pages 623-650, June.
    4. Wang, Wenjia & Zhou, Yi-Hui, 2021. "Eigenvector-based sparse canonical correlation analysis: Fast computation for estimation of multiple canonical vectors," Journal of Multivariate Analysis, Elsevier, vol. 185(C).
    5. Battey, Heather & Linton, Oliver, 2014. "Nonparametric estimation of multivariate elliptic densities via finite mixture sieves," Journal of Multivariate Analysis, Elsevier, vol. 123(C), pages 43-67.
    6. Chuancun Yin, 2019. "Stochastic ordering of Gini indexes for multivariate elliptical random variables," Papers 1908.01943, arXiv.org, revised Sep 2019.
    7. Deimen, Inga & Szalay, Dezsö, 2014. "A Smooth, strategic communication," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 479, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
    8. Heather Battey & Oliver Linton, 2013. "Nonparametric estimation of multivariate elliptic densities via finite mixture sieves," CeMMAP working papers 15/13, Institute for Fiscal Studies.
    9. Alvarez, Agustín & Boente, Graciela & Kudraszow, Nadia, 2019. "Robust sieve estimators for functional canonical correlation analysis," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 46-62.
    10. Fotopoulos, Stergios B., 2017. "Symmetric Gaussian mixture distributions with GGC scales," Journal of Multivariate Analysis, Elsevier, vol. 160(C), pages 185-194.
    11. Tenenhaus, Arthur & Philippe, Cathy & Frouin, Vincent, 2015. "Kernel Generalized Canonical Correlation Analysis," Computational Statistics & Data Analysis, Elsevier, vol. 90(C), pages 114-131.
    12. Dmitry Kobak & Yves Bernaerts & Marissa A. Weis & Federico Scala & Andreas S. Tolias & Philipp Berens, 2021. "Sparse reduced‐rank regression for exploratory visualisation of paired multivariate data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(4), pages 980-1000, August.
    13. Hashorva, Enkelejd, 2009. "Asymptotics for Kotz Type III elliptical distributions," Statistics & Probability Letters, Elsevier, vol. 79(7), pages 927-935, April.
    14. He, Yong & Zhang, Liang & Ji, Jiadong & Zhang, Xinsheng, 2019. "Robust feature screening for elliptical copula regression model," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 568-582.
    15. Dong Hwan Oh & Andrew J. Patton, 2017. "Modeling Dependence in High Dimensions With Factor Copulas," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 35(1), pages 139-154, January.
    16. Dominique Guegan & Bertrand K. Hassani, 2019. "Risk Measurement," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-02119256, HAL.
    17. Nielsen, Lars Tyge, 1996. "Common knowledge: The case of linear regression," Journal of Mathematical Economics, Elsevier, vol. 26(3), pages 285-304.
    18. Framstad, N.C., 2011. "Portfolio separation properties of the skew-elliptical distributions, with generalizations," Statistics & Probability Letters, Elsevier, vol. 81(12), pages 1862-1866.
    19. Claudia Klüppelberg & Gabriel Kuhn, 2009. "Copula structure analysis," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(3), pages 737-753, June.
    20. Deng, Kaihua, 2016. "A test of asymmetric comovement for state-dependent stock returns," Journal of Empirical Finance, Elsevier, vol. 36(C), pages 68-85.
