IDEAS home Printed from https://ideas.repec.org/a/spr/jclass/v31y2014i1p107-128.html
   My bibliography  Save this article

The Canonical Analysis of Distance

Author

Listed:
  • John Gower
  • Niel Roux
  • Sugnet Gardner-Lubbe

Abstract

Canonical Variate Analysis (CVA) is one of the most useful of multivariate methods. It is concerned with separating between and within group variation among N samples from K populations with respect to p measured variables. Mahalanobis distance between the K group means can be represented as points in a (K - 1) dimensional space and approximated in a smaller space, with the variables shown as calibrated biplot axes. Within group variation may also be shown, together with circular confidence regions and other convex prediction regions, which may be used to discriminate new samples. This type of representation extends to what we term Analysis of Distance (AoD), whenever a Euclidean inter-sample distance is defined. Although the N × N distance matrix of the samples, which may be large, is required, eigenvalue calculations are needed only for the much smaller K × K matrix of distances between group centroids. All the ancillary information that is attached to a CVA analysis is available in an AoD analysis. We outline the theory and the R programs we developed to implement AoD by presenting two examples. Copyright Springer Science+Business Media New York 2014

Suggested Citation

  • John Gower & Niel Roux & Sugnet Gardner-Lubbe, 2014. "The Canonical Analysis of Distance," Journal of Classification, Springer;The Classification Society, vol. 31(1), pages 107-128, April.
  • Handle: RePEc:spr:jclass:v:31:y:2014:i:1:p:107-128
    DOI: 10.1007/s00357-014-9149-8
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s00357-014-9149-8
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s00357-014-9149-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gower, John C. & Ngouenet, Roger F., 2005. "Nonlinearity effects in multidimensional scaling," Journal of Multivariate Analysis, Elsevier, vol. 94(2), pages 344-365, June.
    2. W. Krzanowski, 1994. "Ordination in the presence of group structure, for general multivariate data," Journal of Classification, Springer;The Classification Society, vol. 11(2), pages 195-207, September.
    3. J. Gower & P. Legendre, 1986. "Metric and Euclidean properties of dissimilarity coefficients," Journal of Classification, Springer;The Classification Society, vol. 3(1), pages 5-48, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. John C. Gower & Niël J. Le Roux & Sugnet Gardner-Lubbe, 2022. "Properties of individual differences scaling and its interpretation," Statistical Papers, Springer, vol. 63(4), pages 1221-1245, August.
    2. Le Roux, Niël J. & Gardner-Lubbe, Sugnet & Gower, John C., 2014. "The analysis of distance of grouped data with categorical variables: Categorical canonical variate analysis," Journal of Multivariate Analysis, Elsevier, vol. 132(C), pages 9-24.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fry, J.T. & Slifko, Matt & Leman, Scotland, 2018. "Generalized biplots for stress-based multidimensionally scaled projections," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 340-353.
    2. repec:jss:jstsof:30:i12 is not listed on IDEAS
    3. la Grange, Anthony & le Roux, Niël & Gardner-Lubbe, Sugnet, 2009. "BiplotGUI: Interactive Biplots in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 30(i12).
    4. Vines, S.K., 2015. "Predictive nonlinear biplots: Maps and trajectories," Journal of Multivariate Analysis, Elsevier, vol. 140(C), pages 47-59.
    5. Patrick Groenen & Niël Roux & Sugnet Gardner-Lubbe, 2015. "Spline-based nonlinear biplots," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 9(2), pages 219-238, June.
    6. Guohuan Su & Adam Mertel & Sébastien Brosse & Justin M. Calabrese, 2023. "Species invasiveness and community invasibility of North American freshwater fish fauna revealed via trait-based analysis," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    7. Balepur, Prashant Narayan, 1998. "Impacts of Computer-Mediated Communication on Travel and Communication Patterns: The Davis Community Network Study," Institute of Transportation Studies, Research Reports, Working Papers, Proceedings qt6cb1f85c, Institute of Transportation Studies, UC Berkeley.
    8. Douglas L. Steinley & M. J. Brusco, 2019. "Using an Iterative Reallocation Partitioning Algorithm to Verify Test Multidimensionality," Journal of Classification, Springer;The Classification Society, vol. 36(3), pages 397-413, October.
    9. Anna Maria D’Arcangelis & Giulia Rotundo, 2016. "Complex Networks in Finance," Lecture Notes in Economics and Mathematical Systems, in: Pasquale Commendatore & Mariano Matilla-García & Luis M. Varela & Jose S. Cánovas (ed.), Complex Networks and Dynamics, pages 209-235, Springer.
    10. Carla Coltharp & Rene P Kessler & Jie Xiao, 2012. "Accurate Construction of Photoactivated Localization Microscopy (PALM) Images for Quantitative Measurements," PLOS ONE, Public Library of Science, vol. 7(12), pages 1-15, December.
    11. S. T. Buckland & Y. Yuan & E. Marcon, 2017. "Measuring temporal trends in biodiversity," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 101(4), pages 461-474, October.
    12. Matthijs Warrens, 2008. "On the Indeterminacy of Resemblance Measures for Binary (Presence/Absence) Data," Journal of Classification, Springer;The Classification Society, vol. 25(1), pages 125-136, June.
    13. Kivimäki, Ilkka & Shimbo, Masashi & Saerens, Marco, 2014. "Developments in the theory of randomized shortest paths with a comparison of graph node distances," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 393(C), pages 600-616.
    14. Alfonso Gutierrez-Lopez & Carlos Chávez & Carlos Díaz-Delgado, 2022. "Autocorrelation Ratio as a Measure of Inertia for the Classification of Extreme Events," Mathematics, MDPI, vol. 10(12), pages 1-15, June.
    15. Raffaella Calabrese & Galina Andreeva & Jake Ansell, 2019. "“Birds of a Feather” Fail Together: Exploring the Nature of Dependency in SME Defaults," Risk Analysis, John Wiley & Sons, vol. 39(1), pages 71-84, January.
    16. Jean-Baptiste Hasse, 2022. "Systemic risk: a network approach," Empirical Economics, Springer, vol. 63(1), pages 313-344, July.
    17. Heiberg, Jonas & Truffer, Bernhard & Binz, Christian, 2022. "Assessing transitions through socio-technical configuration analysis – a methodological framework and a case study in the water sector," Research Policy, Elsevier, vol. 51(1).
    18. Florian Schreiber, 2017. "Identification of customer groups in the German term life market: a benefit segmentation," Annals of Operations Research, Springer, vol. 254(1), pages 365-399, July.
    19. Joris Knoben & Leon A. G. Oerlemans & Annefleur R. Krijkamp & Keith G. Provan, 2018. "What Do They Know? The Antecedents of Information Accuracy Differentials in Interorganizational Networks," Organization Science, INFORMS, vol. 29(3), pages 471-488, June.
    20. Francesco Cannarile & Michele Compare & Francesco Di Maio & Enrico Zio, 2018. "A clustering approach for mining reliability big data for asset management," Journal of Risk and Reliability, , vol. 232(2), pages 140-150, April.
    21. Rosetta Lombardo, 2016. "Is there also a North–South Divide in the Diffusion of Crime? A Cluster Analysis of Italian Provinces," Review of Development Economics, Wiley Blackwell, vol. 20(2), pages 443-455, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jclass:v:31:y:2014:i:1:p:107-128. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.