IDEAS home Printed from https://ideas.repec.org/p/clb/wpaper/201209.html
   My bibliography  Save this paper

Weighting Distance Matrices Using Rank Correlations

Author

Listed:
  • Ilaria Lucrezia Amerise

    ()

  • Agostino Tarsitano

    () (Dipartimento di Economia e Statistica, Università della Calabria)

Abstract

In a number of applications of multivariate analysis, the data matrix is not fully observed. Instead a set of distance matrices on the same entities is available. A reasonable strategy to construct a global distance matrix is to compute a weighted average of the partial distance matrices, provided that an appropriate system of weights can be defined. The Distatis method developed by Abdi et al. (2005) is a three-step procedure for computing the global distance matrix. An important aspect of that procedure is the computation of the vector correlation coefficient (RV) to measure the similarity between partial distance matrices. The RV coefficient is based on the Pearson product moment correlation coeffcient, which is highly prone to the effects of outliers. We are convinced that, in many measurable phenomena, the relationships between distances are far more likely to be ordinal than interval in nature, and it is therefore preferable to adopt an approach appropriate to ordinal data. The goal of our paper is to revise the system of weights of the Distatis procedure substituting the conventional Pearson coefficient with rank correlations that are less affected by errors of measurement, perturbation or presence of outliers in the data. In the light of our findings on real and simulated data sets, we recommend the use of a speci c coefficient of rank correlation to replace, where necessary, the conventional vector correlation.

Suggested Citation

  • Ilaria Lucrezia Amerise & Agostino Tarsitano, 2012. "Weighting Distance Matrices Using Rank Correlations," Working Papers 201209, Università della Calabria, Dipartimento di Economia, Statistica e Finanza "Giovanni Anania" - DESF.
  • Handle: RePEc:clb:wpaper:201209
    as

    Download full text from publisher

    File URL: http://www.ecostat.unical.it/RePEc/WorkingPapers/WP09_2012.pdf
    File Function: First version, 2012-12
    Download Restriction: no

    References listed on IDEAS

    as
    1. Vladimir Batagelj & Matevz Bren, 1995. "Comparing resemblance measures," Journal of Classification, Springer;The Classification Society, vol. 12(1), pages 73-90, March.
    2. Francis Cailliez, 1983. "The analytical solution of the additive constant problem," Psychometrika, Springer;The Psychometric Society, vol. 48(2), pages 305-308, June.
    3. Véronique Campbell & Pierre Legendre & François-Joseph Lapointe, 2009. "Assessing Congruence Among Ultrametric Distance Matrices," Journal of Classification, Springer;The Classification Society, vol. 26(1), pages 103-117, April.
    Full references (including those not matched with items on IDEAS)

    More about this item

    Keywords

    Distatis; Ordinal data; Vector rank correlation;

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:clb:wpaper:201209. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Giovanni Dodero). General contact details of provider: http://edirc.repec.org/data/decalit.html .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.