
Weighted Euclidean biplots

Abstract

We construct a weighted Euclidean distance that approximates any distance or dissimilarity measure between individuals that is based on a rectangular cases-by-variables data matrix. In contrast to regular multidimensional scaling methods for dissimilarity data, the method leads to biplots of individuals and variables while preserving all the good properties of dimension-reduction methods that are based on the singular-value decomposition. The main benefits are the decomposition of variance into components along principal axes, which provide the numerical diagnostics known as contributions, and the estimation of nonnegative weights for each variable. The idea is inspired by the distance functions used in correspondence analysis and in principal component analysis of standardized data, where the normalizations inherent in the distances can be considered as differential weighting of the variables. In weighted Euclidean biplots we allow these weights to be unknown parameters, which are estimated from the data to maximize the fit to the chosen distances or dissimilarities. These weights are estimated using a majorization algorithm. Once this extra weight-estimation step is accomplished, the procedure follows the classical path in decomposing the matrix and displaying its rows and columns in biplots.
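As a rough illustration of the two ingredients named in the abstract, the sketch below computes a weighted Euclidean distance, d_ij = sqrt(sum_k w_k (x_ik - x_jk)^2), and then takes the singular-value decomposition of the column-centred matrix whose columns are scaled by sqrt(w_k). It is not the authors' implementation. The weights are hypothetical fixed values rather than estimates from a majorization algorithm, cases are given equal weight, and the function names `weighted_euclidean` and `weighted_biplot` are invented for this example. The column coordinates returned are simply the right singular vectors, so the paper's own scaling of the variables may differ.

```python
import numpy as np

def weighted_euclidean(X, w):
    """Pairwise distances d_ij = sqrt(sum_k w_k * (x_ik - x_jk)**2)."""
    X = np.asarray(X, dtype=float)
    w = np.asarray(w, dtype=float)
    if np.any(w < 0):
        raise ValueError("weights must be nonnegative")
    diff = X[:, None, :] - X[None, :, :]      # shape (n, n, p)
    return np.sqrt((w * diff ** 2).sum(axis=2))

def weighted_biplot(X, w):
    """SVD of the column-centred matrix with column k scaled by sqrt(w_k).

    Returns row coordinates F, column directions G, and the share of
    total variance carried by each principal axis.
    """
    X = np.asarray(X, dtype=float)
    S = (X - X.mean(axis=0)) * np.sqrt(np.asarray(w, dtype=float))
    U, d, Vt = np.linalg.svd(S, full_matrices=False)
    F = U * d                                 # row (case) coordinates
    G = Vt.T                                  # column (variable) directions
    explained = d ** 2 / (d ** 2).sum()       # variance share per axis
    return F, G, explained

# Toy data: 5 cases by 3 variables on very different scales.
X = np.array([[1.0, 10.0, 0.2],
              [2.0, 14.0, 0.1],
              [3.0,  9.0, 0.4],
              [4.0, 20.0, 0.3],
              [5.0, 12.0, 0.5]])
w = np.array([1.0, 0.05, 4.0])                # hypothetical weights, not estimated

D = weighted_euclidean(X, w)
F, G, explained = weighted_biplot(X, w)

# In the full-dimensional solution, ordinary Euclidean distances between
# the rows of F reproduce the weighted Euclidean distances between cases.
D_from_F = weighted_euclidean(F, np.ones(F.shape[1]))
print(np.allclose(D, D_from_F))
print(explained)
```

Plotting the first two columns of `F` as points and the first two columns of `G` as arrows would give a two-dimensional biplot, with `explained` indicating how much variance those two axes retain.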

Suggested Citation

  • Michael Greenacre & Patrick J. F. Groenen, 2013. "Weighted Euclidean biplots," Economics Working Papers 1380, Department of Economics and Business, Universitat Pompeu Fabra.
  • Handle: RePEc:upf:upfgen:1380

    Download full text from publisher

    File URL: https://econ-papers.upf.edu/papers/1380.pdf
    File Function: Whole Paper
    Download Restriction: no

    References listed on IDEAS

    1. Michael Greenacre, 2009. "Contribution biplots," Economics Working Papers 1162, Department of Economics and Business, Universitat Pompeu Fabra, revised Jan 2011.
    2. Michael Greenacre, 2008. "Correspondence analysis of raw data," Economics Working Papers 1112, Department of Economics and Business, Universitat Pompeu Fabra, revised Jul 2009.
    3. Michael Greenacre, 2010. "Biplots in Practice," Books, Fundacion BBVA / BBVA Foundation, number 2011113, October.
    4. J. Gower & P. Legendre, 1986. "Metric and Euclidean properties of dissimilarity coefficients," Journal of Classification, Springer;The Classification Society, vol. 3(1), pages 5-48, March.
    5. de Leeuw, Jan & Mair, Patrick, 2009. "Multidimensional Scaling Using Majorization: SMACOF in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 31(i03).
    6. Jan de Leeuw, 1988. "Convergence of the majorization method for multidimensional scaling," Journal of Classification, Springer;The Classification Society, vol. 5(2), pages 163-180, September.

    Citations


    Cited by:

    1. Fry, J.T. & Slifko, Matt & Leman, Scotland, 2018. "Generalized biplots for stress-based multidimensionally scaled projections," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 340-353.
    2. Giuseppe Bove & Akinori Okada, 2018. "Methods for the analysis of asymmetric pairwise relationships," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(1), pages 5-31, March.
    3. Federico Gobbo, 2017. "Beyond the Nation-State? The Ideology of the Esperanto Movement between Neutralism and Multilingualism," Social Inclusion, Cogitatio Press, vol. 5(4), pages 38-47.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michael Greenacre, 2012. "Fuzzy coding in constrained ordinations," Economics Working Papers 1325, Department of Economics and Business, Universitat Pompeu Fabra.
    2. Michael Greenacre, 2014. "Size and shape in the measurement of multivariate proximity," Economics Working Papers 1444, Department of Economics and Business, Universitat Pompeu Fabra.
    3. Michael Greenacre, 2011. "The contributions of rare objects in correspondence analysis," Economics Working Papers 1278, Department of Economics and Business, Universitat Pompeu Fabra.
    4. Groenen, P.J.F. & Borg, I., 2013. "The Past, Present, and Future of Multidimensional Scaling," Econometric Institute Research Papers EI 2013-07, Erasmus University Rotterdam, Erasmus School of Economics (ESE), Econometric Institute.
    5. Michael Greenacre, 2004. "Weighted metric multidimensional scaling," Economics Working Papers 777, Department of Economics and Business, Universitat Pompeu Fabra.
    6. Gruenhage, Gina & Opper, Manfred & Barthelme, Simon, 2016. "Visualizing the effects of a changing distance on data using continuous embeddings," Computational Statistics & Data Analysis, Elsevier, vol. 104(C), pages 51-65.
    7. Guohuan Su & Adam Mertel & Sébastien Brosse & Justin M. Calabrese, 2023. "Species invasiveness and community invasibility of North American freshwater fish fauna revealed via trait-based analysis," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    8. Eric Beh & Luigi D’Ambra, 2009. "Some Interpretative Tools for Non-Symmetrical Correspondence Analysis," Journal of Classification, Springer;The Classification Society, vol. 26(1), pages 55-76, April.
    9. Pilar García Gómez & Ángel López Nicolás, 2005. "Socio-economic inequalities in health in Catalonia," Hacienda Pública Española / Review of Public Economics, IEF, vol. 175(4), pages 103-121, December.
    10. Patrick Groenen & Rudolf Mathar & Willem Heiser, 1995. "The majorization approach to multidimensional scaling for Minkowski distances," Journal of Classification, Springer;The Classification Society, vol. 12(1), pages 3-19, March.
    11. Alfonso Gambardella & Walter Garcia Fontes, 1996. "European research funding and regional technological capabilities: Network composition analysis," Economics Working Papers 174, Department of Economics and Business, Universitat Pompeu Fabra.
    12. Balepur, Prashant Narayan, 1998. "Impacts of Computer-Mediated Communication on Travel and Communication Patterns: The Davis Community Network Study," Institute of Transportation Studies, Research Reports, Working Papers, Proceedings qt6cb1f85c, Institute of Transportation Studies, UC Berkeley.
    13. Douglas L. Steinley & M. J. Brusco, 2019. "Using an Iterative Reallocation Partitioning Algorithm to Verify Test Multidimensionality," Journal of Classification, Springer;The Classification Society, vol. 36(3), pages 397-413, October.
    14. Carlo Ciccarelli & Tommaso Proietti, 2013. "Patterns of industrial specialisation in post-Unification Italy," Scandinavian Economic History Review, Taylor & Francis Journals, vol. 61(3), pages 259-286, November.
    15. Anna Maria D’Arcangelis & Giulia Rotundo, 2016. "Complex Networks in Finance," Lecture Notes in Economics and Mathematical Systems, in: Pasquale Commendatore & Mariano Matilla-García & Luis M. Varela & Jose S. Cánovas (ed.), Complex Networks and Dynamics, pages 209-235, Springer.
    16. Carla Coltharp & Rene P Kessler & Jie Xiao, 2012. "Accurate Construction of Photoactivated Localization Microscopy (PALM) Images for Quantitative Measurements," PLOS ONE, Public Library of Science, vol. 7(12), pages 1-15, December.
    17. Malcolm Dow & Peter Willett & Roderick McDonald & Belver Griffith & Michael Greenacre & Peter Bryant & Daniel Wartenberg & Ove Frank, 1987. "Book reviews," Journal of Classification, Springer;The Classification Society, vol. 4(2), pages 245-278, September.
    18. Marek Walesiak & Andrzej Dudek, 2017. "Selecting the Optimal Multidimensional Scaling Procedure for Metric Data With R Environment," Statistics in Transition New Series, Polish Statistical Association, vol. 18(3), pages 521-540, September.
    19. S. T. Buckland & Y. Yuan & E. Marcon, 2017. "Measuring temporal trends in biodiversity," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 101(4), pages 461-474, October.
    20. Jurlin, Kresimir & Malekovic, Sanja & Puljiz, Jaksa & Cziraky, Dario & Polic, Mario, 2002. "Covariance structure analysis of regional development data: an application to municipality development assessment," ERSA conference papers ersa02p469, European Regional Science Association.

    More about this item

    Keywords

    biplot; correspondence analysis; distance; majorization; multidimensional scaling; singular-value decomposition; weighted least squares;

    JEL classification:

    • C19 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Other
    • C88 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Other Computer Software
