IDEAS home Printed from https://ideas.repec.org/p/upf/upfgen/1044.html
   My bibliography  Save this paper

Power transformations in correspondence analysis

Author

Abstract

Power transformations of positive data tables, prior to applying the correspondence analysis algorithm, are shown to open up a family of methods with direct connections to the analysis of log-ratios. Two variations of this idea are illustrated. The first approach is simply to power the original data and perform a correspondence analysis – this method is shown to converge to unweighted log-ratio analysis as the power parameter tends to zero. The second approach is to apply the power transformation to the contingency ratios, that is the values in the table relative to expected values based on the marginals – this method converges to weighted log-ratio analysis, or the spectral map. Two applications are described: first, a matrix of population genetic data which is inherently two-dimensional, and second, a larger cross-tabulation with higher dimensionality, from a linguistic analysis of several books.

Suggested Citation

  • Michael Greenacre, 2007. "Power transformations in correspondence analysis," Economics Working Papers 1044, Department of Economics and Business, Universitat Pompeu Fabra, revised Mar 2008.
  • Handle: RePEc:upf:upfgen:1044
    as

    Download full text from publisher

    File URL: https://econ-papers.upf.edu/papers/1044.pdf
    File Function: Whole Paper
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Michael Greenacre, 2008. "Correspondence analysis of raw data," Economics Working Papers 1112, Department of Economics and Business, Universitat Pompeu Fabra, revised Jul 2009.
    2. John Aitchison & Michael Greenacre, 2002. "Biplots of compositional data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 51(4), pages 375-392, October.
    3. Nenadic, Oleg & Greenacre, Michael, 2007. "Correspondence Analysis in R, with Two- and Three-dimensional Graphics: The ca Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 20(i03).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Blasius, J. & Greenacre, M. & Groenen, P.J.F. & van de Velden, M., 2009. "Special issue on correspondence analysis and related methods," Computational Statistics & Data Analysis, Elsevier, vol. 53(8), pages 3103-3106, June.
    2. Eric J. Beh & Rosaria Lombardo, 2024. "Correspondence Analysis Using the Cressie–Read Family of Divergence Statistics," International Statistical Review, International Statistical Institute, vol. 92(1), pages 17-42, April.
    3. Antonello D’Ambra & Anna Crisci & Luigi D’Ambra, 2017. "Weighted log ratio analysis by means of Poisson factor models: a case study to evaluate the quality of the public services offered to the citizens," Quality & Quantity: International Journal of Methodology, Springer, vol. 51(2), pages 629-639, March.
    4. Michael Greenacre & Paul Lewi, 2009. "Distributional Equivalence and Subcompositional Coherence in the Analysis of Compositional Data, Contingency Tables and Ratio-Scale Measurements," Journal of Classification, Springer;The Classification Society, vol. 26(1), pages 29-54, April.
    5. Tsagris, Michail & Preston, Simon & T.A. Wood, Andrew, 2016. "Improved classi cation for compositional data using the $\alpha$-transformation," MPRA Paper 67657, University Library of Munich, Germany.
    6. Michail Tsagris & Simon Preston & Andrew T. A. Wood, 2016. "Improved Classification for Compositional Data Using the α-transformation," Journal of Classification, Springer;The Classification Society, vol. 33(2), pages 243-261, July.
    7. Ida Camminatiello & Antonello D’Ambra & Luigi D’Ambra, 2022. "The association in two-way ordinal contingency tables through global odds ratios," METRON, Springer;Sapienza Università di Roma, vol. 80(1), pages 9-22, April.
    8. Michael Greenacre, 2024. "The chiPower transformation: a valid alternative to logratio transformations in compositional data analysis," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 18(3), pages 769-796, September.
    9. Lombardo, Rosaria & Camminatiello, Ida & D'Ambra, Antonello & Beh, Eric J., 2021. "Assessing the Italian tax courts system by weighted three-way log-ratio analysis," Socio-Economic Planning Sciences, Elsevier, vol. 73(C).
    10. Michael Greenacre, 2023. "The chi-square standardization, combined with Box-Cox transformation, is a valid alternative to transforming to logratios in compositional data analysis," Economics Working Papers 1857, Department of Economics and Business, Universitat Pompeu Fabra.
    11. Tsai, Arthur C. & Liou, Michelle & Simak, Maria & Cheng, Philip E., 2017. "On hyperbolic transformations to normality," Computational Statistics & Data Analysis, Elsevier, vol. 115(C), pages 250-266.
    12. J. L. Scealy & Patrice de Caritat & Eric C. Grunsky & Michail T. Tsagris & A. H. Welsh, 2015. "Robust Principal Component Analysis for Power Transformed Compositional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 136-148, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michael Greenacre & Paul Lewi, 2009. "Distributional Equivalence and Subcompositional Coherence in the Analysis of Compositional Data, Contingency Tables and Ratio-Scale Measurements," Journal of Classification, Springer;The Classification Society, vol. 26(1), pages 29-54, April.
    2. Michael Greenacre, 2012. "Fuzzy coding in constrained ordinations," Economics Working Papers 1325, Department of Economics and Business, Universitat Pompeu Fabra.
    3. Michael Greenacre, 2006. "Tying up the loose ends in simple correspondence analysis," Economics Working Papers 940, Department of Economics and Business, Universitat Pompeu Fabra.
    4. Michael Greenacre, 2003. "Singular value decomposition of matched matrices," Journal of Applied Statistics, Taylor & Francis Journals, vol. 30(10), pages 1101-1113.
    5. Udina, Frederic, 2005. "Interactive Biplot Construction," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 13(i05).
    6. Michael Greenacre, 2002. "Ratio maps and correspondence analysis," Economics Working Papers 598, Department of Economics and Business, Universitat Pompeu Fabra.
    7. Michael Greenacre, 2011. "The contributions of rare objects in correspondence analysis," Economics Working Papers 1278, Department of Economics and Business, Universitat Pompeu Fabra.
    8. Beaton, Derek & Chin Fatt, Cherise R. & Abdi, Hervé, 2014. "An ExPosition of multivariate analysis with the singular value decomposition in R," Computational Statistics & Data Analysis, Elsevier, vol. 72(C), pages 176-189.
    9. Michael Greenacre & Anna Torres, 2002. "Measuring asymmetries in brand associations using correspondence analysis," Economics Working Papers 630, Department of Economics and Business, Universitat Pompeu Fabra.
    10. Michael Greenacre & Rafael Pardo, 2004. "Subset correspondence analysis: Visualizing relationships among a selected set of response categories from a questionnaire survey," Economics Working Papers 791, Department of Economics and Business, Universitat Pompeu Fabra.
    11. B. Baris Alkan & Afsin Sahin, 2011. "Measuring inequalities in the distribution of health workers by bi-plot approach: The case of Turkey," Journal of Economics and Behavioral Studies, AMH International, vol. 2(2), pages 57-66.
    12. Eric Beh & Luigi D’Ambra, 2009. "Some Interpretative Tools for Non-Symmetrical Correspondence Analysis," Journal of Classification, Springer;The Classification Society, vol. 26(1), pages 55-76, April.
    13. Pilar García Gómez & Ángel López Nicolás, 2005. "Socio-economic inequalities in health in Catalonia," Hacienda Pública Española / Review of Public Economics, IEF, vol. 175(4), pages 103-121, december.
    14. Michael Greenacre, 2011. "A Simple Permutation Test for Clusteredness," Working Papers 555, Barcelona School of Economics.
    15. David Bholat & Stephen Hans & Pedro Santos & Cheryl Schonhardt-Bailey, 2015. "Text mining for central banks," Handbooks, Centre for Central Banking Studies, Bank of England, number 33, April.
    16. Michael Greenacre, 2016. "Selection and statistical analysis of compositional ratios," Economics Working Papers 1551, Department of Economics and Business, Universitat Pompeu Fabra.
    17. Rémi Bazillier & Nicolas Sirven, 2006. "Les normes fondamentales du travail contribuent-elles à réduire les inégalités ?," Revue Française d'Économie, Programme National Persée, vol. 21(2), pages 111-146.
    18. Giovanni C. Porzio & Giancarlo Ragozini & Domenico Vistocco, 2008. "On the use of archetypes as benchmarks," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 24(5), pages 419-437, September.
    19. Alfonso Gambardella & Walter Garcia Fontes, 1996. "European research funding and regional technological capabilities: Network composition analysis," Economics Working Papers 174, Department of Economics and Business, Universitat Pompeu Fabra.
    20. Javier Palarea-Albaladejo & Josep Martín-Fernández & Jesús Soto, 2012. "Dealing with Distances and Transformations for Fuzzy C-Means Clustering of Compositional Data," Journal of Classification, Springer;The Classification Society, vol. 29(2), pages 144-169, July.

    More about this item

    Keywords

    Box-Cox transformation; chi-square distance; contingency ratio; correspondence analysis; log-ratio analysis; power transformation; ratio data; singular value decomposition; spectral map;
    All these keywords.

    JEL classification:

    • C19 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Other
    • C88 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Other Computer Software

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:upf:upfgen:1044. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: http://www.econ.upf.edu/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.