IDEAS home Printed from https://ideas.repec.org/p/pra/mprapa/53068.html
   My bibliography  Save this paper

A data-based power transformation for compositional data

Author

Listed:
  • T. Tsagris, Michail
  • Preston, Simon
  • T.A. Wood, Andrew

Abstract

Compositional data analysis is carried out either by neglecting the compositional constraint and applying standard multivariate data analysis, or by transforming the data using the logs of the ratios of the components. In this work we examine a more general transformation which includes both approaches as special cases. It is a power transformation and involves a single parameter�. The transformation has two equivalent versions. The �first is the stay-in-the-simplex version. This expression is the power transformation as de�fined by Aitchison (1986). The second version, which is a linear transformation of the stay-in-the-simplex, is a Box-Cox type transformation. We call the second version the isometric �alpha-transformation because of the multiplication with the Helmert sub-matrix. We discuss a parametric way of estimating the value of alpha�, which is maximization of its pro�le like-lihood (assuming multivariate normality of the transformed data) and the equivalence between the two versions is exhibited. Other ways include maximization of the correct classi�cation probability in discriminant analysis and maximization of the pseudo-R2 in linear regression. We examine the relationship between the transformation, the raw data approach and the isometric log-ratio transformation. Furthermore, we also de�fine a suitable family of metrics corresponding to the family of �alpha-transformation and consider the corresponding family of Fr�echet means.

Suggested Citation

  • T. Tsagris, Michail & Preston, Simon & T.A. Wood, Andrew, 2011. "A data-based power transformation for compositional data," MPRA Paper 53068, University Library of Munich, Germany.
  • Handle: RePEc:pra:mprapa:53068
    as

    Download full text from publisher

    File URL: https://mpra.ub.uni-muenchen.de/53068/1/MPRA_paper_53068.pdf
    File Function: original version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. ,, 2003. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 19(4), pages 691-705, August.
    2. ,, 2003. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 19(5), pages 879-883, October.
    3. ,, 2003. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 19(6), pages 1195-1198, December.
    4. ,, 2003. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 19(1), pages 225-228, February.
    5. ,, 2003. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 19(2), pages 411-413, April.
    6. M. J. Baxter, 1995. "Standardization and Transformation in Principal Component Analysis, with Applications to Archaeometry," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 44(4), pages 513-527, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Tsagris, Michail & Preston, Simon & T.A. Wood, Andrew, 2016. "Improved classi cation for compositional data using the $\alpha$-transformation," MPRA Paper 67657, University Library of Munich, Germany.
    2. Michail Tsagris & Simon Preston & Andrew T. A. Wood, 2016. "Improved Classification for Compositional Data Using the α-transformation," Journal of Classification, Springer;The Classification Society, vol. 33(2), pages 243-261, July.
    3. Tsagris, Michail, 2015. "Regression analysis with compositional data containing zero values," MPRA Paper 67868, University Library of Munich, Germany.
    4. Yannis Pantazis & Michail Tsagris & Andrew T. A. Wood, 2019. "Gaussian Asymptotic Limits for the α-transformation in the Analysis of Compositional Data," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 81(1), pages 63-82, February.
    5. Tsagris, Michail & Preston, Simon & T.A. Wood, Andrew, 2016. "Nonparametric hypothesis testing for equality of means on the simplex," MPRA Paper 72771, University Library of Munich, Germany.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yakut, Oguz, 2021. "Implementation of hydraulically driven barrel shooting control by utilizing artificial neural networks," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 190(C), pages 1206-1223.
    2. X. Qin & G. Huang, 2009. "An Inexact Chance-constrained Quadratic Programming Model for Stream Water Quality Management," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 23(4), pages 661-695, March.
    3. Md. Yousuf Gazi & Khandakar Tahmida Tafhim, 2019. "Investigation of Heavy-mineral Deposits Using Multispectral Satellite Imagery in the Eastern Coastal Margin of Bangladesh," Earth Sciences Malaysia (ESMY), Zibeline International Publishing, vol. 3(2), pages 16-22, October.
    4. Billionnet, Alain, 2011. "Solving the probabilistic reserve selection problem," Ecological Modelling, Elsevier, vol. 222(3), pages 546-554.
    5. Minghe Sun, 2005. "Warm-Start Routines for Solving Augmented Weighted Tchebycheff Network Programs in Multiple-Objective Network Programming," INFORMS Journal on Computing, INFORMS, vol. 17(4), pages 422-437, November.
    6. François Clautiaux & Cláudio Alves & José Valério de Carvalho & Jürgen Rietz, 2011. "New Stabilization Procedures for the Cutting Stock Problem," INFORMS Journal on Computing, INFORMS, vol. 23(4), pages 530-545, November.
    7. Eichengreen, Barry & Kletzer, Kenneth & Mody, Ashoka, 2003. "Crisis Resolution: Next Steps," Santa Cruz Center for International Economics, Working Paper Series qt4cj974r4, Center for International Economics, UC Santa Cruz.
    8. Tansel, Aysit & Karao?lan, Deniz, 2016. "The Causal Effect of Education on Health Behaviors: Evidence from Turkey," IZA Discussion Papers 10020, Institute of Labor Economics (IZA).
    9. Di Feng & Bettina Klaus, 2022. "Preference revelation games and strict cores of multiple‐type housing market problems," International Journal of Economic Theory, The International Society for Economic Theory, vol. 18(1), pages 61-76, March.
    10. Anna Scherbina, 2021. "Assessing the Optimality of a COVID Lockdown in the United States," Economics of Disasters and Climate Change, Springer, vol. 5(2), pages 177-201, July.
    11. John McKay, 2005. "How Significant and Effective are North Korea's "Market Reforms"?," Global Economic Review, Taylor & Francis Journals, vol. 34(1), pages 83-97.
    12. Timothy K.M. Beatty & Erling Røed Larsen & Dag Einar Sommervoll, 2005. "Measuring the Price of Housing Consumption for Owners in the CPI," Discussion Papers 427, Statistics Norway, Research Department.
    13. Marco Bianchi & Carlos Tapia & Ikerne del Valle, 2020. "Monitoring domestic material consumption at lower territorial levels: A novel data downscaling method," Journal of Industrial Ecology, Yale University, vol. 24(5), pages 1074-1087, October.
    14. Sonmez, Tayfun & Utku Unver, M., 2005. "House allocation with existing tenants: an equivalence," Games and Economic Behavior, Elsevier, vol. 52(1), pages 153-185, July.
    15. Juarez, Ruben, 2013. "Group strategyproof cost sharing: The role of indifferences," Games and Economic Behavior, Elsevier, vol. 82(C), pages 218-239.
    16. Bustillo, Inés & Velloso, Helvia & Vézina, François, 2006. "The Canadian retirement income system," Documentos de Proyectos 3682, Naciones Unidas Comisión Económica para América Latina y el Caribe (CEPAL).
    17. Melega, Gislaine Mara & de Araujo, Silvio Alexandre & Jans, Raf, 2018. "Classification and literature review of integrated lot-sizing and cutting stock problems," European Journal of Operational Research, Elsevier, vol. 271(1), pages 1-19.
    18. Roth, Alvin E. & Sonmez, Tayfun & Utku Unver, M., 2005. "Pairwise kidney exchange," Journal of Economic Theory, Elsevier, vol. 125(2), pages 151-188, December.
    19. Martino Bardi & Peter Caines & Italo Capuzzo Dolcetta, 2013. "Preface: DGAA Special Issue on Mean Field Games," Dynamic Games and Applications, Springer, vol. 3(4), pages 443-445, December.
    20. repec:dau:papers:123456789/5389 is not listed on IDEAS
    21. Robert Hahn & Paul Tetlock, 2006. "A New Approach for Regulating Information Markets," Journal of Regulatory Economics, Springer, vol. 29(3), pages 265-281, May.

    More about this item

    Keywords

    Compositional data; power transformation; alpha; Frechet mean;
    All these keywords.

    JEL classification:

    • C89 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Other

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pra:mprapa:53068. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Joachim Winter (email available below). General contact details of provider: https://edirc.repec.org/data/vfmunde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.