IDEAS home Printed from https://ideas.repec.org/p/pra/mprapa/72769.html
   My bibliography  Save this paper

A novel, divergence based, regression for compositional data

Author

Listed:
  • Tsagris, Michail

Abstract

In compositional data, an observation is a vector with non-negative components which sum to a constant, typically 1. Data of this type arise in many areas, such as geology, archaeology, biology, economics and political science among others. The goal of this paper is to propose a new, divergence based, regression modelling technique for compositional data. To do so, a recently proved metric which is a special case of the Jensen-Shannon divergence is employed. A strong advantage of this new regression technique is that zeros are naturally handled. An example with real data and simulation studies are presented and are both compared with the log-ratio based regression suggested by Aitchison in 1986.

Suggested Citation

  • Tsagris, Michail, 2015. "A novel, divergence based, regression for compositional data," MPRA Paper 72769, University Library of Munich, Germany.
  • Handle: RePEc:pra:mprapa:72769
    as

    Download full text from publisher

    File URL: https://mpra.ub.uni-muenchen.de/72769/1/MPRA_paper_72769.pdf
    File Function: original version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Kateri, Maria & Agresti, Alan, 2010. "A generalized regression model for a binary response," Statistics & Probability Letters, Elsevier, vol. 80(2), pages 89-95, January.
    2. Ferdinand Österreicher & Igor Vajda, 2003. "A new class of metric divergences on probability spaces and its applicability in statistics," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 55(3), pages 639-653, September.
    3. J. L. Scealy & A. H. Welsh, 2011. "Regression for compositional data by using distributions defined on the hypersphere," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(3), pages 351-375, June.
    4. Gueorguieva, Ralitza & Rosenheck, Robert & Zelterman, Daniel, 2008. "Dirichlet component regression and its applications to psychiatric data," Computational Statistics & Data Analysis, Elsevier, vol. 52(12), pages 5344-5355, August.
    5. José M. R. Murteira & Joaquim J. S. Ramalho, 2016. "Regression Analysis of Multivariate Fractional Data," Econometric Reviews, Taylor & Francis Journals, vol. 35(4), pages 515-552, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tsagris, Michail & Preston, Simon & T.A. Wood, Andrew, 2016. "Improved classi cation for compositional data using the $\alpha$-transformation," MPRA Paper 67657, University Library of Munich, Germany.
    2. Michail Tsagris & Simon Preston & Andrew T. A. Wood, 2016. "Improved Classification for Compositional Data Using the α-transformation," Journal of Classification, Springer;The Classification Society, vol. 33(2), pages 243-261, July.
    3. Tsagris, Michail, 2014. "The k-NN algorithm for compositional data: a revised approach with and without zero values present," MPRA Paper 65866, University Library of Munich, Germany.
    4. Tsagris, Michail, 2015. "Regression analysis with compositional data containing zero values," MPRA Paper 67868, University Library of Munich, Germany.
    5. Jiajia Chen & Xiaoqin Zhang & Shengjia Li, 2017. "Multiple linear regression with compositional response and covariates," Journal of Applied Statistics, Taylor & Francis Journals, vol. 44(12), pages 2270-2285, September.
    6. Napoleón Vargas Jurado & Kent M. Eskridge & Stephen D. Kachman & Ronald M. Lewis, 2018. "Using a Bayesian Hierarchical Linear Mixing Model to Estimate Botanical Mixtures," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 23(2), pages 190-207, June.
    7. Yuquan W. Zhang & Jianhong E. Mu & Mark Musumba & Bruce A. McCarl & Xiaokun Gu & Yuanfei Zhou & Zhengwei Cao & Qiang Li, 2018. "The Role of Climate Factors in Shaping China’s Crop Mix: An Empirical Exploration," Sustainability, MDPI, vol. 10(10), pages 1-17, October.
    8. Melo, Tatiane F.N. & Vasconcellos, Klaus L.P. & Lemonte, Artur J., 2009. "Some restriction tests in a new class of regression models for proportions," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 3972-3979, October.
    9. Kateri, Maria & Nikolov, Nikolay I., 2022. "A generalized Mallows model based on ϕ-divergence measures," Journal of Multivariate Analysis, Elsevier, vol. 190(C).
    10. Monastyrenko, Evgenii, 2017. "Eco-efficiency outcomes of mergers and acquisitions in the European electricity industry," Energy Policy, Elsevier, vol. 107(C), pages 258-277.
    11. Morais, Joanna & Simioni, Michel & Thomas-Agnan, Christine, 2016. "A tour of regression models for explaining shares," TSE Working Papers 16-742, Toulouse School of Economics (TSE).
    12. Gerrath, Maximilian H.E.E. & Mafael, Alexander & Ulqinaku, Aulona & Biraglia, Alessandro, 2023. "Service failures in times of crisis: An analysis of eWOM emotionality," Journal of Business Research, Elsevier, vol. 154(C).
    13. Pokhrel, Bijay & Krishna, Paudel & Eduardo, Segarra, 2016. "Factors Affecting the Choice, Intensity, and Allocation of Irrigation Technologies by U.S. Cotton Farmers," 2016 Annual Meeting, February 6-9, 2016, San Antonio, Texas 230199, Southern Agricultural Economics Association.
    14. Sagron, Ruth & Pugatch, Rami, 2021. "Universal distribution of batch completion times and time-cost tradeoff in a production line with arbitrary buffer size," European Journal of Operational Research, Elsevier, vol. 293(3), pages 980-989.
    15. Pandit, Mahesh & Paudel, Krishna P. & Hinson, Roger A., 2013. "Market Channel Analysis of Ornamental Plants using Clustering Procedures," 2013 Annual Meeting, February 2-5, 2013, Orlando, Florida 143067, Southern Agricultural Economics Association.
    16. Angelo Moretti, 2023. "Estimation of small area proportions under a bivariate logistic mixed model," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(4), pages 3663-3684, August.
    17. Takahiro Yoshida & Morito Tsutsumi, 2018. "On the effects of spatial relationships in spatial compositional multivariate models," Letters in Spatial and Resource Sciences, Springer, vol. 11(1), pages 57-70, March.
    18. Jongwoo Kim & Nicole M. Mason & David Mather & Felicia Wu, 2021. "The effects of the national agricultural input voucher scheme (NAIVS) on sustainable intensification of maize production in Tanzania," Journal of Agricultural Economics, Wiley Blackwell, vol. 72(3), pages 857-877, September.
    19. Kristoffer H. Hellton, 2023. "Penalized angular regression for personalized predictions," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 50(1), pages 184-212, March.
    20. Papastamoulis Panagiotis & Rattray Magnus, 2017. "Bayesian estimation of differential transcript usage from RNA-seq data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 16(5-6), pages 367-386, December.

    More about this item

    Keywords

    compositional data; Jensen-Shannon divergence; regression; zero values; φ-divergence;
    All these keywords.

    JEL classification:

    • C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pra:mprapa:72769. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Joachim Winter (email available below). General contact details of provider: https://edirc.repec.org/data/vfmunde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.