IDEAS home Printed from https://ideas.repec.org/p/crt/wpaper/2603.html

The α–regression for compositional data: a unified framework for standard, spatially-lagged, spatial autoregressive and geographically-weighted regression models

Author

Listed:
  • Michail Tsagris

  • Yannis Pantazis

Abstract

Compositional data–vectors of non-negative components summing to unity–frequently arise in scientific applications where covariates influence the relative proportions of components, yet traditional regression approaches face challenges regarding the unit-sum constraint and zero values. This paper revisits the α–regression framework, which uses a flexible power transformation parameterized by α to interpolate between raw data analysis and log-ratio methods, naturally handling zeros without imputation while allowing data-driven transformation selection. We formulate α–regression as a non-linear least squares problem, study its asymptotic properties, provide efficient estimation via the Levenberg-Marquardt algorithm, and derive marginal effects for interpretation.

Suggested Citation

  • Michail Tsagris & Yannis Pantazis, 2026. "The α–regression for compositional data: a unified framework for standard, spatially-lagged, spatial autoregressive and geographically-weighted regression models," Working Papers 2603, University of Crete, Department of Economics.
  • Handle: RePEc:crt:wpaper:2603
    as

    Download full text from publisher

    File URL: https://economics.soc.uoc.gr/wpa/docs/2603.pdf
    File Function: First version
    Download Restriction: No
    ---><---

    References listed on IDEAS

    as
    1. Thi Huong An Nguyen & Christine Thomas-Agnan & Thibault Laurent & Anne Ruiz-Gazen, 2021. "A simultaneous spatial autoregressive model for compositional data," Spatial Economic Analysis, Taylor & Francis Journals, vol. 16(2), pages 161-175, April.
    2. Tsagris, Michail & Preston, Simon & T.A. Wood, Andrew, 2016. "Improved classi cation for compositional data using the $\alpha$-transformation," MPRA Paper 67657, University Library of Munich, Germany.
    3. José M. R. Murteira & Joaquim J. S. Ramalho, 2016. "Regression Analysis of Multivariate Fractional Data," Econometric Reviews, Taylor & Francis Journals, vol. 35(4), pages 515-552, April.
    4. Christine Thomas-Agnan & Thibault Laurent & Anne Ruiz-Gazen & Thi Huong An Nguyen & Raja Chakir & Anna Lungarska, 2021. "Spatial Simultaneous Autoregressive Models for Compositional Data: Application to Land Use," Springer Books, in: Peter Filzmoser & Karel Hron & Josep Antoni Martín-Fernández & Javier Palarea-Albaladejo (ed.), Advances in Compositional Data Analysis, pages 225-249, Springer.
    5. T. Tsagris, Michail & Preston, Simon & T.A. Wood, Andrew, 2011. "A data-based power transformation for compositional data," MPRA Paper 53068, University Library of Munich, Germany.
    6. Michail Tsagris & Simon Preston & Andrew T. A. Wood, 2016. "Improved Classification for Compositional Data Using the α-transformation," Journal of Classification, Springer;The Classification Society, vol. 33(2), pages 243-261, July.
    7. J. L. Scealy & A. H. Welsh, 2011. "Regression for compositional data by using distributions defined on the hypersphere," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(3), pages 351-375, June.
    8. Melo, Tatiane F.N. & Vasconcellos, Klaus L.P. & Lemonte, Artur J., 2009. "Some restriction tests in a new class of regression models for proportions," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 3972-3979, October.
    9. Gueorguieva, Ralitza & Rosenheck, Robert & Zelterman, Daniel, 2008. "Dirichlet component regression and its applications to psychiatric data," Computational Statistics & Data Analysis, Elsevier, vol. 52(12), pages 5344-5355, August.
    10. Malini Iyengar & Dipak Dey, 2002. "A semiparametric model for compositional data analysis in presence of covariates on the simplex," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 11(2), pages 303-315, December.
    11. Katz, Jonathan N. & King, Gary, 1999. "A Statistical Model for Multiparty Electoral Data," American Political Science Review, Cambridge University Press, vol. 93(1), pages 15-32, March.
    12. Joanna Morais & Christine Thomas-Agnan & Michel Simioni, 2018. "Using compositional and Dirichlet models for market share regression," Journal of Applied Statistics, Taylor & Francis Journals, vol. 45(9), pages 1670-1689, July.
    13. Tsagris, Michail, 2015. "Regression analysis with compositional data containing zero values," MPRA Paper 67868, University Library of Munich, Germany.
    14. Tsagris, Michail, 2015. "A novel, divergence based, regression for compositional data," MPRA Paper 72769, University Library of Munich, Germany.
    15. Michel Goulard & Thibault Laurent & Christine Thomas-Agnan, 2017. "About predictions in spatial autoregressive models: optimal and almost optimal strategies," Spatial Economic Analysis, Taylor & Francis Journals, vol. 12(2-3), pages 304-325, July.
    16. Fan Xia & Jun Chen & Wing Kam Fung & Hongzhe Li, 2013. "A Logistic Normal Multinomial Regression Model for Microbiome Compositional Data Analysis," Biometrics, The International Biometric Society, vol. 69(4), pages 1053-1063, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matthias Eckardt & Philipp Otto, 2025. "Spatiotemporal Autoregressive Models for Areal Compositional Data," Papers 2507.14389, arXiv.org, revised Mar 2026.
    2. Tsagris, Michail & Preston, Simon & T.A. Wood, Andrew, 2016. "Improved classi cation for compositional data using the $\alpha$-transformation," MPRA Paper 67657, University Library of Munich, Germany.
    3. Michail Tsagris & Simon Preston & Andrew T. A. Wood, 2016. "Improved Classification for Compositional Data Using the α-transformation," Journal of Classification, Springer;The Classification Society, vol. 33(2), pages 243-261, July.
    4. Tsagris, Michail, 2015. "A novel, divergence based, regression for compositional data," MPRA Paper 72769, University Library of Munich, Germany.
    5. Antoine Burg & Christophe Dutang, 2026. "Closed-form estimators for multivariate regressions models -a single categorical variable approach," Post-Print hal-05539060, HAL.
    6. Antoine Burg & Christophe Dutang, 2026. "Closed-form estimators for multivariate regressions models: a single categorical variable approach," Computational Statistics, Springer, vol. 41(3), pages 1-23, April.
    7. Tsagris, Michail, 2015. "Regression analysis with compositional data containing zero values," MPRA Paper 67868, University Library of Munich, Germany.
    8. Yannis Pantazis & Michail Tsagris & Andrew T. A. Wood, 2019. "Gaussian Asymptotic Limits for the α-transformation in the Analysis of Compositional Data," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 81(1), pages 63-82, February.
    9. T. H. A. Nguyen & T. Laurent & C. Thomas-Agnan & A. Ruiz-Gazen, 2022. "Analyzing the impacts of socio-economic factors on French departmental elections with CoDa methods," Journal of Applied Statistics, Taylor & Francis Journals, vol. 49(5), pages 1235-1251, April.
    10. Jacob Fiksel & Scott Zeger & Abhirup Datta, 2022. "A transformation‐free linear regression for compositional outcomes and predictors," Biometrics, The International Biometric Society, vol. 78(3), pages 974-987, September.
    11. Takahiro Yoshida & Morito Tsutsumi, 2018. "On the effects of spatial relationships in spatial compositional multivariate models," Letters in Spatial and Resource Sciences, Springer, vol. 11(1), pages 57-70, March.
    12. Dargel, Lukas & Thomas-Agnan, Christine, 2023. "Share-ratio interpretations of compositional regression models," TSE Working Papers 23-1456, Toulouse School of Economics (TSE), revised 20 Sep 2023.
    13. Juan José Egozcue & Vera Pawlowsky-Glahn, 2019. "Compositional data: the sample space and its structure," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(3), pages 599-638, September.
    14. Wang, Dieter & Andrée, Bo Pieter Johannes & Chamorro, Andres Fernando & Spencer, Phoebe Girouard, 2022. "Transitions into and out of food insecurity: A probabilistic approach with panel data evidence from 15 countries," World Development, Elsevier, vol. 159(C).
    15. Wang,Dieter & Andree,Bo Pieter Johannes & Chamorro Elizondo,Andres Fernando & Spencer,Phoebe Girouard, 2020. "Stochastic Modeling of Food Insecurity," Policy Research Working Paper Series 9413, The World Bank.
    16. Thibault Laurent & Christine Thomas-Agnan & Anne Ruiz-Gazen, 2023. "Covariates impacts in spatial autoregressive models for compositional data," Journal of Spatial Econometrics, Springer, vol. 4(1), pages 1-23, December.
    17. Dargel, Lukas & Thomas-Agnan, Christine, 2024. "Pairwise share ratio interpretations of compositional regression models," Computational Statistics & Data Analysis, Elsevier, vol. 195(C).
    18. Jiajia Chen & Xiaoqin Zhang & Shengjia Li, 2017. "Multiple linear regression with compositional response and covariates," Journal of Applied Statistics, Taylor & Francis Journals, vol. 44(12), pages 2270-2285, September.
    19. Julia Cage & Edgard Dewitte, 2021. "It Takes Money to Make MPs: Evidence from 150 Years of British Campaign Spending," Working Papers hal-03384143, HAL.
    20. Arzheimer, Kai & Evans, Jocelyn, 2010. "Bread and butter à la française: Multiparty forecasts of the French legislative vote (1981-2007)," International Journal of Forecasting, Elsevier, vol. 26(1), pages 19-31, January.

    More about this item

    Keywords

    ;
    ;
    ;

    JEL classification:

    • C21 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models
    • C31 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models; Quantile Regressions; Social Interaction Models
    • C51 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Construction and Estimation
    • R15 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - General Regional Economics - - - Econometric and Input-Output Models; Other Methods

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:crt:wpaper:2603. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Kostis Pigounakis (email available below). General contact details of provider: https://edirc.repec.org/data/deuchgr.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.