IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v78y2022i3p974-987.html
   My bibliography  Save this article

A transformation‐free linear regression for compositional outcomes and predictors

Author

Listed:
  • Jacob Fiksel
  • Scott Zeger
  • Abhirup Datta

Abstract

Compositional data are common in many fields, both as outcomes and predictor variables. The inventory of models for the case when both the outcome and predictor variables are compositional is limited, and the existing models are often difficult to interpret in the compositional space, due to their use of complex log‐ratio transformations. We develop a transformation‐free linear regression model where the expected value of the compositional outcome is expressed as a single Markov transition from the compositional predictor. Our approach is based on estimating equations thereby not requiring complete specification of data likelihood and is robust to different data‐generating mechanisms. Our model is simple to interpret, allows for 0s and 1s in both the compositional outcome and covariates, and subsumes several interesting subcases of interest. We also develop permutation tests for linear independence and equality of effect sizes of two components of the predictor. Finally, we show that despite its simplicity, our model accurately captures the relationship between compositional data using two datasets from education and medical research.

Suggested Citation

  • Jacob Fiksel & Scott Zeger & Abhirup Datta, 2022. "A transformation‐free linear regression for compositional outcomes and predictors," Biometrics, The International Biometric Society, vol. 78(3), pages 974-987, September.
  • Handle: RePEc:bla:biomet:v:78:y:2022:i:3:p:974-987
    DOI: 10.1111/biom.13465
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.13465
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.13465?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Gourieroux, Christian & Monfort, Alain & Trognon, Alain, 1984. "Pseudo Maximum Likelihood Methods: Theory," Econometrica, Econometric Society, vol. 52(3), pages 681-700, May.
    2. Gourieroux, Christian & Monfort, Alain & Trognon, Alain, 1984. "Pseudo Maximum Likelihood Methods: Applications to Poisson Models," Econometrica, Econometric Society, vol. 52(3), pages 701-720, May.
    3. Wei Lin & Pixu Shi & Rui Feng & Hongzhe Li, 2014. "Variable selection in regression with compositional covariates," Biometrika, Biometrika Trust, vol. 101(4), pages 785-797.
    4. José M. R. Murteira & Joaquim J. S. Ramalho, 2016. "Regression Analysis of Multivariate Fractional Data," Econometric Reviews, Taylor & Francis Journals, vol. 35(4), pages 515-552, April.
    5. Joanna Morais & Christine Thomas-Agnan & Michel Simioni, 2017. "Interpretation of explanatory variables impacts in compositional regression models," Working Papers hal-01563362, HAL.
    6. T. H. A. Nguyen & T. Laurent & C. Thomas-Agnan & A. Ruiz-Gazen, 2022. "Analyzing the impacts of socio-economic factors on French departmental elections with CoDa methods," Journal of Applied Statistics, Taylor & Francis Journals, vol. 49(5), pages 1235-1251, April.
    7. Billheimer D. & Guttorp P. & Fagan W.F., 2001. "Statistical Interpretation of Species Composition," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1205-1214, December.
    8. Papke, Leslie E & Wooldridge, Jeffrey M, 1996. "Econometric Methods for Fractional Response Variables with an Application to 401(K) Plan Participation Rates," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 11(6), pages 619-632, Nov.-Dec..
    9. Tsagris, Michail, 2015. "Regression analysis with compositional data containing zero values," MPRA Paper 67868, University Library of Munich, Germany.
    10. Mr. Matthew T Jones, 2005. "Estimating Markov Transition Matrices Using Proportions Data: An Application to Credit Risk," IMF Working Papers 2005/219, International Monetary Fund.
    11. MacRae, Elizabeth Chase, 1977. "Estimation of Time-Varying Markov Processes with Aggregate Data," Econometrica, Econometric Society, vol. 45(1), pages 183-198, January.
    12. Jiajia Chen & Xiaoqin Zhang & Shengjia Li, 2017. "Multiple linear regression with compositional response and covariates," Journal of Applied Statistics, Taylor & Francis Journals, vol. 44(12), pages 2270-2285, September.
    13. K. Hron & P. Filzmoser & K. Thompson, 2012. "Linear regression with compositional explanatory variables," Journal of Applied Statistics, Taylor & Francis Journals, vol. 39(5), pages 1115-1128, November.
    14. Adam Butler & Chris Glasbey, 2008. "A latent Gaussian model for compositional data with zeros," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 57(5), pages 505-520, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Montoya-Blandón, Santiago & Jacho-Chávez, David T., 2020. "Semiparametric quasi maximum likelihood estimation of the fractional response model," Economics Letters, Elsevier, vol. 186(C).
    2. John Mullahy, 2010. "Multivariate Fractional Regression Estimation of Econometric Share Models," NBER Working Papers 16354, National Bureau of Economic Research, Inc.
    3. Juan José Egozcue & Vera Pawlowsky-Glahn, 2019. "Compositional data: the sample space and its structure," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(3), pages 599-638, September.
    4. Giuliani, Elisa & Martinelli, Arianna & Rabellotti, Roberta, 2016. "Is Co-Invention Expediting Technological Catch Up? A Study of Collaboration between Emerging Country Firms and EU Inventors," World Development, Elsevier, vol. 77(C), pages 192-205.
    5. de Rassenfosse, Gaétan & Schoen, Anja & Wastyn, Annelies, 2014. "Selection bias in innovation studies: A simple test," Technological Forecasting and Social Change, Elsevier, vol. 81(C), pages 287-299.
    6. Harald Oberhofer & Michael Pfaffermayr, 2014. "Two-Part Models for Fractional Responses Defined as Ratios of Integers," Econometrics, MDPI, vol. 2(3), pages 1-22, September.
    7. Vânia G. Silva & Esmeralda A. Ramalho & Carlos R. Vieira, 2017. "The Use of Cheques in the European Union: A Cross-Country Analysis," Open Economies Review, Springer, vol. 28(3), pages 581-602, July.
    8. Song, Jingyu & Delgado, Michael & Preckel, Paul & Villoria, Nelson, 2016. "Pixel Level Cropland Allocation and Marginal Impacts of Biophysical Factors," 2016 Annual Meeting, July 31-August 2, Boston, Massachusetts 235327, Agricultural and Applied Economics Association.
    9. Christelis, Dimitris & Georgarakos, Dimitris & Jappelli, Tullio, 2015. "Wealth shocks, unemployment shocks and consumption in the wake of the Great Recession," Journal of Monetary Economics, Elsevier, vol. 72(C), pages 21-41.
    10. Wooldridge, Jeffrey M., 2020. "On the consistency of the logistic quasi-MLE under conditional symmetry," Economics Letters, Elsevier, vol. 194(C).
    11. Paulo Bastos & Manuel Cabral, 2007. "The Dynamics of International Trade Patterns," Review of World Economics (Weltwirtschaftliches Archiv), Springer;Institut für Weltwirtschaft (Kiel Institute for the World Economy), vol. 143(3), pages 391-415, October.
    12. Joaquim J.S. Ramalho & Jacinto Vidigal da Silva, 2009. "A two-part fractional regression model for the financial leverage decisions of micro, small, medium and large firms," Quantitative Finance, Taylor & Francis Journals, vol. 9(5), pages 621-636.
    13. Reboul, E. & Guérin, I. & Nordman, C.J., 2021. "The gender of debt and credit: Insights from rural Tamil Nadu," World Development, Elsevier, vol. 142(C).
    14. Callado Muñoz, Francisco Jose & González Chapela, Jorge & Utrero González, Natalia, 2014. "Analysis of deviance in household financial portfolio choice: evidence from Spain," MPRA Paper 57497, University Library of Munich, Germany.
    15. Schwiebert, Jörg & Wagner, Joachim, 2015. "A Generalized Two-Part Model for Fractional Response Variables with Excess Zeros," VfS Annual Conference 2015 (Muenster): Economic Development - Theory and Policy 113059, Verein für Socialpolitik / German Economic Association.
    16. Thomas-Agnan, Christine & Morais, Joanna, 2019. "Covariates impacts in compositional models and simplicial derivatives," TSE Working Papers 19-1057, Toulouse School of Economics (TSE).
    17. Morton, Rebecca B. & Muller, Daniel & Page, Lionel & Torgler, Benno, 2015. "Exit polls, turnout, and bandwagon voting: Evidence from a natural experiment," European Economic Review, Elsevier, vol. 77(C), pages 65-81.
    18. Santos Silva, J.M.C. & Tenreyro, Silvana & Wei, Kehai, 2014. "Estimating the extensive margin of trade," Journal of International Economics, Elsevier, vol. 93(1), pages 67-75.
    19. José M. R. Murteira & Joaquim J. S. Ramalho, 2016. "Regression Analysis of Multivariate Fractional Data," Econometric Reviews, Taylor & Francis Journals, vol. 35(4), pages 515-552, April.
    20. Xinde Ji & Kelly M. Cobourn, 2018. "The Economic Benefits of Irrigation Districts under Prior Appropriation Doctrine: An Econometric Analysis of Agricultural Land‐Allocation Decisions," Canadian Journal of Agricultural Economics/Revue canadienne d'agroeconomie, Canadian Agricultural Economics Society/Societe canadienne d'agroeconomie, vol. 66(3), pages 441-467, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:78:y:2022:i:3:p:974-987. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.