IDEAS home Printed from https://ideas.repec.org/a/spr/alstar/v106y2022i3d10.1007_s10182-021-00432-6.html
   My bibliography  Save this article

Flexible models for non-equidispersed count data: comparative performance of parametric models to deal with underdispersion

Author

Listed:
  • Douglas Toledo

    (Universidade de São Paulo)

  • Cristiane Akemi Umetsu

    (Universidade Estadual Paulista)

  • Antonio Fernando Monteiro Camargo

    (Universidade Estadual Paulista
    Universidade Estadual Paulista)

  • Idemauro Antonio Rodrigues Lara

    (Universidade de São Paulo
    Universidade de São Paulo)

Abstract

Count data as response variables are commonly modeled using Poisson regression models, which require equidispersion, i.e., equal mean and variance. However, this relationship does not always occur, and the variance may be higher or lower than the mean, phenomena are known as overdispersion and underdispersion, respectively. Non-equidispersion, when disregarded, can lead to a number of misinterpretations and inadequate predictions. Here, we compare the use of the COM-Poisson, double Poisson, Gamma-count, and restricted generalized Poisson models as a more flexible class for count problems associated with over- and underdispersion, since they have an additional parameter that allows more flexible analysis. The proposed method is useful in different applications, but here we provide an example using an underdispersed dataset concerning ecological invasion. For validation of the models, we use half-normal plots. The COM-Poisson, double Poisson, and Gamma-count performed best and properly modeled the underdispersion. The use of correct statistical models is recommended to handle this data property using objective criteria to ensure accurate statistical inferences.

Suggested Citation

  • Douglas Toledo & Cristiane Akemi Umetsu & Antonio Fernando Monteiro Camargo & Idemauro Antonio Rodrigues Lara, 2022. "Flexible models for non-equidispersed count data: comparative performance of parametric models to deal with underdispersion," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 106(3), pages 473-497, September.
  • Handle: RePEc:spr:alstar:v:106:y:2022:i:3:d:10.1007_s10182-021-00432-6
    DOI: 10.1007/s10182-021-00432-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10182-021-00432-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10182-021-00432-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Michel Loreau & Andy Hector, 2001. "Partitioning selection and complementarity in biodiversity experiments," Nature, Nature, vol. 412(6842), pages 72-76, July.
    2. Winkelmann, Rainer, 1995. "Duration Dependence and Dispersion in Count-Data Models," Journal of Business & Economic Statistics, American Statistical Association, vol. 13(4), pages 467-474, October.
    3. Galit Shmueli & Thomas P. Minka & Joseph B. Kadane & Sharad Borle & Peter Boatwright, 2005. "A useful distribution for fitting discrete data: revival of the Conway–Maxwell–Poisson distribution," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 54(1), pages 127-142, January.
    4. Walmes Marques Zeviani & Paulo Justiniano Ribeiro & Wagner Hugo Bonat & Silvia Emiko Shimakura & Joel Augusto Muniz, 2014. "The Gamma-count distribution in the analysis of experimental underdispersed data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(12), pages 2616-2626, December.
    5. R. W. Conway & W. L. Maxwell, 1962. "Network Dispatching by the Shortest-Operation Discipline," Operations Research, INFORMS, vol. 10(1), pages 51-73, February.
    6. Kimberly F. Sellers & Darcy S. Morris, 2017. "Underdispersion models: Models that are “under the radar”," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 46(24), pages 12075-12086, December.
    7. Russell B. Millar, 2009. "Comparison of Hierarchical Bayesian Models for Overdispersed Count Data using DIC and Bayes' Factors," Biometrics, The International Biometric Society, vol. 65(3), pages 962-969, September.
    8. Rafael A. Moral & John Hinde & Clarice G. B. Demétrio & Carolina Reigada & Wesley A. C. Godoy, 2018. "Models for Jointly Estimating Abundances of Two Unmarked Site-Associated Species Subject to Imperfect Detection," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 23(1), pages 20-38, March.
    9. Michel Loreau & Andy Hector, 2001. "Erratum: Partitioning selection and complementarity in biodiversity experiments," Nature, Nature, vol. 413(6855), pages 548-548, October.
    10. Kimberly F. Sellers & Sharad Borle & Galit Shmueli, 2012. "The COM‐Poisson model for count data: a survey of methods and applications," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 28(2), pages 104-116, March.
    11. Rainer Winkelmann, 2008. "Econometric Analysis of Count Data," Springer Books, Springer, edition 0, number 978-3-540-78389-3, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Darcy Steeg Morris & Kimberly F. Sellers, 2022. "A Flexible Mixed Model for Clustered Count Data," Stats, MDPI, vol. 5(1), pages 1-18, January.
    2. Seng Huat Ong & Shin Zhu Sim & Shuangzhe Liu & Hari M. Srivastava, 2023. "A Family of Finite Mixture Distributions for Modelling Dispersion in Count Data," Stats, MDPI, vol. 6(3), pages 1-14, September.
    3. Sáez-Castillo, A.J. & Conde-Sánchez, A., 2013. "A hyper-Poisson regression model for overdispersed and underdispersed count data," Computational Statistics & Data Analysis, Elsevier, vol. 61(C), pages 148-157.
    4. Xun-Jian Li & Guo-Liang Tian & Mingqian Zhang & George To Sum Ho & Shuang Li, 2023. "Modeling Under-Dispersed Count Data by the Generalized Poisson Distribution via Two New MM Algorithms," Mathematics, MDPI, vol. 11(6), pages 1-24, March.
    5. Gabriela Woźniak & Monika Malicka & Jacek Kasztowski & Łukasz Radosz & Joanna Czarnecka & Jaco Vangronsveld & Dariusz Prostański, 2022. "How Important Are the Relations between Vegetation Diversity and Bacterial Functional Diversity for the Functioning of Novel Ecosystems?," Sustainability, MDPI, vol. 15(1), pages 1-16, December.
    6. Chun-Wei Chang & Takeshi Miki & Hao Ye & Sami Souissi & Rita Adrian & Orlane Anneville & Helen Agasild & Syuhei Ban & Yaron Be’eri-Shlevin & Yin-Ru Chiang & Heidrun Feuchtmayr & Gideon Gal & Satoshi I, 2022. "Causal networks of phytoplankton diversity and biomass are modulated by environmental context," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    7. Guangzhou Wang & Haley M. Burrill & Laura Y. Podzikowski & Maarten B. Eppinga & Fusuo Zhang & Junling Zhang & Peggy A. Schultz & James D. Bever, 2023. "Dilution of specialist pathogens drives productivity benefits from diversity in plant mixtures," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    8. Rainer Winkelmann, 2015. "Counting on count data models," IZA World of Labor, Institute of Labor Economics (IZA), pages 148-148, May.
    9. Yuxin Liu & Chenjing Fan & Dongdong Xue, 2024. "A Review of the Effects of Urban and Green Space Forms on the Carbon Budget Using a Landscape Sustainability Framework," Sustainability, MDPI, vol. 16(5), pages 1-29, February.
    10. Bilal Barakat, 2017. "Generalised count distributions for modelling parity," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 36(26), pages 745-758.
    11. Jonathan S. Lefcheck & Graham J. Edgar & Rick D. Stuart-Smith & Amanda E. Bates & Conor Waldock & Simon J. Brandl & Stuart Kininmonth & Scott D. Ling & J. Emmett Duffy & Douglas B. Rasher & Aneil F. A, 2021. "Species richness and identity both determine the biomass of global reef fish communities," Nature Communications, Nature, vol. 12(1), pages 1-9, December.
    12. D. G. Kapayou & E. M. Herrighty & C. Gish Hill & V. Cano Camacho & A. Nair & D. M. Winham & M. D. McDaniel, 2023. "Reuniting the Three Sisters: collaborative science with Native growers to improve soil and community health," Agriculture and Human Values, Springer;The Agriculture, Food, & Human Values Society (AFHVS), vol. 40(1), pages 65-82, March.
    13. Marcelo Bourguignon & Diego I. Gallardo & Rodrigo M. R. Medeiros, 2022. "A simple and useful regression model for underdispersed count data based on Bernoulli–Poisson convolution," Statistical Papers, Springer, vol. 63(3), pages 821-848, June.
    14. Adeniyi, Isaac Adeola, 2020. "Bayesian Generalized Linear Mixed Effects Models Using Normal-Independent Distributions: Formulation and Applications," MPRA Paper 99165, University Library of Munich, Germany.
    15. Barbara Emmenegger & Julien Massoni & Christine M. Pestalozzi & Miriam Bortfeld-Miller & Benjamin A. Maier & Julia A. Vorholt, 2023. "Identifying microbiota community patterns important for plant protection using synthetic communities and machine learning," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    16. Liting Zheng & Kathryn E. Barry & Nathaly R. Guerrero-Ramírez & Dylan Craven & Peter B. Reich & Kris Verheyen & Michael Scherer-Lorenzen & Nico Eisenhauer & Nadia Barsoum & Jürgen Bauhus & Helge Bruel, 2024. "Effects of plant diversity on productivity strengthen over time due to trait-dependent shifts in species overyielding," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    17. György Barabás & Christine Parent & Andrew Kraemer & Frederik Perre & Frederik Laender, 2022. "The evolution of trait variance creates a tension between species diversity and functional diversity," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    18. Morris, Darcy Steeg & Raim, Andrew M. & Sellers, Kimberly F., 2020. "A Conway–Maxwell-multinomial distribution for flexible modeling of clustered categorical data," Journal of Multivariate Analysis, Elsevier, vol. 179(C).
    19. Gregori Baetschmann & Rainer Winkelmann, 2014. "A dynamic hurdle model for zero-inflated count data: with an application to health care utilization," ECON - Working Papers 151, Department of Economics - University of Zurich.
    20. Célestin C. Kokonendji & Sobom M. Somé & Youssef Esstafa & Marcelo Bourguignon, 2023. "On Underdispersed Count Kernels for Smoothing Probability Mass Functions," Stats, MDPI, vol. 6(4), pages 1-15, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:alstar:v:106:y:2022:i:3:d:10.1007_s10182-021-00432-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.