IDEAS home Printed from https://ideas.repec.org/a/eee/ecomod/v407y2019ic6.html
   My bibliography  Save this article

K-aggregated transformation of discrete distributions improves modeling count data with excess ones

Author

Listed:
  • Zhou, Can
  • Jiao, Yan
  • Browder, Joan

Abstract

The excess one pattern in count data has been documented in ecology but it has not been explicitly modeled or examined. In this study, we introduce a k-aggregated transformation of discrete distributions to better model count data with excess ones in a Bayesian generalized linear model framework and demonstrate its use with two groups of case studies (group 1: seabird bycatch in longline fisheries and Legionnaires disease incidence; group 2: survey abundance of Leadbeater’s possum and Frigatebird nesting sites). Group 1 examples have a clear excess one data pattern, and these examples are used to demonstrate the concept of the k-aggregation technique. On the other hand, group 2 examples lack a clear excess one pattern, and a modeler may not be motivated enough to use the k-aggregation technique in these cases. Nonetheless, k-aggregated transformation demonstrated better performance for both groups of examples. In all our case studies, the excess zero pattern co-occurred with an excess one pattern, and the excess zeros were modeled thorough either a zero-inflated or hurdle configuration. The better performance of k-aggregated distributions is due to their flexibility of adapting to the relatively high frequency of singletons in the data sets. This new technique has broad applicability and utility in improving modeling count data with potential excess ones.

Suggested Citation

  • Zhou, Can & Jiao, Yan & Browder, Joan, 2019. "K-aggregated transformation of discrete distributions improves modeling count data with excess ones," Ecological Modelling, Elsevier, vol. 407(C), pages 1-1.
  • Handle: RePEc:eee:ecomod:v:407:y:2019:i:c:6
    DOI: 10.1016/j.ecolmodel.2019.108726
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304380019302261
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ecolmodel.2019.108726?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hai-yan Xu & Min Xie & Thong Ngee Goh, 2014. "Objective Bayes analysis of zero-inflated Poisson distribution with application to healthcare data," IISE Transactions, Taylor & Francis Journals, vol. 46(8), pages 843-852, August.
    2. Seth D. Guikema & Jeremy P. Goffelt, 2008. "A Flexible Count Data Regression Model for Risk Analysis," Risk Analysis, John Wiley & Sons, vol. 28(1), pages 213-223, February.
    3. Galit Shmueli & Thomas P. Minka & Joseph B. Kadane & Sharad Borle & Peter Boatwright, 2005. "A useful distribution for fitting discrete data: revival of the Conway–Maxwell–Poisson distribution," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 54(1), pages 127-142, January.
    4. David J. Spiegelhalter & Nicola G. Best & Bradley P. Carlin & Angelika Van Der Linde, 2002. "Bayesian measures of model complexity and fit," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(4), pages 583-639, October.
    5. Li, Yan & Jiao, Yan, 2013. "Modeling seabird bycatch in the U.S. Atlantic pelagic longline fishery: Fixed year effect versus random year effect," Ecological Modelling, Elsevier, vol. 260(C), pages 36-41.
    6. Yincai Tang & Wenchen Liu & Ancha Xu, 2017. "Statistical inference for zero-and-one-inflated poisson models," Statistical Theory and Related Fields, Taylor & Francis Journals, vol. 1(2), pages 216-226, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Can Zhou & Yan Jiao & Joan Browder, 2019. "How much do we know about seabird bycatch in pelagic longline fisheries? A simulation study on the potential bias caused by the usually unobserved portion of seabird bycatch," PLOS ONE, Public Library of Science, vol. 14(8), pages 1-19, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Can Zhou & Yan Jiao & Joan Browder, 2019. "How much do we know about seabird bycatch in pelagic longline fisheries? A simulation study on the potential bias caused by the usually unobserved portion of seabird bycatch," PLOS ONE, Public Library of Science, vol. 14(8), pages 1-19, August.
    2. Wenchen Liu & Yincai Tang & Ancha Xu, 2021. "Zero-and-one-inflated Poisson regression model," Statistical Papers, Springer, vol. 62(2), pages 915-934, April.
    3. Darcy Steeg Morris & Kimberly F. Sellers, 2022. "A Flexible Mixed Model for Clustered Count Data," Stats, MDPI, vol. 5(1), pages 1-18, January.
    4. Rodrigues, Josemar & Bazán, Jorge L. & Suzuki, Adriano K. & Balakrishnan, Narayanaswamy, 2016. "The Bayesian restricted Conway–Maxwell-Binomial model to control dispersion in count data," Statistics & Probability Letters, Elsevier, vol. 119(C), pages 281-288.
    5. Adeniyi, Isaac Adeola, 2020. "Bayesian Generalized Linear Mixed Effects Models Using Normal-Independent Distributions: Formulation and Applications," MPRA Paper 99165, University Library of Munich, Germany.
    6. Dominique Lord & Srinivas Reddy Geedipally & Seth D. Guikema, 2010. "Extension of the Application of Conway‐Maxwell‐Poisson Models: Analyzing Traffic Crash Data Exhibiting Underdispersion," Risk Analysis, John Wiley & Sons, vol. 30(8), pages 1268-1276, August.
    7. S. Hadi Khazraee & Antonio Jose Sáez‐Castillo & Srinivas Reddy Geedipally & Dominique Lord, 2015. "Application of the Hyper‐Poisson Generalized Linear Model for Analyzing Motor Vehicle Crashes," Risk Analysis, John Wiley & Sons, vol. 35(5), pages 919-930, May.
    8. Kimberly F. Sellers & Tong Li & Yixuan Wu & Narayanaswamy Balakrishnan, 2021. "A Flexible Multivariate Distribution for Correlated Count Data," Stats, MDPI, vol. 4(2), pages 1-19, April.
    9. Royce A. Francis & Srinivas Reddy Geedipally & Seth D. Guikema & Soma Sekhar Dhavala & Dominique Lord & Sarah LaRocca, 2012. "Characterizing the Performance of the Conway‐Maxwell Poisson Generalized Linear Model," Risk Analysis, John Wiley & Sons, vol. 32(1), pages 167-183, January.
    10. Xun-Jian Li & Guo-Liang Tian & Mingqian Zhang & George To Sum Ho & Shuang Li, 2023. "Modeling Under-Dispersed Count Data by the Generalized Poisson Distribution via Two New MM Algorithms," Mathematics, MDPI, vol. 11(6), pages 1-24, March.
    11. Buddhavarapu, Prasad & Bansal, Prateek & Prozzi, Jorge A., 2021. "A new spatial count data model with time-varying parameters," Transportation Research Part B: Methodological, Elsevier, vol. 150(C), pages 566-586.
    12. Mumtaz, Haroon & Theodoridis, Konstantinos, 2017. "Common and country specific economic uncertainty," Journal of International Economics, Elsevier, vol. 105(C), pages 205-216.
    13. Jesse Elliott & Zemin Bai & Shu-Ching Hsieh & Shannon E Kelly & Li Chen & Becky Skidmore & Said Yousef & Carine Zheng & David J Stewart & George A Wells, 2020. "ALK inhibitors for non-small cell lung cancer: A systematic review and network meta-analysis," PLOS ONE, Public Library of Science, vol. 15(2), pages 1-18, February.
    14. Christina Leuker & Thorsten Pachur & Ralph Hertwig & Timothy J. Pleskac, 2019. "Do people exploit risk–reward structures to simplify information processing in risky choice?," Journal of the Economic Science Association, Springer;Economic Science Association, vol. 5(1), pages 76-94, August.
    15. Francois Olivier & Laval Guillaume, 2011. "Deviance Information Criteria for Model Selection in Approximate Bayesian Computation," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-25, July.
    16. Raggi, Davide & Bordignon, Silvano, 2012. "Long memory and nonlinearities in realized volatility: A Markov switching approach," Computational Statistics & Data Analysis, Elsevier, vol. 56(11), pages 3730-3742.
    17. Angelica Gianfreda & Francesco Ravazzolo & Luca Rossini, 2023. "Large Time‐Varying Volatility Models for Hourly Electricity Prices," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 85(3), pages 545-573, June.
    18. Rubio, F.J. & Steel, M.F.J., 2011. "Inference for grouped data with a truncated skew-Laplace distribution," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3218-3231, December.
    19. Alessandri, Piergiorgio & Mumtaz, Haroon, 2019. "Financial regimes and uncertainty shocks," Journal of Monetary Economics, Elsevier, vol. 101(C), pages 31-46.
    20. Padilla, Juan L. & Azevedo, Caio L.N. & Lachos, Victor H., 2018. "Multidimensional multiple group IRT models with skew normal latent trait distributions," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 250-268.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ecomod:v:407:y:2019:i:c:6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/ecological-modelling .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.