IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v53y2008i2p381-393.html

A framework for modelling overdispersed count data, including the Poisson-shifted generalized inverse Gaussian distribution

Author

Listed:
  • Rigby, R.A.
  • Stasinopoulos, D.M.
  • Akantziliotou, C.

Abstract

A variety of methods of modelling overdispersed count data are compared. The methods are classified into three main categories. The first category are ad hoc methods (i.e. pseudo-likelihood, (extended) quasi-likelihood, double exponential family distributions). The second category are discretized continuous distributions and the third category are observational level random effects models (i.e. mixture models comprising explicit and non-explicit continuous mixture models and finite mixture models). The main focus of the paper is a family of mixed Poisson distributions defined so that its mean [mu] is an explicit parameter of the distribution. This allows easier interpretation when [mu] is modelled using explanatory variables and provides a more orthogonal parameterization to ease model fitting. Specific three parameter distributions considered are the Sichel and Delaporte distributions. A new four parameter distribution, the Poisson-shifted generalized inverse Gaussian distribution is introduced, which includes the Sichel and Delaporte distributions as a special and a limiting case respectively. A general formula for the derivative of the likelihood with respect to [mu], applicable to the whole family of mixed Poisson distributions considered, is given. Within the framework introduced here all parameters of the distributions are modelled as parametric and/or nonparametric (smooth) functions of explanatory variables. This provides a very flexible way of modelling count data. Maximum (penalized) likelihood estimation is used to fit the (non)parametric models.

Suggested Citation

  • Rigby, R.A. & Stasinopoulos, D.M. & Akantziliotou, C., 2008. "A framework for modelling overdispersed count data, including the Poisson-shifted generalized inverse Gaussian distribution," Computational Statistics & Data Analysis, Elsevier, vol. 53(2), pages 381-393, December.
  • Handle: RePEc:eee:csdana:v:53:y:2008:i:2:p:381-393
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(08)00386-1
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Murray Aitkin, 1999. "A General Maximum Likelihood Analysis of Variance Components in Generalized Linear Models," Biometrics, The International Biometric Society, vol. 55(1), pages 117-128, March.
    2. R. A. Rigby & D. M. Stasinopoulos, 2005. "Generalized additive models for location, scale and shape," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 54(3), pages 507-554, June.
    3. J. K. Lindsey, 1999. "On the use of corrections for overdispersion," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 48(4), pages 553-561.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Tzougas, George, 2020. "EM estimation for the Poisson-Inverse Gamma regression model with varying dispersion: an application to insurance ratemaking," LSE Research Online Documents on Economics 106539, London School of Economics and Political Science, LSE Library.
    2. Vilhelm Verendel, 2023. "Tracking artificial intelligence in climate inventions with patent data," Nature Climate Change, Nature, vol. 13(1), pages 40-47, January.
    3. Moritz Berger & Gerhard Tutz, 2021. "Transition models for count data: a flexible alternative to fixed distribution models," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(4), pages 1259-1283, October.
    4. Tzougas, George & Hong, Natalia & Ho, Ryan, 2022. "Mixed poisson regression models with varying dispersion arising from non-conjugate mixing distributions," LSE Research Online Documents on Economics 113616, London School of Economics and Political Science, LSE Library.
    5. José Rodríguez-Avi & María José Olmo-Jiménez, 2017. "A regression model for overdispersed data without too many zeros," Statistical Papers, Springer, vol. 58(3), pages 749-773, September.
    6. Marcelo Bourguignon & Diego I. Gallardo & Rodrigo M. R. Medeiros, 2022. "A simple and useful regression model for underdispersed count data based on Bernoulli–Poisson convolution," Statistical Papers, Springer, vol. 63(3), pages 821-848, June.
    7. Rahma Abid & Célestin C. Kokonendji & Afif Masmoudi, 2021. "On Poisson-exponential-Tweedie models for ultra-overdispersed count data," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 105(1), pages 1-23, March.
    8. A. Baccini & L. Barabesi & M. Cioni & C. Pisani, 2014. "Crossing the hurdle: the determinants of individual scientific performance," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(3), pages 2035-2062, December.
    9. Qiang Jia & Yong Zhang & Lei Cao, 2019. "Response of Anatidae Abundance to Environmental Factors in the Middle and Lower Yangtze River Floodplain, China," Sustainability, MDPI, vol. 11(23), pages 1-10, December.
    10. Baíllo, A. & Berrendero, J.R. & Cárcamo, J., 2009. "Tests for zero-inflation and overdispersion: A new approach based on the stochastic convex order," Computational Statistics & Data Analysis, Elsevier, vol. 53(7), pages 2628-2639, May.
    11. George Tzougas & Himchan Jeong, 2021. "An Expectation-Maximization Algorithm for the Exponential-Generalized Inverse Gaussian Regression Model with Varying Dispersion and Shape for Modelling the Aggregate Claim Amount," Risks, MDPI, vol. 9(1), pages 1-17, January.
    12. Willmot, Gordon E. & Woo, Jae-Kyung, 2022. "Remarks on a generalized inverse Gaussian type integral with applications," Applied Mathematics and Computation, Elsevier, vol. 430(C).
    13. Tzougas, George & Vrontos, Spyridon D. & Frangos, Nickolaos E., 2015. "Risk classification for claim counts and losses using regression models for location, scale and shape," LSE Research Online Documents on Economics 70921, London School of Economics and Political Science, LSE Library.
    14. Rodríguez-Avi, J. & Conde-Sánchez, A. & Sáez-Castillo, A.J. & Olmo-Jiménez, M.J. & Martínez-Rodríguez, A.M., 2009. "A generalized Waring regression model for count data," Computational Statistics & Data Analysis, Elsevier, vol. 53(10), pages 3717-3725, August.
    15. Tzougas, George & Jeong, Himchan, 2021. "An expectation-maximization algorithm for the exponential-generalized inverse Gaussian regression model with varying dispersion and shape for modelling the aggregate claim amount," LSE Research Online Documents on Economics 108210, London School of Economics and Political Science, LSE Library.
    16. Tzougas, George & Pignatelli di Cerchiara, Alice, 2021. "The multivariate mixed Negative Binomial regression model with an application to insurance a posteriori ratemaking," Insurance: Mathematics and Economics, Elsevier, vol. 101(PB), pages 602-625.
    17. R. Maya & Jie Huang & M. R. Irshad & Fukang Zhu, 2024. "On Poisson Moment Exponential Distribution with Associated Regression and INAR(1) Process," Annals of Data Science, Springer, vol. 11(5), pages 1741-1759, October.
    18. Liu, Yafen & He, Zhen & Shu, Lianjie & Wu, Zhang, 2009. "Statistical computation and analyses for attribute events," Computational Statistics & Data Analysis, Elsevier, vol. 53(9), pages 3412-3425, July.
    19. Farouk Mselmi, 2022. "Generalized linear model for subordinated Lévy processes," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(2), pages 772-801, June.
    20. Fatemeh Hassanzadeh & Iraj Kazemi, 2017. "Regression modeling of one-inflated positive count data," Statistical Papers, Springer, vol. 58(3), pages 791-809, September.
    21. Ben Omrane, Walid & Heinen, Andréas, 2010. "Public news announcements and quoting activity in the Euro/Dollar foreign exchange market," Computational Statistics & Data Analysis, Elsevier, vol. 54(11), pages 2419-2431, November.
    22. Cordeiro, Gauss M. & Andrade, Marinho G. & de Castro, Mário, 2009. "Power series generalized nonlinear models," Computational Statistics & Data Analysis, Elsevier, vol. 53(4), pages 1155-1166, February.
    23. Wen, Li & Rogers, Kerrylee & Saintilan, Neil & Ling, Joanne, 2011. "The influences of climate and hydrology on population dynamics of waterbirds in the lower Murrumbidgee River floodplains in Southeast Australia: Implications for environmental water management," Ecological Modelling, Elsevier, vol. 222(1), pages 154-163.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Antonello Maruotti & Pierfrancesco Alaimo Di Loro, 2023. "CO2 emissions and growth: A bivariate bidimensional mean‐variance random effects model," Environmetrics, John Wiley & Sons, Ltd., vol. 34(5), August.
    2. Adrián Quintero-Sarmiento & Edilberto Cepeda-Cuervo & Vicente Núñez-Antón, 2012. "Estimating infant mortality in Colombia: some overdispersion modelling approaches," Journal of Applied Statistics, Taylor & Francis Journals, vol. 39(5), pages 1011-1036, October.
    3. Claudia Stommel & Heribert Hofer & Marion L East, 2016. "The Effect of Reduced Water Availability in the Great Ruaha River on the Vulnerable Common Hippopotamus in the Ruaha National Park, Tanzania," PLOS ONE, Public Library of Science, vol. 11(6), pages 1-18, June.
    4. van den Hout, Ardo & Muniz-Terrera, Graciela & Matthews, Fiona E., 2013. "Change point models for cognitive tests using semi-parametric maximum likelihood," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 684-698.
    5. Bauer, Ida & Haupt, Harry & Linner, Stefan, 2024. "Pinball boosting of regression quantiles," Computational Statistics & Data Analysis, Elsevier, vol. 200(C).
    6. Yixuan Wang & Jianzhu Li & Ping Feng & Rong Hu, 2015. "A Time-Dependent Drought Index for Non-Stationary Precipitation Series," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 29(15), pages 5631-5647, December.
    7. Hartzel, Jonathan & Liu, I-Ming & Agresti, Alan, 2001. "Describing heterogeneous effects in stratified ordinal contingency tables, with application to multi-center clinical trials," Computational Statistics & Data Analysis, Elsevier, vol. 35(4), pages 429-449, February.
    8. John M. Grego & Philip A. Yates, 2025. "Robust Local Likelihood Estimation for Non-stationary Flood Frequency Analysis," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 30(3), pages 663-682, September.
    9. Germán Ibacache-Pulgar & Gilberto Paula & Francisco Cysneiros, 2013. "Semiparametric additive models under symmetric distributions," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 22(1), pages 103-121, March.
    10. Auteri, Monica & Cremaschini, Alessandro, 2024. "Ownership or procurement, which matters? exploring asymmetries in local public transportation in Italy through a semi-parametric approach," The Journal of Economic Asymmetries, Elsevier, vol. 30(C).
    11. Nathaniel Geiger & Bryan McLaughlin & John Velez, 2021. "Not all boomers: temporal orientation explains inter- and intra-cultural variability in the link between age and climate engagement," Climatic Change, Springer, vol. 166(1), pages 1-20, May.
    12. Julian P. T. Higgins & Simon G. Thompson & David J. Spiegelhalter, 2009. "A re‐evaluation of random‐effects meta‐analysis," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 172(1), pages 137-159, January.
    13. Gauss Cordeiro & Josemar Rodrigues & Mário Castro, 2012. "The exponential COM-Poisson distribution," Statistical Papers, Springer, vol. 53(3), pages 653-664, August.
    14. Wanting He & Wenyuan Li & Yunran Wei, 2025. "Periodic evaluation of defined-contribution pension fund: A dynamic risk measure approach," Papers 2508.05241, arXiv.org.
    15. Markus Dabernig & Georg J. Mayr & Jakob W. Messner & Achim Zeileis, 2016. "Spatial Ensemble Post-Processing with Standardized Anomalies," Working Papers 2016-08, Faculty of Economics and Statistics, Universität Innsbruck.
    16. John Komlos & Marek Brabec, 2010. "The Trend of BMI Values by Centiles of US Adults, Birth Cohorts 1882-1986," CESifo Working Paper Series 3132, CESifo.
    17. Malavasi, Matteo & Peters, Gareth W. & Shevchenko, Pavel V. & Trück, Stefan & Jang, Jiwook & Sofronov, Georgy, 2022. "Cyber risk frequency, severity and insurance viability," Insurance: Mathematics and Economics, Elsevier, vol. 106(C), pages 90-114.
    18. Wiemann, Paul F.V. & Klein, Nadja & Kneib, Thomas, 2022. "Correcting for sample selection bias in Bayesian distributional regression models," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    19. Brian Francis & Jiayi Liu, 2015. "Modelling escalation in crime seriousness: a latent variable approach," METRON, Springer;Sapienza Università di Roma, vol. 73(2), pages 277-297, August.
    20. Zhang, Daowen & Davidian, Marie, 2004. "Likelihood and conditional likelihood inference for generalized additive mixed models for clustered data," Journal of Multivariate Analysis, Elsevier, vol. 91(1), pages 90-106, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:53:y:2008:i:2:p:381-393. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.