Printed from https://ideas.repec.org/a/spr/compst/v40y2025i1d10.1007_s00180-024-01491-4.html

A simple algorithm for computing the probabilities of count models based on pure birth processes

Author

Listed:
  • Mongkol Hunkrajok

    (Independent Researcher)

  • Wanrudee Skulpakdee

    (National Institute of Development Administration)

Abstract

Recently, non-monotonic rate sequences of pure birth processes have been the focus of much attention in the analysis of count data due to their ability to provide a combination of over-, under-, and equidispersed distributions without the need to reuse covariates (traditional methods). They also permit the modeling of excess counts, a frequent issue arising when using count models based on monotonic rate sequences such as the Poisson, gamma, Weibull, Conway-Maxwell-Poisson (CMP), Faddy (1997), etc. Matrix-exponential approaches have always been used for computing the probabilities for count models based on pure birth processes, although none have been proposed for them as a specific algorithm. It is intractable to calculate these pure birth probabilities numerically in an analytic form because severe numerical cancellations may occur. However, we circumvent this difficulty by exploiting a Taylor series expansion, and then a new analytic form is derived. We developed a simple algorithm for efficiently implementing the new formula and conducted numerical experiments to study the efficiency and accuracy of the developed algorithm. The results indicate that this new approach is faster and more accurate than the matrix-exponential methods.
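
The abstract does not state the authors' new formula, so the sketch below is not their algorithm. It only illustrates the quantity under discussion, the probabilities P_n(t) of a pure birth process with rate sequence lambda_0, lambda_1, ..., using uniformization, a standard technique swapped in here because it adds only non-negative terms and so avoids the cancellation problem the abstract mentions. The names pure_birth_probs, rates, tol, and max_terms are hypothetical, and the code assumes a moderate value of max(rates) * t, since exp(-max(rates) * t) underflows when that product is very large.

```python
import math


def pure_birth_probs(rates, t, tol=1e-12, max_terms=100_000):
    """Return [P_0(t), ..., P_n(t)] for a pure birth process started in state 0.

    rates[i] is the birth rate lambda_i out of state i (all rates >= 0).
    Illustrative uniformization sketch, not the algorithm of the paper.
    """
    n_states = len(rates)
    big = max(rates)  # uniformization rate
    if big == 0.0 or t == 0.0:
        return [1.0] + [0.0] * (n_states - 1)

    # Probability of jumping i -> i+1 in one step of the uniformized chain.
    move = [r / big for r in rates]
    # Distribution of the uniformized chain after k steps (k = 0 here).
    v = [1.0] + [0.0] * (n_states - 1)
    weight = math.exp(-big * t)  # Poisson(big * t) pmf at k = 0
    probs = [weight * x for x in v]
    cum = weight  # total Poisson mass used so far
    k = 0
    while 1.0 - cum > tol and k < max_terms:
        k += 1
        # One step of the chain; mass leaving the last tracked state is dropped.
        v = [
            v[i] * (1.0 - move[i]) + (v[i - 1] * move[i - 1] if i > 0 else 0.0)
            for i in range(n_states)
        ]
        weight *= big * t / k  # Poisson pmf recurrence
        cum += weight
        probs = [p + weight * x for p, x in zip(probs, v)]
    return probs


if __name__ == "__main__":
    # A non-monotonic rate sequence, chosen arbitrarily for illustration.
    for n, p in enumerate(pure_birth_probs([1.0, 3.0, 0.5, 2.0, 1.5], t=1.0)):
        print(n, p)
```

With a constant rate sequence the result reduces to the Poisson probabilities, which gives a quick sanity check. Because only counts 0 to n are tracked, the returned values sum to less than one by the probability of exceeding n.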

Suggested Citation

  • Mongkol Hunkrajok & Wanrudee Skulpakdee, 2025. "A simple algorithm for computing the probabilities of count models based on pure birth processes," Computational Statistics, Springer, vol. 40(1), pages 249-272, January.
  • Handle: RePEc:spr:compst:v:40:y:2025:i:1:d:10.1007_s00180-024-01491-4
    DOI: 10.1007/s00180-024-01491-4

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-024-01491-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.


    References listed on IDEAS

    1. Winkelmann, Rainer, 1995. "Duration Dependence and Dispersion in Count-Data Models," Journal of Business & Economic Statistics, American Statistical Association, vol. 13(4), pages 467-474, October.
    2. Seth D. Guikema & Jeremy P. Goffelt, 2008. "A Flexible Count Data Regression Model for Risk Analysis," Risk Analysis, John Wiley & Sons, vol. 28(1), pages 213-223, February.
    3. M. J. Faddy & D. M. Smith, 2011. "Analysis of count data with covariate dependence in both mean and variance," Journal of Applied Statistics, Taylor & Francis Journals, vol. 38(12), pages 2683-2694, February.
    4. M. J. Faddy & D. M. Smith, 2008. "Extended Poisson process modelling of dilution series data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 57(4), pages 461-471, September.
    5. Smith, David M. & Faddy, Malcolm J., 2016. "Mean and Variance Modeling of Under- and Overdispersed Count Data," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 69(i06).
    6. Alina Peluso & Veronica Vinciotti & Keming Yu, 2019. "Discrete Weibull generalized additive model: an application to count fertility data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 68(3), pages 565-583, April.
    7. Gordon K. Smyth & Heather M. Podlich, 2002. "An Improved Saddlepoint Approximation Based on the Negative Binomial Distribution for the General Birth Process," Computational Statistics, Springer, vol. 17(1), pages 17-28, March.
    8. Mabel Morales-Otero & Vicente Núñez-Antón, 2021. "Comparing Bayesian Spatial Conditional Overdispersion and the Besag–York–Mollié Models: Application to Infant Mortality Rates," Mathematics, MDPI, vol. 9(3), pages 1-33, January.
    9. Marcelo Bourguignon & Rodrigo M. R. Medeiros, 2022. "A simple and useful regression model for fitting count data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(3), pages 790-827, September.
    10. Robert Jung & Gerd Ronning & A. Tremayne, 2005. "Estimation in conditional first order autoregression with discrete support," Statistical Papers, Springer, vol. 46(2), pages 195-224, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Célestin C. Kokonendji & Sobom M. Somé & Youssef Esstafa & Marcelo Bourguignon, 2023. "On Underdispersed Count Kernels for Smoothing Probability Mass Functions," Stats, MDPI, vol. 6(4), pages 1-15, November.
    2. Smith, David M. & Faddy, Malcolm J., 2016. "Mean and Variance Modeling of Under- and Overdispersed Count Data," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 69(i06).
    3. Sáez-Castillo, A.J. & Conde-Sánchez, A., 2013. "A hyper-Poisson regression model for overdispersed and underdispersed count data," Computational Statistics & Data Analysis, Elsevier, vol. 61(C), pages 148-157.
    4. S. Hadi Khazraee & Antonio Jose Sáez‐Castillo & Srinivas Reddy Geedipally & Dominique Lord, 2015. "Application of the Hyper‐Poisson Generalized Linear Model for Analyzing Motor Vehicle Crashes," Risk Analysis, John Wiley & Sons, vol. 35(5), pages 919-930, May.
    5. Kimberly F. Sellers & Tong Li & Yixuan Wu & Narayanaswamy Balakrishnan, 2021. "A Flexible Multivariate Distribution for Correlated Count Data," Stats, MDPI, vol. 4(2), pages 1-19, April.
    6. Christian Weiß, 2008. "Thinning operations for modeling time series of counts—a survey," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 92(3), pages 319-341, August.
    7. Eugenio J. Miravete, 2004. "The Doubtful Profitability of Foggy Pricing," Working Papers 04-07, NET Institute.
    8. Bijwaard, Govert E. & Franses, Philip Hans & Paap, Richard, 2006. "Modeling Purchases as Repeated Events," Journal of Business & Economic Statistics, American Statistical Association, vol. 24, pages 487-502, October.
    9. Gurmu, Shiferaw & Rilstone, Paul & Stern, Steven, 1998. "Semiparametric estimation of count regression models," Journal of Econometrics, Elsevier, vol. 88(1), pages 123-150, November.
    10. Darcy Steeg Morris & Kimberly F. Sellers, 2022. "A Flexible Mixed Model for Clustered Count Data," Stats, MDPI, vol. 5(1), pages 1-18, January.
    11. Royce A. Francis & Srinivas Reddy Geedipally & Seth D. Guikema & Soma Sekhar Dhavala & Dominique Lord & Sarah LaRocca, 2012. "Characterizing the Performance of the Conway‐Maxwell Poisson Generalized Linear Model," Risk Analysis, John Wiley & Sons, vol. 32(1), pages 167-183, January.
    12. Rainer Winkelmann & Klaus Zimmermann, 1998. "Is job stability declining in Germany? Evidence from count data models," Applied Economics, Taylor & Francis Journals, vol. 30(11), pages 1413-1420.
    13. Azizpour, S & Giesecke, K. & Schwenkler, G., 2018. "Exploring the sources of default clustering," Journal of Financial Economics, Elsevier, vol. 129(1), pages 154-183.
    14. Stefano Mainardi, 2003. "Testing convergence in life expectancies: count regression models on panel data," Prague Economic Papers, Prague University of Economics and Business, vol. 2003(4), pages 350-370.
    15. Jan Beran & Frieder Droullier, 2024. "On strongly dependent zero-inflated INAR(1) processes," Statistical Papers, Springer, vol. 65(4), pages 2527-2553, June.
    16. Bhati, Avinash, 2007. "Learning from multiple analogies: an Information Theoretic framework for predicting criminal recidivism," MPRA Paper 11850, University Library of Munich, Germany.
    17. Hoyos, David & Riera, Pere, 2013. "Convergent validity between revealed and stated recreation demand data: Some empirical evidence from the Basque Country, Spain," Journal of Forest Economics, Elsevier, vol. 19(3), pages 234-248.
    18. Douglas Toledo & Cristiane Akemi Umetsu & Antonio Fernando Monteiro Camargo & Idemauro Antonio Rodrigues Lara, 2022. "Flexible models for non-equidispersed count data: comparative performance of parametric models to deal with underdispersion," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 106(3), pages 473-497, September.
    19. Somayeh Ghorbani Gholiabad & Abbas Moghimbeigi & Javad Faradmal, 2021. "Three-level zero-inflated Conway–Maxwell–Poisson regression model for analyzing dispersed clustered count data with extra zeros," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(2), pages 415-439, November.
    20. Eugenio J. Miravete, 2009. "Competing with Menus of Tariff Options," Journal of the European Economic Association, MIT Press, vol. 7(1), pages 188-205, March.
