IDEAS home Printed from https://ideas.repec.org/a/jss/jstsof/v070c01.html
   My bibliography  Save this article

Generating Correlated and/or Overdispersed Count Data: A SAS Implementation

Author

Listed:
  • Kalema, George
  • Molenberghs, Geert

Abstract

Analysis of longitudinal count data has, for long, been done using a generalized linear mixed model (GLMM), in its Poisson-normal version, to account for correlation by specifying normal random effects. Univariate counts are often handled with the negativebinomial (NEGBIN) model taking into account overdispersion by use of gamma random effects. Inherently though, longitudinal count data commonly exhibit both features of correlation and overdispersion simultaneously, necessitating analysis methodology that can account for both. The introduction of the combined model (CM) by Molenberghs, Verbeke, and Demétrio (2007) and Molenberghs, Verbeke, Demétrio, and Vieira (2010) serves this purpose, not only for count data but for the general exponential family of distributions. Here, a Poisson model is specified as the parent distribution of the data with a normally distributed random effect at the subject or cluster level and/or a gamma distribution at observation level. The GLMM and NEGBIN model are special cases. Data can be simulated from (1) the general CM, with random effects, or, (2) its marginal version directly. This paper discusses an implementation of (1) in SAS software (SAS Inc. 2011). One needs to reflect on the mean of both the combined (hierarchical) and marginal models in order to generate correlated and/or overdispersed counts. A pre-specification of the desired marginal mean (in terms of covariates and marginal parameters), a marginal variance-covariance structure and the hierarchical mean (in terms of covariates and regression parameters) is required. The implied hierarchical parameters, the variance-covariance matrix of the random effects, and the variance-covariance matrix of the overdispersion part are then derived from which correlated Poisson data are generated. Sample calls of the SAS macro are presented as well as output.

Suggested Citation

  • Kalema, George & Molenberghs, Geert, 2016. "Generating Correlated and/or Overdispersed Count Data: A SAS Implementation," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 70(c01).
  • Handle: RePEc:jss:jstsof:v:070:c01
    DOI: http://hdl.handle.net/10.18637/jss.v070.c01
    as

    Download full text from publisher

    File URL: https://www.jstatsoft.org/index.php/jss/article/view/v070c01/v70c01.pdf
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v070c01/CorrPoisson.sas
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v070c01/v70c01.sas
    Download Restriction: no

    File URL: https://libkey.io/http://hdl.handle.net/10.18637/jss.v070.c01?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Dimitris Karlis, 2003. "An EM algorithm for multivariate Poisson distribution and related models," Journal of Applied Statistics, Taylor & Francis Journals, vol. 30(1), pages 63-77.
    2. Athanassios N. Avramidis & Nabil Channouf & Pierre L'Ecuyer, 2009. "Efficient Correlation Matching for Fitting Discrete Multivariate Distributions with Arbitrary Marginals and Normal-Copula Dependence," INFORMS Journal on Computing, INFORMS, vol. 21(1), pages 88-106, February.
    3. Soumyadip Ghosh & Raghu Pasupathy, 2012. "C-NORTA: A Rejection Procedure for Sampling from the Tail of Bivariate NORTA Distributions," INFORMS Journal on Computing, INFORMS, vol. 24(2), pages 295-310, May.
    4. Kaeyoung Shin & Raghu Pasupathy, 2010. "An Algorithm for Fast Generation of Bivariate Poisson Random Vectors," INFORMS Journal on Computing, INFORMS, vol. 22(1), pages 81-92, February.
    5. Kojadinovic, Ivan & Yan, Jun, 2010. "Modeling Multivariate Distributions with Continuous Margins Using the copula R Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 34(i09).
    6. Marne C. Cario & Barry L. Nelson, 1998. "Numerical Methods for Fitting and Simulating Autoregressive-to-Anything Processes," INFORMS Journal on Computing, INFORMS, vol. 10(1), pages 72-81, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kaeyoung Shin & Raghu Pasupathy, 2010. "An Algorithm for Fast Generation of Bivariate Poisson Random Vectors," INFORMS Journal on Computing, INFORMS, vol. 22(1), pages 81-92, February.
    2. Mohamed A. Ayadi & Hatem Ben-Ameur & Nabil Channouf & Quang Khoi Tran, 2019. "NORTA for portfolio credit risk," Annals of Operations Research, Springer, vol. 281(1), pages 99-119, October.
    3. Soumyadip Ghosh & Raghu Pasupathy, 2012. "C-NORTA: A Rejection Procedure for Sampling from the Tail of Bivariate NORTA Distributions," INFORMS Journal on Computing, INFORMS, vol. 24(2), pages 295-310, May.
    4. Song, Zhi & Mukherjee, Amitava & Zhang, Jiujun, 2021. "Some robust approaches based on copula for monitoring bivariate processes and component-wise assessment," European Journal of Operational Research, Elsevier, vol. 289(1), pages 177-196.
    5. Arturo Cortés Aguilar, 2011. "Estimación del residual de un bono respaldado por hipotecas mediante un modelo de riesgo crédito: una comparación de resultados de la teoría de cópulas y el modelo IRB de Basilea II en datos del merca," Revista de Administración, Finanzas y Economía (Journal of Management, Finance and Economics), Tecnológico de Monterrey, Campus Ciudad de México, vol. 5(1), pages 50-64.
    6. Righi, Marcelo Brutti & Ceretta, Paulo Sergio, 2013. "Estimating non-linear serial and cross-interdependence between financial assets," Journal of Banking & Finance, Elsevier, vol. 37(3), pages 837-846.
    7. Chen, Huifen & Cheng, Yuyen, 2009. "Designing charts for known autocorrelations and unknown marginal distribution," European Journal of Operational Research, Elsevier, vol. 198(2), pages 520-529, October.
    8. Elberg, Christina & Hagspiel, Simeon, 2015. "Spatial dependencies of wind power and interrelations with spot price dynamics," European Journal of Operational Research, Elsevier, vol. 241(1), pages 260-272.
    9. Tianyang Wang & James S. Dyer & Warren J. Hahn, 2017. "Sensitivity analysis of decision making under dependent uncertainties using copulas," EURO Journal on Decision Processes, Springer;EURO - The Association of European Operational Research Societies, vol. 5(1), pages 117-139, November.
    10. Berghaus, Betina & Segers, Johan, 2017. "Weak convergence of the weighted empirical beta copula process," LIDAM Discussion Papers ISBA 2017015, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    11. Hejn Nielsen, Erland, 2007. "Autocorrelation in queuing network-type production systems--Revisited," International Journal of Production Economics, Elsevier, vol. 110(1-2), pages 138-146, October.
    12. Max Auerswald & Morten Moshagen, 2015. "Generating Correlated, Non-normally Distributed Data Using a Non-linear Structural Model," Psychometrika, Springer;The Psychometric Society, vol. 80(4), pages 920-937, December.
    13. Aloui, Riadh & Aïssa, Mohamed Safouane Ben & Hammoudeh, Shawkat & Nguyen, Duc Khuong, 2014. "Dependence and extreme dependence of crude oil and natural gas prices with applications to risk management," Energy Economics, Elsevier, vol. 42(C), pages 332-342.
    14. Punzo, Antonio & Bagnato, Luca, 2022. "Dimension-wise scaled normal mixtures with application to finance and biometry," Journal of Multivariate Analysis, Elsevier, vol. 191(C).
    15. Jorge A. Sefair & Oscar Guaje & Andrés L. Medaglia, 2021. "A column-oriented optimization approach for the generation of correlated random vectors," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 43(3), pages 777-808, September.
    16. He, Miao & Zhao, Lei & Powell, Warren B., 2012. "Approximate dynamic programming algorithms for optimal dosage decisions in controlled ovarian hyperstimulation," European Journal of Operational Research, Elsevier, vol. 222(2), pages 328-340.
    17. Hofert, Marius & Mächler, Martin & McNeil, Alexander J., 2012. "Likelihood inference for Archimedean copulas in high dimensions under known margins," Journal of Multivariate Analysis, Elsevier, vol. 110(C), pages 133-150.
    18. Guo-Liang Tian & Xiqian Ding & Yin Liu & Man-Lai Tang, 2019. "Some new statistical methods for a class of zero-truncated discrete distributions with applications," Computational Statistics, Springer, vol. 34(3), pages 1393-1426, September.
    19. Enkelejd Hashorva & Simone A. Padoan & Stefano Rizzelli, 2021. "Multivariate extremes over a random number of observations," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(3), pages 845-880, September.
    20. Steve Hyun & Jimin Lee & Jong-Min Kim & Chulhee Jun, 2019. "What Coins Lead in the Cryptocurrency Market: Using Copula and Neural Networks Models," JRFM, MDPI, vol. 12(3), pages 1-14, August.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:jss:jstsof:v:070:c01. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum (email available below). General contact details of provider: http://www.jstatsoft.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.