IDEAS home Printed from https://ideas.repec.org/p/rtv/ceisrp/51.html
   My bibliography  Save this paper

Semiparametric Mixture Models for Multivariate Count Data, with Application

Author

Listed:
  • Marco Alfò

    (Università degli Studi La Sapienza)

  • Giovanni Trovato

    (University of Rome II - Faculty of Economics)

Abstract

The analysis of overdispersed counts has been the focus of a large amount of literature, with the general objective of providing reliable parameter estimates in the presence of heterogeneity or dependence among subjects. In this paper we extend the standard variance component models to the analysis of multivariate counts, defining the dependence among counts through a set of correlated random coefficients. Estimation is carried out by numerical integration through an EM algorithm without parametric assumptions upon the random coefficients distribution. The proposed model is computationally parsimonious and, when applied to a real dataset, seems to produce better results than parametric models. A simulation study has been carried out to investigate the behavior of the proposed models in a series of empirical situations.

Suggested Citation

  • Marco Alfò & Giovanni Trovato, 2004. "Semiparametric Mixture Models for Multivariate Count Data, with Application," CEIS Research Paper 51, Tor Vergata University, CEIS.
  • Handle: RePEc:rtv:ceisrp:51
    as

    Download full text from publisher

    File URL: https://ceistorvergata.it/RePEc/rpaper/No-51-Alfo-Trovato.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Chib, Siddhartha, 2001. "Markov chain Monte Carlo methods: computation and inference," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 5, chapter 57, pages 3569-3649, Elsevier.
    2. Jung, Robert C & Winkelmann, Rainer, 1993. "Two Aspects of Labor Mobility: A Bivariate Poisson Regression Approach," Empirical Economics, Springer, vol. 18(3), pages 543-556.
    3. Murat K. Munkin & Pravin K. Trivedi, 1999. "Simulated maximum likelihood estimation of multivariate mixed-Poisson regression models, with application," Econometrics Journal, Royal Economic Society, vol. 2(1), pages 29-48.
    4. Leonhard Knorr-Held & Günter Raßer, 2000. "Bayesian Detection of Clusters and Discontinuities in Disease Maps," Biometrics, The International Biometric Society, vol. 56(1), pages 13-21, March.
    5. Davies, Richard B., 1993. "Nonparametric control for residual heterogeneity in modelling recurrent behaviour," Computational Statistics & Data Analysis, Elsevier, vol. 16(2), pages 143-160, August.
    6. Hausman, Jerry & Hall, Bronwyn H & Griliches, Zvi, 1984. "Econometric Models for Count Data with an Application to the Patents-R&D Relationship," Econometrica, Econometric Society, vol. 52(4), pages 909-938, July.
    7. Cameron, A Colin & Trivedi, Pravin K, 1993. "Tests of Independence in Parametric Models with Applications and Illustrations," Journal of Business & Economic Statistics, American Statistical Association, vol. 11(1), pages 29-43, January.
    8. Davidson, Russell & MacKinnon, James G., 1993. "Estimation and Inference in Econometrics," OUP Catalogue, Oxford University Press, number 9780195060119.
    9. Gurmu, Shiferaw & Elder, John, 2000. "Generalized bivariate count data regression models," Economics Letters, Elsevier, vol. 68(1), pages 31-36, July.
    10. A. C. Cameron & P. K. Trivedi & Frank Milne & J. Piggott, 1988. "A Microeconometric Model of the Demand for Health Care and Health Insurance in Australia," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 55(1), pages 85-106.
    11. G. J. McLachlan, 1987. "On Bootstrapping the Likelihood Ratio Test Statistic for the Number of Components in a Normal Mixture," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 36(3), pages 318-324, November.
    12. Heckman, James & Singer, Burton, 1984. "A Method for Minimizing the Impact of Distributional Assumptions in Econometric Models for Duration Data," Econometrica, Econometric Society, vol. 52(2), pages 271-320, March.
    13. Murray Aitkin, 1999. "A General Maximum Likelihood Analysis of Variance Components in Generalized Linear Models," Biometrics, The International Biometric Society, vol. 55(1), pages 117-128, March.
    14. Chib, Siddhartha & Winkelmann, Rainer, 2001. "Markov Chain Monte Carlo Analysis of Correlated Count Data," Journal of Business & Economic Statistics, American Statistical Association, vol. 19(4), pages 428-435, October.
    15. Rainer Winkelmann, 2000. "Seemingly Unrelated Negative Binomial Regression," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 62(4), pages 553-560, September.
    16. Brannas, Kurt & Rosenqvist, Gunnar, 1994. "Semiparametric estimation of heterogeneous count data models," European Journal of Operational Research, Elsevier, vol. 76(2), pages 247-258, July.
    17. Biernacki, Christophe & Celeux, Gilles & Govaert, Gerard, 2003. "Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 41(3-4), pages 561-575, January.
    18. Colin Cameron, A. & Windmeijer, Frank A. G., 1997. "An R-squared measure of goodness of fit for some common nonlinear regression models," Journal of Econometrics, Elsevier, vol. 77(2), pages 329-342, April.
    19. van Ophem, Hans, 1999. "A General Method To Estimate Correlated Discrete Random Variables," Econometric Theory, Cambridge University Press, vol. 15(2), pages 228-237, April.
    20. Cameron, A Colin & Johansson, Per, 1997. "Count Data Regression Using Series Expansions: With Applications," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 12(3), pages 203-223, May-June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lim, Hwa Kyung & Li, Wai Keung & Yu, Philip L.H., 2014. "Zero-inflated Poisson regression mixture model," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 151-158.
    2. Marco Alfò & Lorenzo Carbonari & Giovanni Trovato, 2020. "On the Effects of Taxation on Growth: an Empirical Assessment," CEIS Research Paper 480, Tor Vergata University, CEIS, revised 08 May 2020.
    3. Stefano Caiazza & Alberto Franco Pozzolo & Giovanni Trovato, 2016. "Bank efficiency measures, M&A decision and heterogeneity," Journal of Productivity Analysis, Springer, vol. 46(1), pages 25-41, August.
    4. Leonardo Becchetti & Roberto Rocci & Giovanni Trovato, 2007. "Industry and time specific deviations from fundamental values in a random coefficient model," Annals of Finance, Springer, vol. 3(2), pages 257-276, March.
    5. Leonardo Becchetti & Giovanni Trovato, 2011. "Corporate social responsibility and firm efficiency: a latent class stochastic frontier analysis," Journal of Productivity Analysis, Springer, vol. 36(3), pages 231-246, December.
    6. Payandeh Najafabadi Amir T. & MohammadPour Saeed, 2018. "A k-Inflated Negative Binomial Mixture Regression Model: Application to Rate–Making Systems," Asia-Pacific Journal of Risk and Insurance, De Gruyter, vol. 12(2), pages 1-31, July.
    7. Fabio Padovano & Nicolas Gavoille, 2013. "The Dual Political Legislation Cycle in France," Economics Working Paper from Condorcet Center for political Economy at CREM-CNRS 2013-02-ccr, Condorcet Center for political Economy, revised Jun 2014.
    8. Becchetti, L. & Corrado, L. & Rossetti , F., 2008. "Easterlin-types and Frustrated Achievers: the Heterogeneous Effects of Income Changes on Life Satisfaction," Cambridge Working Papers in Economics 0816, Faculty of Economics, University of Cambridge.
    9. Becchetti, Leonardo & Castriota, Stefano, 2010. "Wage differentials in social enterprises," AICCON Working Papers 68-2010, Associazione Italiana per la Cultura della Cooperazione e del Non Profit.
    10. Chadha, Alka, 2009. "TRIPs and patenting activity: Evidence from the Indian pharmaceutical industry," Economic Modelling, Elsevier, vol. 26(2), pages 499-505, March.
    11. Giuseppe Galloppo & Giovanni Trovato, 2017. "Fundamental driver of fund style drift," Journal of Asset Management, Palgrave Macmillan, vol. 18(2), pages 99-123, March.
    12. Landry, Craig E. & Liu, Haiyong, 2009. "A semi-parametric estimator for revealed and stated preference data--An application to recreational beach visitation," Journal of Environmental Economics and Management, Elsevier, vol. 57(2), pages 205-218, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Eugenio Miravete, 2014. "Testing for complementarities among countable strategies," Empirical Economics, Springer, vol. 46(4), pages 1521-1544, June.
    2. Marco Alfò & Cecilia Vitiello, 2003. "Finite mixtures approach to ecological regression," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 12(1), pages 93-108, February.
    3. Borislava Mihaylova & Andrew Briggs & Anthony O'Hagan & Simon G. Thompson, 2011. "Review of statistical methods for analysing healthcare resources and costs," Health Economics, John Wiley & Sons, Ltd., vol. 20(8), pages 897-916, August.
    4. George Tzougas & Despoina Makariou, 2022. "The multivariate Poisson‐Generalized Inverse Gaussian claim count regression model with varying dispersion and shape parameters," Risk Management and Insurance Review, American Risk and Insurance Association, vol. 25(4), pages 401-417, December.
    5. A. Colin Cameron & Per Johansson, 2004. "Bivariate Count Data Regression Using Series Expansions: With Applications," Working Papers 9815, University of California, Davis, Department of Economics.
    6. Atella, Vincenzo & Deb, Partha, 2008. "Are primary care physicians, public and private sector specialists substitutes or complements? Evidence from a simultaneous equations model for count data," Journal of Health Economics, Elsevier, vol. 27(3), pages 770-785, May.
    7. Antonello Maruotti & Pierfrancesco Alaimo Di Loro, 2023. "CO2 emissions and growth: A bivariate bidimensional mean‐variance random effects model," Environmetrics, John Wiley & Sons, Ltd., vol. 34(5), August.
    8. Tzougas, George & Makariou, Despoina, 2022. "The multivariate Poisson-Generalized Inverse Gaussian claim count regression model with varying dispersion and shape parameters," LSE Research Online Documents on Economics 117197, London School of Economics and Political Science, LSE Library.
    9. Bauer, Thomas K. & Million, Andreas & Rotte, Ralph & Zimmermann, Klaus F., 1998. "Immigration Labor and Workplace Safety," IZA Discussion Papers 16, Institute of Labor Economics (IZA).
    10. Mauro Laudicella & Paolo Li Donni, 2022. "The dynamic interdependence in the demand of primary and emergency secondary care: A hidden Markov approach," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(3), pages 521-536, April.
    11. A. Colin Cameron & Tong Li & Pravin K. Trivedi & David M. Zimmer, 2004. "Modelling the differences in counted outcomes using bivariate copula models with application to mismeasured counts," Econometrics Journal, Royal Economic Society, vol. 7(2), pages 566-584, December.
    12. William Greene, 2001. "Fixed and Random Effects in Nonlinear Models," Working Papers 01-01, New York University, Leonard N. Stern School of Business, Department of Economics.
    13. Bijwaard, G.E. & Franses, Ph.H.B.F., 2006. "Does rounding matter for payment efficiency?," Econometric Institute Research Papers EI 2006-43, Erasmus University Rotterdam, Erasmus School of Economics (ESE), Econometric Institute.
    14. Egan, Kevin & Herriges, Joseph, 2006. "Multivariate count data regression models with individual panel data from an on-site sample," Journal of Environmental Economics and Management, Elsevier, vol. 52(2), pages 567-581, September.
    15. Marco Alfo & Giovanni Trovato & Robert J. Waldmann, 2008. "Testing for country heterogeneity in growth models using a finite mixture approach," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 23(4), pages 487-514.
    16. Jie Q. Guo & Pravin K. Trivedi, 2002. "Flexible Parametric Models for Long‐tailed Patent Count Distributions," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 64(1), pages 63-82, February.
    17. Sarker, Rakhal & Surry, Yves R., 2003. "The Fast Decay Process In Recreational Demand Activities And The Use Of Alternative Count Data Models," Working Papers 34147, University of Guelph, Department of Food, Agricultural and Resource Economics.
    18. Bermúdez, Lluís & Karlis, Dimitris, 2012. "A finite mixture of bivariate Poisson regression models with an application to insurance ratemaking," Computational Statistics & Data Analysis, Elsevier, vol. 56(12), pages 3988-3999.
    19. Alfò, Marco & Rocchetti, Irene, 2013. "A flexible approach to finite mixture regression models for multivariate mixed responses," Statistics & Probability Letters, Elsevier, vol. 83(7), pages 1754-1758.
    20. Begoña Álvarez & Daniel Miles, 2003. "Gender effect on housework allocation: Evidence from Spanish two-earner couples," Journal of Population Economics, Springer;European Society for Population Economics, vol. 16(2), pages 227-242, May.

    More about this item

    Keywords

    Correlated counts; Multivariate counts; Correlated random effects; Non-parametric ML;
    All these keywords.

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:rtv:ceisrp:51. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Barbara Piazzi (email available below). General contact details of provider: https://edirc.repec.org/data/csrotit.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.