IDEAS home Printed from https://ideas.repec.org/p/tin/wpaper/20210017.html
   My bibliography  Save this paper

Bayes estimates of multimodal density features using DNA and Economic Data

Author

Listed:
  • Nalan Basturk

    (Maastricht University)

  • Lennart Hoogerheide

    (Vrije Universiteit Amsterdam)

  • Herman K. van Dijk

    (Erasmus University Rotterdam)

Abstract

In several scientific fields, like bioinformatics, financial and macro-economics, important theoretical and practical issues exist that involve multimodal data distributions. We propose a Bayesian approach using mixtures distributions to approximate accurately such data distributions. Shape and other features of the mixture approximations are estimated including their uncertainty. For discrete data, we introduce a novel mixture of shifted Poisson distributions with an unknown number of components, which overcomes the equidispersion restriction in the standard Poisson which accomodates a wide range of shapes such as multimodality and long tails. Our simulation-based Bayesian inference treats the density features as random variables and highest credibility regions around features are easily obtained. For discrete data we develop an adapted version of the Reversible Jump Markov Chain Monte Carlo (RJMCMC) method, which allows for an unknown number of components instead of the more restrictive approach of choosing a particular number of mixture components using information criteria. Using simulated data, we show that our approach works successfully for three issues that one encounters during the estimation of mixtures: label switching; mixture complexity and prior information and mode membership versus component membership. The proposed method is applied to three empirical data sets: The count data method yields a novel perspective of the data on DNA tandem repeats in \cite{DNA_leiden}; the bimodal distribution of payment details of clients obtaining a loan from a financial institution in Spain in 1990 gives insight into the repayment ability of individual clients; and the distribution of the modes of real GDP growth data from the PennWorld Tables and their evolution over time explores possible world-wide economic convergence as well as group convergence between the US and European countries. The results of our descriptive analysis may be used as input for forecasting and policy analysis.

Suggested Citation

  • Nalan Basturk & Lennart Hoogerheide & Herman K. van Dijk, 2021. "Bayes estimates of multimodal density features using DNA and Economic Data," Tinbergen Institute Discussion Papers 21-017/III, Tinbergen Institute.
  • Handle: RePEc:tin:wpaper:20210017
    as

    Download full text from publisher

    File URL: https://papers.tinbergen.nl/21017.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Dionne, Georges & Artis, Manuel & Guillen, Montserrat, 1996. "Count data models for a credit scoring system," Journal of Empirical Finance, Elsevier, vol. 3(3), pages 303-325, September.
    2. Basturk, N. & Paap, R. & van Dijk, D.J.C., 2010. "Financial Development and Convergence Clubs," Econometric Institute Research Papers EI 2010-52, Erasmus University Rotterdam, Erasmus School of Economics (ESE), Econometric Institute.
    3. Quah, Danny T., 1996. "Empirics for economic growth and convergence," European Economic Review, Elsevier, vol. 40(6), pages 1353-1375, June.
    4. Paapaa, Richard & van Dijk, Herman K., 1998. "Distribution and mobility of wealth of nations," European Economic Review, Elsevier, vol. 42(7), pages 1269-1293, July.
    5. Sylvia. Richardson & Peter J. Green, 1997. "On Bayesian Analysis of Mixtures with an Unknown Number of Components (with discussion)," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 59(4), pages 731-792.
    6. Umashanger, T. & Sriram, T.N., 2009. "L2E estimation of mixture complexity for count data," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 4243-4254, October.
    7. P. M. Hartigan, 1985. "Computation of the Dip Statistic to Test for Unimodality," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 34(3), pages 320-325, November.
    8. Grn, Bettina & Leisch, Friedrich, 2009. "Dealing with label switching in mixture models under genuine multimodality," Journal of Multivariate Analysis, Elsevier, vol. 100(5), pages 851-861, May.
    9. repec:dau:papers:123456789/4648 is not listed on IDEAS
    10. Fischer, N. I. & Mammen, E. & Marron, J. S., 1994. "Testing for multimodality," Computational Statistics & Data Analysis, Elsevier, vol. 18(5), pages 499-512, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jamie L. Cross & Lennart Hoogerheide & Paul Labonne & Herman K. van Dijk, 2024. "Flexible Negative Binomial Mixtures for Credible Mode Inference in Heterogeneous Count Data from Finance, Economics and Bioinformatics," Tinbergen Institute Discussion Papers 24-056/III, Tinbergen Institute.
    2. Roberto Zelli & Maria Grazia Pittau, 2006. "Empirical evidence of income dynamics across EU regions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 21(5), pages 605-628.
    3. Chacón, José E. & Fernández Serrano, Javier, 2024. "Bayesian taut splines for estimating the number of modes," Computational Statistics & Data Analysis, Elsevier, vol. 196(C).
    4. Steven N. Durlauf & Andros Kourtellos & Chih Ming Tan, 2008. "Empirics of Growth and Development," Chapters, in: Amitava Krishna Dutt & Jaime Ros (ed.), International Handbook of Development Economics, Volumes 1 & 2, volume 0, chapter 3, Edward Elgar Publishing.
    5. Leone Leonida & Leone Leonida & Daniel Montolio, 2003. "Public Capital, Growth and Convergence in Spain. A Counterfactual Density Estimation Approach," Working Papers 2003/3, Institut d'Economia de Barcelona (IEB).
    6. Danny Quah, 1996. "Twin Peaks: Growth and Convergence in Models of Distribution Dynamics," CEP Discussion Papers dp0280, Centre for Economic Performance, LSE.
    7. Daniel J. Henderson & Christopher F. Parmeter & R. Robert Russell, 2008. "Modes, weighted modes, and calibrated modes: evidence of clustering using modality tests," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 23(5), pages 607-638.
    8. Shana M. Sundstrom & Craig R. Allen & David G. Angeler, 2020. "Scaling and discontinuities in the global economy," Journal of Evolutionary Economics, Springer, vol. 30(2), pages 319-345, April.
    9. Hobijn, Bart & Franses, Philip Hans, 2001. "Are living standards converging?," Structural Change and Economic Dynamics, Elsevier, vol. 12(2), pages 171-200, July.
    10. Ana Lamo, 2000. "On convergence empirics: same evidence for Spanish regions," Investigaciones Economicas, Fundación SEPI, vol. 24(3), pages 681-707, September.
    11. Gary Koop, 2004. "Modelling the evolution of distributions: an application to Major League baseball," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 167(4), pages 639-655, November.
    12. Kazuhiko Kakamu, 2022. "Bayesian analysis of mixtures of lognormal distribution with an unknown number of components from grouped data," Papers 2210.05115, arXiv.org, revised Sep 2023.
    13. Răileanu-Szeles, Monica & Albu, Lucian, 2015. "Nonlinearities and divergences in the process of European financial integration," Economic Modelling, Elsevier, vol. 46(C), pages 416-425.
    14. Sebastian Vollmer & Hajo Holzmann & Florian Ketterer & Stephan Klasen, 2013. "Distribution dynamics of regional GDP per employee in unified Germany," Empirical Economics, Springer, vol. 44(2), pages 491-509, April.
    15. Madison Terrell & Qazi Haque & Jamie L. Cross & Firmin Doko Tchatoka, 2023. "Monetary policy shocks and exchange rate dynamics in small open economies," School of Economics and Public Policy Working Papers 2023-04 Classification-C3, University of Adelaide, School of Economics and Public Policy.
    16. Amini, Shahram & Battisti, Michele & Parmeter, Christopher F., 2017. "Decomposing changes in the conditional variance of GDP over time," Economic Modelling, Elsevier, vol. 61(C), pages 376-387.
    17. Roberta Colavecchio & Declan Curran & Michael Funke, 2009. "Drifting together or falling apart? The empirics of regional economic growth in post-unification Germany," Applied Economics, Taylor & Francis Journals, vol. 43(9), pages 1087-1098.
    18. Don Webber, 2001. "A slowing of national income convergence," Applied Economics Letters, Taylor & Francis Journals, vol. 8(11), pages 709-711.
    19. Quah, Danny T, 1997. "Empirics for Growth and Distribution: Stratification, Polarization, and Convergence Clubs," Journal of Economic Growth, Springer, vol. 2(1), pages 27-59, March.
    20. Longford, N.T. & Pittau, M.G., 2006. "Stability of household income in European countries in the 1990s," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 1364-1383, November.

    More about this item

    Keywords

    Multimodality; mixtures; Markov Chain Monte Carlo; Bayesian Inference;
    All these keywords.

    JEL classification:

    • C11 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Bayesian Analysis: General
    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C63 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Computational Techniques

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:tin:wpaper:20210017. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tinbergen Office +31 (0)10-4088900 (email available below). General contact details of provider: https://edirc.repec.org/data/tinbenl.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.