IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v28y2013i4p1571-1597.html
   My bibliography  Save this article

Finite mixtures of unimodal beta and gamma densities and the $$k$$ -bumps algorithm

Author

Listed:
  • Luca Bagnato
  • Antonio Punzo

Abstract

This paper addresses the problem of estimating a density, with either a compact support or a support bounded at only one end, exploiting a general and natural form of a finite mixture of distributions. Due to the importance of the concept of multimodality in the mixture framework, unimodal beta and gamma densities are used as mixture components, leading to a flexible modeling approach. Accordingly, a mode-based parameterization of the components is provided. A partitional clustering method, named $$k$$ -bumps, is also proposed; it is used as an ad hoc initialization strategy in the EM algorithm to obtain the maximum likelihood estimation of the mixture parameters. The performance of the $$k$$ -bumps algorithm as an initialization tool, in comparison to other common initialization strategies, is evaluated through some simulation experiments. Finally, two real applications are presented. Copyright Springer-Verlag Berlin Heidelberg 2013

Suggested Citation

  • Luca Bagnato & Antonio Punzo, 2013. "Finite mixtures of unimodal beta and gamma densities and the $$k$$ -bumps algorithm," Computational Statistics, Springer, vol. 28(4), pages 1571-1597, August.
  • Handle: RePEc:spr:compst:v:28:y:2013:i:4:p:1571-1597
    DOI: 10.1007/s00180-012-0367-4
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s00180-012-0367-4
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s00180-012-0367-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Calabrese, Raffaella & Zenga, Michele, 2010. "Bank loan recovery rates: Measuring and nonparametric density estimation," Journal of Banking & Finance, Elsevier, vol. 34(5), pages 903-911, May.
    2. Song Chen, 2000. "Probability Density Function Estimation Using Gamma Kernels," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 52(3), pages 471-480, September.
    3. Bruche, Max & González-Aguado, Carlos, 2010. "Recovery rates, default probabilities, and the credit cycle," Journal of Banking & Finance, Elsevier, vol. 34(4), pages 754-764, April.
    4. Peter Congdon, 1993. "Statistical Graduation in Local Demographic Analysis and Projection," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 156(2), pages 237-270, March.
    5. Leisch, Friedrich, 2004. "FlexMix: A General Framework for Finite Mixture Models and Latent Class Regression in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 11(i08).
    6. Simon Lee & X. Lin, 2010. "Modeling and Evaluating Insurance Losses Via Mixtures of Erlang Distributions," North American Actuarial Journal, Taylor & Francis Journals, vol. 14(1), pages 107-130.
    7. Sonia Petrone, 1999. "Random Bernstein Polynomials," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 26(3), pages 373-393, September.
    8. Chen, Song Xi, 1999. "Beta kernel estimators for density functions," Computational Statistics & Data Analysis, Elsevier, vol. 31(2), pages 131-145, August.
    9. Schilling M.F. & Watkins A.E. & Watkins W., 2002. "Is Human Height Bimodal?," The American Statistician, American Statistical Association, vol. 56, pages 223-229, August.
    10. Biernacki, Christophe & Celeux, Gilles & Govaert, Gerard, 2003. "Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 41(3-4), pages 561-575, January.
    11. Celeux, Gilles & Govaert, Gerard, 1992. "A classification EM algorithm for clustering and two stochastic versions," Computational Statistics & Data Analysis, Elsevier, vol. 14(3), pages 315-332, October.
    12. J. Wessels, 1964. "Multimodality in a family of probability densities, with application to a linear mixture of two normal densities," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 18(3), pages 267-282, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Antonio Punzo & Paul. D. McNicholas, 2017. "Robust Clustering in Regression Analysis via the Contaminated Gaussian Cluster-Weighted Model," Journal of Classification, Springer;The Classification Society, vol. 34(2), pages 249-293, July.
    2. Mazza, Angelo & Punzo, Antonio, 2014. "DBKGrad: An R Package for Mortality Rates Graduation by Discrete Beta Kernel Techniques," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 57(c02).
    3. Ouimet, Frédéric, 2021. "Asymptotic properties of Bernstein estimators on the simplex," Journal of Multivariate Analysis, Elsevier, vol. 185(C).
    4. Ahmed Z. Afify & Ahmed M. Gemeay & Noor Akma Ibrahim, 2020. "The Heavy-Tailed Exponential Distribution: Risk Measures, Estimation, and Application to Actuarial Data," Mathematics, MDPI, vol. 8(8), pages 1-28, August.
    5. Jobst, Rainer & Kellner, Ralf & Rösch, Daniel, 2020. "Bayesian loss given default estimation for European sovereign bonds," International Journal of Forecasting, Elsevier, vol. 36(3), pages 1073-1091.
    6. Salvatore D. Tomarchio & Antonio Punzo, 2019. "Modelling the loss given default distribution via a family of zero‐and‐one inflated mixture models," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 182(4), pages 1247-1266, October.
    7. Marco Centoni & Vieri Del Panta & Antonello Maruotti & Valentina Raponi, 2019. "Concomitant-Variable Latent-Class Beta Inflated Models to Assess Students’ Performance: An Italian Case Study," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 146(1), pages 7-18, November.
    8. Nema Dean & Rebecca Nugent, 2013. "Clustering student skill set profiles in a unit hypercube using mixtures of multivariate betas," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 7(3), pages 339-357, September.
    9. Maruotti, Antonello & Punzo, Antonio, 2017. "Model-based time-varying clustering of multivariate longitudinal data with covariates and outliers," Computational Statistics & Data Analysis, Elsevier, vol. 113(C), pages 475-496.
    10. Punzo, Antonio & Bagnato, Luca & Maruotti, Antonello, 2018. "Compound unimodal distributions for insurance losses," Insurance: Mathematics and Economics, Elsevier, vol. 81(C), pages 95-107.
    11. Salvatore Ingrassia & Antonio Punzo & Giorgio Vittadini & Simona Minotti, 2015. "The Generalized Linear Mixed Cluster-Weighted Model," Journal of Classification, Springer;The Classification Society, vol. 32(1), pages 85-113, April.
    12. Salvatore Ingrassia & Antonio Punzo, 2020. "Cluster Validation for Mixtures of Regressions via the Total Sum of Squares Decomposition," Journal of Classification, Springer;The Classification Society, vol. 37(2), pages 526-547, July.
    13. Shi, Yue & Punzo, Antonio & Otneim, Håkon & Maruotti, Antonello, 2023. "Hidden semi-Markov models for rainfall-related insurance claims," Discussion Papers 2023/17, Norwegian School of Economics, Department of Business and Management Science.
    14. Angelo Mazza & Antonio Punzo, 2020. "Mixtures of multivariate contaminated normal regression models," Statistical Papers, Springer, vol. 61(2), pages 787-822, April.
    15. Morris, Katherine & Punzo, Antonio & McNicholas, Paul D. & Browne, Ryan P., 2019. "Asymmetric clusters and outliers: Mixtures of multivariate contaminated shifted asymmetric Laplace distributions," Computational Statistics & Data Analysis, Elsevier, vol. 132(C), pages 145-166.
    16. Wei Zhao & Saima K Khosa & Zubair Ahmad & Muhammad Aslam & Ahmed Z Afify, 2020. "Type-I heavy tailed family with applications in medicine, engineering and insurance," PLOS ONE, Public Library of Science, vol. 15(8), pages 1-24, August.
    17. Salvatore Ingrassia & Antonio Punzo & Giorgio Vittadini & Simona Minotti, 2015. "Erratum to: The Generalized Linear Mixed Cluster-Weighted Model," Journal of Classification, Springer;The Classification Society, vol. 32(2), pages 327-355, July.
    18. Paolo Berta & Salvatore Ingrassia & Antonio Punzo & Giorgio Vittadini, 2016. "Multilevel cluster-weighted models for the evaluation of hospitals," METRON, Springer;Sapienza Università di Roma, vol. 74(3), pages 275-292, December.
    19. Punzo, Antonio & Bagnato, Luca, 2021. "Modeling the cryptocurrency return distribution via Laplace scale mixtures," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 563(C).
    20. Marcelo Bourguignon & Manoel Santos-Neto & Mário Castro, 2021. "A new regression model for positive random variables with skewed and long tail," METRON, Springer;Sapienza Università di Roma, vol. 79(1), pages 33-55, April.
    21. Bhati, Deepesh & Ravi, Sreenivasan, 2018. "On generalized log-Moyal distribution: A new heavy tailed size distribution," Insurance: Mathematics and Economics, Elsevier, vol. 79(C), pages 247-259.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ouimet, Frédéric, 2021. "Asymptotic properties of Bernstein estimators on the simplex," Journal of Multivariate Analysis, Elsevier, vol. 185(C).
    2. Roberto Mari & Salvatore Ingrassia & Antonio Punzo, 2023. "Local and Overall Deviance R-Squared Measures for Mixtures of Generalized Linear Models," Journal of Classification, Springer;The Classification Society, vol. 40(2), pages 233-266, July.
    3. Jeon, Yongho & Kim, Joseph H.T., 2013. "A gamma kernel density estimation for insurance loss data," Insurance: Mathematics and Economics, Elsevier, vol. 53(3), pages 569-579.
    4. Doho, Libaud Rudy Aurelien & Somé, Sobom Matthieu & Banto, Jean Michel, 2023. "Inflation and west African sectoral stock price indices: An asymmetric kernel method analysis," Emerging Markets Review, Elsevier, vol. 54(C).
    5. Adrian O’Hagan & Arthur White, 2019. "Improved model-based clustering performance using Bayesian initialization averaging," Computational Statistics, Springer, vol. 34(1), pages 201-231, March.
    6. Hirukawa, Masayuki, 2010. "Nonparametric multiplicative bias correction for kernel-type density estimation on the unit interval," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 473-495, February.
    7. Gourieroux, Christian & Lu, Yang, 2019. "Least impulse response estimator for stress test exercises," Journal of Banking & Finance, Elsevier, vol. 103(C), pages 62-77.
    8. Ouimet, Frédéric & Tolosana-Delgado, Raimon, 2022. "Asymptotic properties of Dirichlet kernel density estimators," Journal of Multivariate Analysis, Elsevier, vol. 187(C).
    9. Hagmann, M. & Scaillet, O., 2007. "Local multiplicative bias correction for asymmetric kernel density estimators," Journal of Econometrics, Elsevier, vol. 141(1), pages 213-249, November.
    10. Lebret, Rémi & Iovleff, Serge & Langrognet, Florent & Biernacki, Christophe & Celeux, Gilles & Govaert, Gérard, 2015. "Rmixmod: The R Package of the Model-Based Unsupervised, Supervised, and Semi-Supervised Classification Mixmod Library," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 67(i06).
    11. Fernandes, Marcelo & Grammig, Joachim, 2005. "Nonparametric specification tests for conditional duration models," Journal of Econometrics, Elsevier, vol. 127(1), pages 35-68, July.
    12. Faicel Chamroukhi, 2016. "Piecewise Regression Mixture for Simultaneous Functional Data Clustering and Optimal Segmentation," Journal of Classification, Springer;The Classification Society, vol. 33(3), pages 374-411, October.
    13. Salvatore D. Tomarchio & Antonio Punzo, 2019. "Modelling the loss given default distribution via a family of zero‐and‐one inflated mixture models," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 182(4), pages 1247-1266, October.
    14. Thamayanthi Chellathurai, 2017. "Probability Density Of Recovery Rate Given Default Of A Firm’S Debt And Its Constituent Tranches," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 20(04), pages 1-34, June.
    15. Tomas Konecny & Jakub Seidler & Aelta Belyaeva & Konstantin Belyaev, 2017. "The Time Dimension of the Links Between Loss Given Default and the Macroeconomy," Czech Journal of Economics and Finance (Finance a uver), Charles University Prague, Faculty of Social Sciences, vol. 67(6), pages 462-491, October.
    16. Fabian Dvorak, 2020. "stratEst: Strategy Estimation in R," TWI Research Paper Series 119, Thurgauer Wirtschaftsinstitut, Universität Konstanz.
    17. Song Li & Mervyn J. Silvapulle & Param Silvapulle & Xibin Zhang, 2015. "Bayesian Approaches to Nonparametric Estimation of Densities on the Unit Interval," Econometric Reviews, Taylor & Francis Journals, vol. 34(3), pages 394-412, March.
    18. Bauwens, Luc & Giot, Pierre & Grammig, Joachim & Veredas, David, 2004. "A comparison of financial duration models via density forecasts," International Journal of Forecasting, Elsevier, vol. 20(4), pages 589-609.
    19. Funke, Benedikt & Hirukawa, Masayuki, 2019. "Nonparametric estimation and testing on discontinuity of positive supported densities: a kernel truncation approach," Econometrics and Statistics, Elsevier, vol. 9(C), pages 156-170.
    20. Igarashi, Gaku & Kakizawa, Yoshihide, 2014. "Re-formulation of inverse Gaussian, reciprocal inverse Gaussian, and Birnbaum–Saunders kernel estimators," Statistics & Probability Letters, Elsevier, vol. 84(C), pages 235-246.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:28:y:2013:i:4:p:1571-1597. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.