IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v41y2026i2d10.1007_s00180-026-01723-9.html

Estimation and model selection of heterogeneous mixture distributions: an ECME algorithm-based approach

Author

Listed:
  • Marco Bee

    (University of Trento, Department of Economics and Management)

Abstract

Heterogeneous mixtures and spliced distributions are commonly used models for fitting skewed data, especially in loss distribution analysis. While they guarantee additional flexibility, their estimation is typically rather difficult. In this paper we develop an Expectation Conditional Maximization Either (ECME) algorithm that allows one to estimate the parameters in a completely unsupervised manner. We also propose a likelihood-ratio test for selecting between a pure lognormal and a lognormal-Pareto distribution. With respect to a recently developed profile-likelihood method, the approach has similar statistical efficiency, but is considerably faster, thus overcoming the strong computational limitations of the profile-likelihood technique. Simulation experiments and an application to vehicle insurance illustrate the effectiveness of the estimation and testing procedures. All the codes for simulating and estimating the lognormal-Pareto mixture using the methods developed in this paper are available in the LNPar R package.

Suggested Citation

  • Marco Bee, 2026. "Estimation and model selection of heterogeneous mixture distributions: an ECME algorithm-based approach," Computational Statistics, Springer, vol. 41(2), pages 1-20, February.
  • Handle: RePEc:spr:compst:v:41:y:2026:i:2:d:10.1007_s00180-026-01723-9
    DOI: 10.1007/s00180-026-01723-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-026-01723-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-026-01723-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Majda Benzidia & Michel Lubrano, 2020. "A Bayesian look at American academic wages: From wage dispersion to wage compression," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 18(2), pages 213-238, June.
    2. David Scollnik, 2007. "On composite lognormal-Pareto models," Scandinavian Actuarial Journal, Taylor & Francis Journals, vol. 2007(1), pages 20-33.
    3. Y. Malevergne & V. Pisarenko & D. Sornette, 2009. "Gibrat’s law for cities: uniformly most powerful unbiased test of the Pareto against the lognormal," Swiss Finance Institute Research Paper Series 09-40, Swiss Finance Institute.
    4. Hernán D. Rozenfeld & Diego Rybski & Xavier Gabaix & Hernán A. Makse, 2011. "The Area and Population of Cities: New Insights from a Different Perspective on Cities," American Economic Review, American Economic Association, vol. 101(5), pages 2205-2225, August.
    5. Bettina Grün & Tatjana Miljkovic, 2019. "Extending composite loss models using a general framework of advanced computational tools," Scandinavian Actuarial Journal, Taylor & Francis Journals, vol. 2019(8), pages 642-660, September.
    6. Guojun Gan & Emiliano A. Valdez, 2018. "Fat-Tailed Regression Modeling with Spliced Distributions," North American Actuarial Journal, Taylor & Francis Journals, vol. 22(4), pages 554-573, October.
    7. Miljkovic, Tatjana & Grün, Bettina, 2016. "Modeling loss data using mixtures of distributions," Insurance: Mathematics and Economics, Elsevier, vol. 70(C), pages 387-396.
    8. Marco Bee & Massimo Riccaboni & Stefano Schiavo, 2011. "Pareto versus lognormal: a maximum entropy test," Department of Economics Working Papers 1102, Department of Economics, University of Trento, Italia.
    9. Jun Ma & Hugo Jales & Zhengfei Yu, 2020. "Minimum Contrast Empirical Likelihood Inference of Discontinuity in Density," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 38(4), pages 934-950, October.
    10. Reynkens, Tom & Verbelen, Roel & Beirlant, Jan & Antonio, Katrien, 2017. "Modelling censored losses using splicing: A global fit strategy with mixed Erlang and extreme value distributions," Insurance: Mathematics and Economics, Elsevier, vol. 77(C), pages 65-77.
    11. Abu Bakar, S.A. & Hamzah, N.A. & Maghsoudi, M. & Nadarajah, S., 2015. "Modeling loss data using composite models," Insurance: Mathematics and Economics, Elsevier, vol. 61(C), pages 146-154.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Li, Zhengxiao & Wang, Fei & Zhao, Zhengtang, 2024. "A new class of composite GBII regression models with varying threshold for modeling heavy-tailed data," Insurance: Mathematics and Economics, Elsevier, vol. 117(C), pages 45-66.
    2. Marco Bee, 2024. "On discriminating between lognormal and Pareto tail: an unsupervised mixture-based approach," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 18(2), pages 251-269, June.
    3. Bae, Taehan & Miljkovic, Tatjana, 2024. "Loss modeling with the size-biased lognormal mixture and the entropy regularized EM algorithm," Insurance: Mathematics and Economics, Elsevier, vol. 117(C), pages 182-195.
    4. Marco Bee, 2020. "On discriminating between lognormal and Pareto tail: A mixture-based approach," DEM Working Papers 2020/9, Department of Economics and Management.
    5. Hansjoerg Albrecher & Jan Beirlant, 2025. "Statistics of Extremes for the Insurance Industry," Papers 2511.22272, arXiv.org.
    6. Bhati, Deepesh & Ravi, Sreenivasan, 2018. "On generalized log-Moyal distribution: A new heavy tailed size distribution," Insurance: Mathematics and Economics, Elsevier, vol. 79(C), pages 247-259.
    7. Bee, Marco, 2025. "A parsimonious dynamic mixture for heavy-tailed distributions," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 230(C), pages 193-206.
    8. Blostein, Martin & Miljkovic, Tatjana, 2019. "On modeling left-truncated loss data using mixtures of distributions," Insurance: Mathematics and Economics, Elsevier, vol. 85(C), pages 35-46.
    9. Girish Aradhye & George Tzougas & Deepesh Bhati, 2024. "A Copula-Based Bivariate Composite Model for Modelling Claim Costs," Mathematics, MDPI, vol. 12(2), pages 1-17, January.
    10. Walena Anesu Marambakuyana & Sandile Charles Shongwe, 2024. "Composite and Mixture Distributions for Heavy-Tailed Data—An Application to Insurance Claims," Mathematics, MDPI, vol. 12(2), pages 1-23, January.
    11. Gilberto Seravalli, 2016. "Dimensioni e crescita delle citt? in Europa: l?incertezza danneggia soprattutto le citt? medie," SCIENZE REGIONALI, FrancoAngeli Editore, vol. 2016(2), pages 91-108.
    12. Patrick Osatohanmwen, 2025. "A new two-component hybrid model for highly right-skewed data: estimation algorithm and application to finance and rainfall data," BEMPS - Bozen Economics & Management Paper Series BEMPS108, Faculty of Economics and Management at the Free University of Bozen.
    13. Shi, Yue & Punzo, Antonio & Otneim, Håkon & Maruotti, Antonello, 2025. "Hidden semi-Markov models for rainfall-related insurance claims," Insurance: Mathematics and Economics, Elsevier, vol. 120(C), pages 91-106.
    14. Shi, Yue & Punzo, Antonio & Otneim, Håkon & Maruotti, Antonello, 2023. "Hidden semi-Markov models for rainfall-related insurance claims," Discussion Papers 2023/17, Norwegian School of Economics, Department of Business and Management Science.
    15. Reynkens, Tom & Verbelen, Roel & Beirlant, Jan & Antonio, Katrien, 2017. "Modelling censored losses using splicing: A global fit strategy with mixed Erlang and extreme value distributions," Insurance: Mathematics and Economics, Elsevier, vol. 77(C), pages 65-77.
    16. Jakub Growiec & Fabio Pammolli & Massimo Riccaboni, 2020. "Innovation and Corporate Dynamics: A Theoretical Framework," Central European Journal of Economic Modelling and Econometrics, Central European Journal of Economic Modelling and Econometrics, vol. 12(1), pages 1-45, March.
    17. Počuča, Nikola & Jevtić, Petar & McNicholas, Paul D. & Miljkovic, Tatjana, 2020. "Modeling frequency and severity of claims with the zero-inflated generalized cluster-weighted models," Insurance: Mathematics and Economics, Elsevier, vol. 94(C), pages 79-93.
    18. Jovanovic, Franck & Schinckus, Christophe, 2017. "Econophysics and Financial Economics: An Emerging Dialogue," OUP Catalogue, Oxford University Press, number 9780190205034.
    19. Ahmed Z. Afify & Ahmed M. Gemeay & Noor Akma Ibrahim, 2020. "The Heavy-Tailed Exponential Distribution: Risk Measures, Estimation, and Application to Actuarial Data," Mathematics, MDPI, vol. 8(8), pages 1-28, August.
    20. Shahid Mohammad & Kahadawala Cooray, 2025. "Parameter Reduction of Complex Distributions Using a Tuning Method with Applications to Actuarial Data," International Journal of Statistics and Probability, Canadian Center of Science and Education, vol. 14(3), pages 1-42, October.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:41:y:2026:i:2:d:10.1007_s00180-026-01723-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.