IDEAS home Printed from https://ideas.repec.org/a/eee/insuma/v117y2024icp182-195.html
   My bibliography  Save this article

Loss modeling with the size-biased lognormal mixture and the entropy regularized EM algorithm

Author

Listed:
  • Bae, Taehan
  • Miljkovic, Tatjana

Abstract

The Erlang mixture with a common scale parameter is one of many popular models for modeling insurance losses. However, the actuarial literature recognizes and discusses some limitations of aforementioned model in approximate heavy-tailed distributions. In this paper, a size-biased left-truncated Lognormal (SB-ltLN) mixture is proposed as a robust alternative to the Erlang mixture for modeling left-truncated insurance losses with a heavy tail. The weak denseness property of the weighted Lognormal mixture is studied along with the tail behavior. Explicit analytical solutions are derived for moments and Tail Value at Risk based on the proposed model. An extension of the regularized expectation–maximization (REM) algorithm with Shannon's entropy weights (ewREM) is introduced for parameter estimation and variability assessment. The Operational Riskdata eXchange's left-truncated internal fraud loss data set is used to illustrate applications of the proposed model. Finally, the results of a simulation study show promising performance of the proposed SB-ltLN mixture in different simulation settings.

Suggested Citation

  • Bae, Taehan & Miljkovic, Tatjana, 2024. "Loss modeling with the size-biased lognormal mixture and the entropy regularized EM algorithm," Insurance: Mathematics and Economics, Elsevier, vol. 117(C), pages 182-195.
  • Handle: RePEc:eee:insuma:v:117:y:2024:i:c:p:182-195
    DOI: 10.1016/j.insmatheco.2024.05.003
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167668724000593
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.insmatheco.2024.05.003?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Reynkens, Tom & Verbelen, Roel & Beirlant, Jan & Antonio, Katrien, 2017. "Modelling censored losses using splicing: A global fit strategy with mixed Erlang and extreme value distributions," Insurance: Mathematics and Economics, Elsevier, vol. 77(C), pages 65-77.
    2. David Scollnik, 2007. "On composite lognormal-Pareto models," Scandinavian Actuarial Journal, Taylor & Francis Journals, vol. 2007(1), pages 20-33.
    3. Gordon Willmot & Jae-Kyung Woo, 2007. "On the Class of Erlang Mixtures with Risk Theoretic Applications," North American Actuarial Journal, Taylor & Francis Journals, vol. 11(2), pages 99-115.
    4. Michel Denuit, 2020. "Size-Biased Risk Measures of Compound Sums," North American Actuarial Journal, Taylor & Francis Journals, vol. 24(4), pages 512-532, October.
    5. Bettina Grün & Tatjana Miljkovic, 2019. "Extending composite loss models using a general framework of advanced computational tools," Scandinavian Actuarial Journal, Taylor & Francis Journals, vol. 2019(8), pages 642-660, September.
    6. Vytaras Brazauskas & Andreas Kleefeld, 2016. "Modeling Severity and Measuring Tail Risk of Norwegian Fire Claims," North American Actuarial Journal, Taylor & Francis Journals, vol. 20(1), pages 1-16, January.
    7. Simon Lee & X. Lin, 2010. "Modeling and Evaluating Insurance Losses Via Mixtures of Erlang Distributions," North American Actuarial Journal, Taylor & Francis Journals, vol. 14(1), pages 107-130.
    8. Enrique Calderín-Ojeda & Chun Fung Kwok, 2016. "Modeling claims data with composite Stoppa models," Scandinavian Actuarial Journal, Taylor & Francis Journals, vol. 2016(9), pages 817-836, October.
    9. Marco Moscadelli, 2004. "The modelling of operational risk: experience with the analysis of the data collected by the Basel Committee," Temi di discussione (Economic working papers) 517, Bank of Italy, Economic Research and International Relations Area.
    10. Yuqian Xu & Michael Pinedo & Mei Xue, 2017. "Operational Risk in Financial Services: A Review and New Research Opportunities," Production and Operations Management, Production and Operations Management Society, vol. 26(3), pages 426-445, March.
    11. Miljkovic, Tatjana & Grün, Bettina, 2016. "Modeling loss data using mixtures of distributions," Insurance: Mathematics and Economics, Elsevier, vol. 70(C), pages 387-396.
    12. Pigeon, Mathieu & Denuit, Michel, 2011. "Composite Lognormal-Pareto model with random threshold," LIDAM Reprints ISBA 2011020, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    13. Tatjana Miljkovic & Bettina Grün, 2021. "Using Model Averaging to Determine Suitable Risk Measure Estimates," North American Actuarial Journal, Taylor & Francis Journals, vol. 25(4), pages 562-579, November.
    14. Blostein, Martin & Miljkovic, Tatjana, 2019. "On modeling left-truncated loss data using mixtures of distributions," Insurance: Mathematics and Economics, Elsevier, vol. 85(C), pages 35-46.
    15. Lysa Porth & Wenjun Zhu & Ken Seng Tan, 2014. "A credibility-based Erlang mixture model for pricing crop reinsurance," Agricultural Finance Review, Emerald Group Publishing Limited, vol. 74(2), pages 162-187, July.
    16. Lee, Simon C.K. & Lin, X. Sheldon, 2012. "Modeling Dependent Risks with Multivariate Erlang Mixtures," ASTIN Bulletin, Cambridge University Press, vol. 42(1), pages 153-180, May.
    17. Denuit, Michel, 2020. "Size-Biased Risk Measures of Compound Sums," LIDAM Reprints ISBA 2020034, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    18. Lysa Porth & Wenjun Zhu & Ken Seng Tan, 2014. "A credibility-based Erlang mixture model for pricing crop reinsurance," Agricultural Finance Review, Emerald Group Publishing Limited, vol. 74(2), pages 162-187, July.
    19. S. Nadarajah & S.A.A. Bakar, 2014. "New composite models for the Danish fire insurance data," Scandinavian Actuarial Journal, Taylor & Francis Journals, vol. 2014(2), pages 180-187.
    20. Biernacki, Christophe & Celeux, Gilles & Govaert, Gerard, 2003. "Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 41(3-4), pages 561-575, January.
    21. Semhar Michael & Tatjana Miljkovic & Volodymyr Melnykov, 2020. "Mixture modeling of data with multiple partial right-censoring levels," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 14(2), pages 355-378, June.
    22. Dankmar Böhning & Ekkehart Dietz & Rainer Schaub & Peter Schlattmann & Bruce Lindsay, 1994. "The distribution of the likelihood ratio for mixtures of densities from the one-parameter exponential family," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 46(2), pages 373-388, June.
    23. Mathieu Pigeon & Michel Denuit, 2011. "Composite Lognormal–Pareto model with random threshold," Scandinavian Actuarial Journal, Taylor & Francis Journals, vol. 2011(3), pages 177-192.
    24. Verbelen, Roel & Gong, Lan & Antonio, Katrien & Badescu, Andrei & Lin, Sheldon, 2015. "Fitting Mixtures Of Erlangs To Censored And Truncated Data Using The Em Algorithm," ASTIN Bulletin, Cambridge University Press, vol. 45(3), pages 729-758, September.
    25. Francisco Venegas-Martínez & José Francisco Martínez-Sánchez & María Teresa V. Martínez-Palacios, 2016. "An analysis on operational risk in international banking: A Bayesian approach (2007–2011)," Estudios Gerenciales, Universidad Icesi, vol. 32(140), pages 208-220, September.
    26. Yin, Cuihong & Lin, X. Sheldon, 2016. "EFFICIENT ESTIMATION OF ERLANG MIXTURES USING iSCAD PENALTY WITH INSURANCE APPLICATION," ASTIN Bulletin, Cambridge University Press, vol. 46(3), pages 779-799, September.
    27. Abu Bakar, S.A. & Hamzah, N.A. & Maghsoudi, M. & Nadarajah, S., 2015. "Modeling loss data using composite models," Insurance: Mathematics and Economics, Elsevier, vol. 61(C), pages 146-154.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Li, Zhengxiao & Wang, Fei & Zhao, Zhengtang, 2024. "A new class of composite GBII regression models with varying threshold for modeling heavy-tailed data," Insurance: Mathematics and Economics, Elsevier, vol. 117(C), pages 45-66.
    2. Reynkens, Tom & Verbelen, Roel & Beirlant, Jan & Antonio, Katrien, 2017. "Modelling censored losses using splicing: A global fit strategy with mixed Erlang and extreme value distributions," Insurance: Mathematics and Economics, Elsevier, vol. 77(C), pages 65-77.
    3. Semhar Michael & Tatjana Miljkovic & Volodymyr Melnykov, 2020. "Mixture modeling of data with multiple partial right-censoring levels," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 14(2), pages 355-378, June.
    4. Blostein, Martin & Miljkovic, Tatjana, 2019. "On modeling left-truncated loss data using mixtures of distributions," Insurance: Mathematics and Economics, Elsevier, vol. 85(C), pages 35-46.
    5. Shi, Yue & Punzo, Antonio & Otneim, Håkon & Maruotti, Antonello, 2023. "Hidden semi-Markov models for rainfall-related insurance claims," Discussion Papers 2023/17, Norwegian School of Economics, Department of Business and Management Science.
    6. Delong, Łukasz & Lindholm, Mathias & Wüthrich, Mario V., 2021. "Gamma Mixture Density Networks and their application to modelling insurance claim amounts," Insurance: Mathematics and Economics, Elsevier, vol. 101(PB), pages 240-261.
    7. Bhati, Deepesh & Ravi, Sreenivasan, 2018. "On generalized log-Moyal distribution: A new heavy tailed size distribution," Insurance: Mathematics and Economics, Elsevier, vol. 79(C), pages 247-259.
    8. Fung, Tsz Chai, 2022. "Maximum weighted likelihood estimator for robust heavy-tail modelling of finite mixture models," Insurance: Mathematics and Economics, Elsevier, vol. 107(C), pages 180-198.
    9. Walena Anesu Marambakuyana & Sandile Charles Shongwe, 2024. "Composite and Mixture Distributions for Heavy-Tailed Data—An Application to Insurance Claims," Mathematics, MDPI, vol. 12(2), pages 1-23, January.
    10. Miljkovic, Tatjana & Grün, Bettina, 2016. "Modeling loss data using mixtures of distributions," Insurance: Mathematics and Economics, Elsevier, vol. 70(C), pages 387-396.
    11. Roel Verbelen & Katrien Antonio & Gerda Claeskens, 2016. "Multivariate mixtures of Erlangs for density estimation under censoring," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 22(3), pages 429-455, July.
    12. Počuča, Nikola & Jevtić, Petar & McNicholas, Paul D. & Miljkovic, Tatjana, 2020. "Modeling frequency and severity of claims with the zero-inflated generalized cluster-weighted models," Insurance: Mathematics and Economics, Elsevier, vol. 94(C), pages 79-93.
    13. Johan René van Dorp & Ekundayo Shittu, 2024. "Two-sided distributions with applications in insurance loss modeling," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 33(3), pages 827-861, July.
    14. Girish Aradhye & George Tzougas & Deepesh Bhati, 2024. "A Copula-Based Bivariate Composite Model for Modelling Claim Costs," Mathematics, MDPI, vol. 12(2), pages 1-17, January.
    15. Yin, Cuihong & Sheldon Lin, X. & Huang, Rongtan & Yuan, Haili, 2019. "On the consistency of penalized MLEs for Erlang mixtures," Statistics & Probability Letters, Elsevier, vol. 145(C), pages 12-20.
    16. Alessandro Staino & Emilio Russo & Massimo Costabile & Arturo Leccadito, 2023. "Minimum capital requirement and portfolio allocation for non-life insurance: a semiparametric model with Conditional Value-at-Risk (CVaR) constraint," Computational Management Science, Springer, vol. 20(1), pages 1-32, December.
    17. Luis Rincón & David J. Santana, 2022. "Ruin Probability for Finite Erlang Mixture Claims Via Recurrence Sequences," Methodology and Computing in Applied Probability, Springer, vol. 24(3), pages 2213-2236, September.
    18. Gildas Ratovomirija, 2015. "Multivariate Stop loss Mixed Erlang Reinsurance risk: Aggregation, Capital allocation and Default risk," Papers 1501.07297, arXiv.org.
    19. Sarra Ghaddab & Manel Kacem & Christian Peretti & Lotfi Belkacem, 2023. "Extreme severity modeling using a GLM-GPD combination: application to an excess of loss reinsurance treaty," Empirical Economics, Springer, vol. 65(3), pages 1105-1127, September.
    20. Marco Bee, 2024. "On discriminating between lognormal and Pareto tail: an unsupervised mixture-based approach," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 18(2), pages 251-269, June.

    More about this item

    Keywords

    Regularized EM algorithm; Shannon entropy; Size-biased mixture; Lognormal distribution; Left-truncated insurance losses;
    All these keywords.

    JEL classification:

    • C02 - Mathematical and Quantitative Methods - - General - - - Mathematical Economics
    • C46 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Specific Distributions
    • C69 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Other

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:insuma:v:117:y:2024:i:c:p:182-195. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/505554 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.