IDEAS home Printed from https://ideas.repec.org/a/spr/advdac/v14y2020i2d10.1007_s11634-020-00391-x.html
   My bibliography  Save this article

Mixture modeling of data with multiple partial right-censoring levels

Author

Listed:
  • Semhar Michael

    (South Dakota State University)

  • Tatjana Miljkovic

    (Miami University)

  • Volodymyr Melnykov

    (University of Alabama)

Abstract

In this paper, a new flexible approach to modeling data with multiple partial right-censoring points is proposed. This method is based on finite mixture models, flexible tool to model heterogeneity in data. A general framework to accommodate partial censoring is considered. In this setting, it is assumed that a certain portion of data points are censored and the rest are not. This situation occurs in many insurance loss data sets. A novel probability function is proposed to be used as a mixture component and the expectation-maximization algorithm is employed for estimating model parameters. The Bayesian information criterion is used for model selection. Additionally, an approach for the variability assessment of parameter estimates as well as the computation of quantiles commonly known as risk measures is considered. The proposed model is evaluated using a simulation study based on four common probability distribution functions used to model right skewed loss data and applied to a real data set with good results.

Suggested Citation

  • Semhar Michael & Tatjana Miljkovic & Volodymyr Melnykov, 2020. "Mixture modeling of data with multiple partial right-censoring levels," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 14(2), pages 355-378, June.
  • Handle: RePEc:spr:advdac:v:14:y:2020:i:2:d:10.1007_s11634-020-00391-x
    DOI: 10.1007/s11634-020-00391-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11634-020-00391-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11634-020-00391-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Balakrishnan, N. & Mitra, Debanjan, 2012. "Left truncated and right censored Weibull data and likelihood inference with an illustration," Computational Statistics & Data Analysis, Elsevier, vol. 56(12), pages 4011-4025.
    2. Laurent Bordes & Didier Chauveau, 2016. "Stochastic EM algorithms for parametric and semiparametric mixture models for right-censored lifetime data," Computational Statistics, Springer, vol. 31(4), pages 1513-1538, December.
    3. Lee, Gyemin & Scott, Clayton, 2012. "EM algorithms for multivariate Gaussian mixture models with truncated and censored data," Computational Statistics & Data Analysis, Elsevier, vol. 56(9), pages 2816-2829.
    4. Simon Lee & X. Lin, 2010. "Modeling and Evaluating Insurance Losses Via Mixtures of Erlang Distributions," North American Actuarial Journal, Taylor & Francis Journals, vol. 14(1), pages 107-130.
    5. Miljkovic, Tatjana & Grün, Bettina, 2016. "Modeling loss data using mixtures of distributions," Insurance: Mathematics and Economics, Elsevier, vol. 70(C), pages 387-396.
    6. Klugman, Stuart A. & Parsa, Rahul, 1999. "Fitting bivariate loss distributions with copulas," Insurance: Mathematics and Economics, Elsevier, vol. 24(1-2), pages 139-148, March.
    7. Pigeon, Mathieu & Denuit, Michel, 2011. "Composite Lognormal-Pareto model with random threshold," LIDAM Reprints ISBA 2011020, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    8. Semhar Michael & Volodymyr Melnykov, 2016. "An effective strategy for initializing the EM algorithm in finite mixture models," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 10(4), pages 563-583, December.
    9. Blostein, Martin & Miljkovic, Tatjana, 2019. "On modeling left-truncated loss data using mixtures of distributions," Insurance: Mathematics and Economics, Elsevier, vol. 85(C), pages 35-46.
    10. Resnick, Sidney I., 1997. "Discussion of the Danish Data on Large Fire Insurance Losses," ASTIN Bulletin, Cambridge University Press, vol. 27(1), pages 139-151, May.
    11. Lawrence Hubert & Phipps Arabie, 1985. "Comparing partitions," Journal of Classification, Springer;The Classification Society, vol. 2(1), pages 193-218, December.
    12. Biernacki, Christophe & Celeux, Gilles & Govaert, Gerard, 2003. "Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 41(3-4), pages 561-575, January.
    13. Edward Frees & Emiliano Valdez, 1998. "Understanding Relationships Using Copulas," North American Actuarial Journal, Taylor & Francis Journals, vol. 2(1), pages 1-25.
    14. McNeil, Alexander J., 1997. "Estimating the Tails of Loss Severity Distributions Using Extreme Value Theory," ASTIN Bulletin, Cambridge University Press, vol. 27(1), pages 117-137, May.
    15. Verbelen, Roel & Gong, Lan & Antonio, Katrien & Badescu, Andrei & Lin, Sheldon, 2015. "Fitting Mixtures Of Erlangs To Censored And Truncated Data Using The Em Algorithm," ASTIN Bulletin, Cambridge University Press, vol. 45(3), pages 729-758, September.
    16. Abu Bakar, S.A. & Hamzah, N.A. & Maghsoudi, M. & Nadarajah, S., 2015. "Modeling loss data using composite models," Insurance: Mathematics and Economics, Elsevier, vol. 61(C), pages 146-154.
    17. Sun, Zhihua & Ye, Xue & Sun, Liuquan, 2018. "Consistent test for parametric models with right-censored data using projections," Computational Statistics & Data Analysis, Elsevier, vol. 118(C), pages 112-125.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Miljkovic, Tatjana & Grün, Bettina, 2016. "Modeling loss data using mixtures of distributions," Insurance: Mathematics and Economics, Elsevier, vol. 70(C), pages 387-396.
    2. Reynkens, Tom & Verbelen, Roel & Beirlant, Jan & Antonio, Katrien, 2017. "Modelling censored losses using splicing: A global fit strategy with mixed Erlang and extreme value distributions," Insurance: Mathematics and Economics, Elsevier, vol. 77(C), pages 65-77.
    3. Bhati, Deepesh & Ravi, Sreenivasan, 2018. "On generalized log-Moyal distribution: A new heavy tailed size distribution," Insurance: Mathematics and Economics, Elsevier, vol. 79(C), pages 247-259.
    4. Fung, Tsz Chai, 2022. "Maximum weighted likelihood estimator for robust heavy-tail modelling of finite mixture models," Insurance: Mathematics and Economics, Elsevier, vol. 107(C), pages 180-198.
    5. Blostein, Martin & Miljkovic, Tatjana, 2019. "On modeling left-truncated loss data using mixtures of distributions," Insurance: Mathematics and Economics, Elsevier, vol. 85(C), pages 35-46.
    6. Počuča, Nikola & Jevtić, Petar & McNicholas, Paul D. & Miljkovic, Tatjana, 2020. "Modeling frequency and severity of claims with the zero-inflated generalized cluster-weighted models," Insurance: Mathematics and Economics, Elsevier, vol. 94(C), pages 79-93.
    7. Eling, Martin, 2012. "Fitting insurance claims to skewed distributions: Are the skew-normal and skew-student good models?," Insurance: Mathematics and Economics, Elsevier, vol. 51(2), pages 239-248.
    8. Shi, Yue & Punzo, Antonio & Otneim, Håkon & Maruotti, Antonello, 2023. "Hidden semi-Markov models for rainfall-related insurance claims," Discussion Papers 2023/17, Norwegian School of Economics, Department of Business and Management Science.
    9. Delong, Łukasz & Lindholm, Mathias & Wüthrich, Mario V., 2021. "Gamma Mixture Density Networks and their application to modelling insurance claim amounts," Insurance: Mathematics and Economics, Elsevier, vol. 101(PB), pages 240-261.
    10. S. A. Abu Bakar & Saralees Nadarajah & Z. A. Absl Kamarul Adzhar, 2018. "Loss modeling using Burr mixtures," Empirical Economics, Springer, vol. 54(4), pages 1503-1516, June.
    11. Athanasios Sachlas & Takis Papaioannou, 2014. "Residual and Past Entropy in Actuarial Science and Survival Models," Methodology and Computing in Applied Probability, Springer, vol. 16(1), pages 79-99, March.
    12. Di Bernardino, Elena & Rullière, Didier, 2013. "Distortions of multivariate distribution functions and associated level curves: Applications in multivariate risk theory," Insurance: Mathematics and Economics, Elsevier, vol. 53(1), pages 190-205.
    13. Roel Verbelen & Katrien Antonio & Gerda Claeskens, 2016. "Multivariate mixtures of Erlangs for density estimation under censoring," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 22(3), pages 429-455, July.
    14. Elena Di Bernardino & Didier Rullière, 2012. "Distortions of multivariate risk measures: a level-sets based approach," Working Papers hal-00756387, HAL.
    15. Ahmed Z. Afify & Ahmed M. Gemeay & Noor Akma Ibrahim, 2020. "The Heavy-Tailed Exponential Distribution: Risk Measures, Estimation, and Application to Actuarial Data," Mathematics, MDPI, vol. 8(8), pages 1-28, August.
    16. repec:hal:wpaper:hal-00750873 is not listed on IDEAS
    17. Abu Bakar, S.A. & Hamzah, N.A. & Maghsoudi, M. & Nadarajah, S., 2015. "Modeling loss data using composite models," Insurance: Mathematics and Economics, Elsevier, vol. 61(C), pages 146-154.
    18. Rocco Roberto Cerchiara & Francesco Acri, 2020. "Estimating the Volatility of Non-Life Premium Risk Under Solvency II: Discussion of Danish Fire Insurance Data," Risks, MDPI, vol. 8(3), pages 1-19, July.
    19. Punzo, Antonio & Bagnato, Luca & Maruotti, Antonello, 2018. "Compound unimodal distributions for insurance losses," Insurance: Mathematics and Economics, Elsevier, vol. 81(C), pages 95-107.
    20. Verschuren, Robert Matthijs, 2022. "Frequency-severity experience rating based on latent Markovian risk profiles," Insurance: Mathematics and Economics, Elsevier, vol. 107(C), pages 379-392.
    21. Zhu, Xuwen & Melnykov, Volodymyr, 2018. "Manly transformation in finite mixture modeling," Computational Statistics & Data Analysis, Elsevier, vol. 121(C), pages 190-208.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:advdac:v:14:y:2020:i:2:d:10.1007_s11634-020-00391-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.