IDEAS home Printed from https://ideas.repec.org/a/eee/insuma/v93y2020icp84-94.html
   My bibliography  Save this article

A Bayesian nonparametric model and its application in insurance loss prediction

Author

Listed:
  • Huang, Yifan
  • Meng, Shengwang

Abstract

Predicting insurance losses is an eternal focus of actuarial science in the insurance sector. Due to the existence of complicated features such as skewness, heavy tail, and multi-modality, traditional parametric models are often inadequate to describe the distribution of losses, calling for a mature application of Bayesian methods. In this study we explore a Gaussian mixture model based on Dirichlet process priors. Using three automobile insurance datasets, we employ the probit stick-breaking method to incorporate the effect of covariates into the weight of the mixture component, improve its hierarchical structure, and propose a Bayesian nonparametric model that can identify the unique regression pattern of different samples. Moreover, an advanced updating algorithm of slice sampling is integrated to apply an improved approximation to the infinite mixture model. We compare our framework with four common regression techniques: three generalized linear models and a dependent Dirichlet process ANOVA model. The empirical results show that the proposed framework flexibly characterizes the actual loss distribution in the insurance datasets and demonstrates superior performance in the accuracy of data fitting and extrapolating predictions, thus greatly extending the application of Bayesian methods in the insurance sector.

Suggested Citation

  • Huang, Yifan & Meng, Shengwang, 2020. "A Bayesian nonparametric model and its application in insurance loss prediction," Insurance: Mathematics and Economics, Elsevier, vol. 93(C), pages 84-94.
  • Handle: RePEc:eee:insuma:v:93:y:2020:i:c:p:84-94
    DOI: 10.1016/j.insmatheco.2020.04.010
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167668720300548
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.insmatheco.2020.04.010?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Omiros Papaspiliopoulos & Gareth O. Roberts, 2008. "Retrospective Markov chain Monte Carlo methods for Dirichlet process hierarchical models," Biometrika, Biometrika Trust, vol. 95(1), pages 169-186.
    2. De Iorio, Maria & Muller, Peter & Rosner, Gary L. & MacEachern, Steven N., 2004. "An ANOVA Model for Dependent Random Measures," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 205-215, January.
    3. Isadora Antoniano-Villalobos & Sara Wade & Stephen G. Walker, 2014. "A Bayesian Nonparametric Regression Model With Normalized Weights: A Study of Hippocampal Atrophy in Alzheimer's Disease," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(506), pages 477-490, June.
    4. Cummins, J. David & Dionne, Georges & McDonald, James B. & Pritchett, B. Michael, 1990. "Applications of the GB2 family of distributions in modeling insurance loss processes," Insurance: Mathematics and Economics, Elsevier, vol. 9(4), pages 257-272, December.
    5. Fellingham, Gilbert W. & Kottas, Athanasios & Hartman, Brian M., 2015. "Bayesian nonparametric predictive modeling of group health claims," Insurance: Mathematics and Economics, Elsevier, vol. 60(C), pages 1-10.
    6. de Jong,Piet & Heller,Gillian Z., 2008. "Generalized Linear Models for Insurance Data," Cambridge Books, Cambridge University Press, number 9780521879149.
    7. Richardson, Robert & Hartman, Brian, 2018. "Bayesian nonparametric regression models for modeling and predicting healthcare claims," Insurance: Mathematics and Economics, Elsevier, vol. 83(C), pages 1-8.
    8. Chung, Yeonseung & Dunson, David B., 2009. "Nonparametric Bayes Conditional Distribution Modeling With Variable Selection," Journal of the American Statistical Association, American Statistical Association, vol. 104(488), pages 1646-1660.
    9. Griffin, J.E. & Steel, M.F.J., 2006. "Order-Based Dependent Dirichlet Processes," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 179-194, March.
    10. Stasinopoulos, D. Mikis & Rigby, Robert A., 2007. "Generalized Additive Models for Location Scale and Shape (GAMLSS) in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 23(i07).
    11. David B. Dunson & Ju-Hyun Park, 2008. "Kernel stick-breaking processes," Biometrika, Biometrika Trust, vol. 95(2), pages 307-323.
    12. David B. Dunson & Natesh Pillai & Ju‐Hyun Park, 2007. "Bayesian density regression," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(2), pages 163-183, April.
    13. Gneiting, Tilmann & Raftery, Adrian E., 2007. "Strictly Proper Scoring Rules, Prediction, and Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 359-378, March.
    14. Liang Hong & Ryan Martin, 2017. "A Flexible Bayesian Nonparametric Model for Predicting Future Insurance Claims," North American Actuarial Journal, Taylor & Francis Journals, vol. 21(2), pages 228-241, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Minkun Kim & David Lindberg & Martin Crane & Marija Bezbradica, 2023. "Dirichlet Process Log Skew-Normal Mixture with a Missing-at-Random-Covariate in Insurance Claim Analysis," Econometrics, MDPI, vol. 11(4), pages 1-32, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pati, Debdeep & Dunson, David B. & Tokdar, Surya T., 2013. "Posterior consistency in conditional distribution estimation," Journal of Multivariate Analysis, Elsevier, vol. 116(C), pages 456-472.
    2. Debdeep Pati & David Dunson, 2014. "Bayesian nonparametric regression with varying residual density," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 66(1), pages 1-31, February.
    3. Richardson, Robert & Hartman, Brian, 2018. "Bayesian nonparametric regression models for modeling and predicting healthcare claims," Insurance: Mathematics and Economics, Elsevier, vol. 83(C), pages 1-8.
    4. Pelenis, Justinas, 2014. "Bayesian regression with heteroscedastic error density and parametric mean function," Journal of Econometrics, Elsevier, vol. 178(P3), pages 624-638.
    5. Pelenis, Justinas, 2012. "Bayesian Semiparametric Regression," Economics Series 285, Institute for Advanced Studies.
    6. Stefano Favaro & Antonio Lijoi & Igor Prünster, 2012. "On the stick–breaking representation of normalized inverse Gaussian priors," DEM Working Papers Series 008, University of Pavia, Department of Economics and Management.
    7. Fuentes-García, Ruth & Mena, Ramsés H. & Walker, Stephen G., 2009. "A nonparametric dependent process for Bayesian regression," Statistics & Probability Letters, Elsevier, vol. 79(8), pages 1112-1119, April.
    8. Yeonseung Chung & David Dunson, 2011. "The local Dirichlet process," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 63(1), pages 59-80, February.
    9. Norets, Andriy, 2015. "Bayesian regression with nonparametric heteroskedasticity," Journal of Econometrics, Elsevier, vol. 185(2), pages 409-419.
    10. Gutiérrez, Luis & Mena, Ramsés H. & Ruggiero, Matteo, 2016. "A time dependent Bayesian nonparametric model for air quality analysis," Computational Statistics & Data Analysis, Elsevier, vol. 95(C), pages 161-175.
    11. Miller, Jeffrey W., 2019. "An elementary derivation of the Chinese restaurant process from Sethuraman’s stick-breaking process," Statistics & Probability Letters, Elsevier, vol. 146(C), pages 112-117.
    12. Norets, Andriy & Pelenis, Justinas, 2012. "Bayesian modeling of joint and conditional distributions," Journal of Econometrics, Elsevier, vol. 168(2), pages 332-346.
    13. Igari, Ryosuke & Hoshino, Takahiro, 2018. "A Bayesian data combination approach for repeated durations under unobserved missing indicators: Application to interpurchase-timing in marketing," Computational Statistics & Data Analysis, Elsevier, vol. 126(C), pages 150-166.
    14. Michael L. Pennell & David B. Dunson, 2008. "Nonparametric Bayes Testing of Changes in a Response Distribution with an Ordinal Predictor," Biometrics, The International Biometric Society, vol. 64(2), pages 413-423, June.
    15. Hatjispyros, Spyridon J. & Nicoleris, Theodoros & Walker, Stephen G., 2016. "Random density functions with common atoms and pairwise dependence," Computational Statistics & Data Analysis, Elsevier, vol. 101(C), pages 236-249.
    16. Bassetti, Federico & Casarin, Roberto & Leisen, Fabrizio, 2014. "Beta-product dependent Pitman–Yor processes for Bayesian inference," Journal of Econometrics, Elsevier, vol. 180(1), pages 49-72.
    17. Maria De Iorio & Wesley O. Johnson & Peter Müller & Gary L. Rosner, 2009. "Bayesian Nonparametric Nonproportional Hazards Survival Modeling," Biometrics, The International Biometric Society, vol. 65(3), pages 762-771, September.
    18. Griffin, J.E. & Steel, M.F.J., 2011. "Stick-breaking autoregressive processes," Journal of Econometrics, Elsevier, vol. 162(2), pages 383-396, June.
    19. Barrientos, Andrés F. & Canale, Antonio, 2021. "A Bayesian goodness-of-fit test for regression," Computational Statistics & Data Analysis, Elsevier, vol. 155(C).
    20. repec:jss:jstsof:40:i05 is not listed on IDEAS
    21. Hatjispyros, Spyridon J. & Merkatas, Christos & Nicoleris, Theodoros & Walker, Stephen G., 2018. "Dependent mixtures of geometric weights priors," Computational Statistics & Data Analysis, Elsevier, vol. 119(C), pages 1-18.

    More about this item

    Keywords

    Bayesian nonparametric model; Dirichlet process mixture; Probit stick-breaking; Slice sampling; Nonzero loss;
    All these keywords.

    JEL classification:

    • C11 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Bayesian Analysis: General
    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C46 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Specific Distributions
    • G22 - Financial Economics - - Financial Institutions and Services - - - Insurance; Insurance Companies; Actuarial Studies

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:insuma:v:93:y:2020:i:c:p:84-94. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/505554 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.