IDEAS home Printed from https://ideas.repec.org/a/gam/jrisks/v8y2020i1p19-d322684.html
   My bibliography  Save this article

Delta Boosting Implementation of Negative Binomial Regression in Actuarial Pricing

Author

Listed:
  • Simon CK Lee

    (Department of Statistics and Actuarial Science, The University of Hong Kong, Pokfulam Road, Hong Kong)

Abstract

This study proposes an efficacious approach to analyze the over-dispersed insurance frequency data as it is imperative for the insurers to have decisive informative insights for precisely underwriting and pricing insurance products, retaining existing customer base and gaining an edge in the highly competitive retail insurance market. The delta boosting implementation of the negative binomial regression, both by one-parameter estimation and a novel two-parameter estimation, was tested on the empirical data. Accurate parameter estimation of the negative binomial regression is complicated with considerations of incomplete insurance exposures, negative convexity, and co-linearity. The issues mainly originate from the unique nature of insurance operations and the adoption of distribution outside the exponential family. We studied how the issues could significantly impact the quality of estimation. In addition to a novel approach to simultaneously estimate two parameters in regression through boosting, we further enrich the study by proposing an alteration of the base algorithm to address the problems. The algorithm was able to withstand the competition against popular regression methodologies in a real-life dataset. Common diagnostics were applied to compare the performance of the relevant candidates, leading to our conclusion to move from light-tail Poisson to negative binomial for over-dispersed data, from generalized linear model (GLM) to boosting for non-linear and interaction patterns, from one-parameter to two-parameter estimation to reflect more closely the reality.

Suggested Citation

  • Simon CK Lee, 2020. "Delta Boosting Implementation of Negative Binomial Regression in Actuarial Pricing," Risks, MDPI, vol. 8(1), pages 1-21, February.
  • Handle: RePEc:gam:jrisks:v:8:y:2020:i:1:p:19-:d:322684
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-9091/8/1/19/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-9091/8/1/19/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. David Mihaela & Jemna Dănuţ-Vasile, 2015. "Modeling the Frequency of Auto Insurance Claims by Means of Poisson and Negative Binomial Models," Scientific Annals of Economics and Business, Sciendo, vol. 62(2), pages 151-168, July.
    2. Maximilien Baudry & Christian Y. Robert, 2019. "A machine learning approach for individual claims reserving in insurance," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 35(5), pages 1127-1155, September.
    3. Jean‐Philippe Boucher & Michel Denuit & Montserrat Guillen, 2009. "Number of Accidents or Number of Claims? An Approach with Zero‐Inflated Poisson Models for Panel Data," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 76(4), pages 821-846, December.
    4. Jozef L. Teugels & Petra Vynckier, 1996. "The structure distribution in a mixed Poisson process," International Journal of Stochastic Analysis, Hindawi, vol. 9, pages 1-8, January.
    5. de Jong,Piet & Heller,Gillian Z., 2008. "Generalized Linear Models for Insurance Data," Cambridge Books, Cambridge University Press, number 9780521879149, September.
    6. Yip, Karen C.H. & Yau, Kelvin K.W., 2005. "On modeling claim frequency data in general insurance with extra zeros," Insurance: Mathematics and Economics, Elsevier, vol. 36(2), pages 153-163, April.
    7. Simon C. K. Lee & Sheldon Lin, 2018. "Delta Boosting Machine with Application to General Insurance," North American Actuarial Journal, Taylor & Francis Journals, vol. 22(3), pages 405-425, July.
    8. Kevin Kuo, 2018. "DeepTriangle: A Deep Learning Approach to Loss Reserving," Papers 1804.09253, arXiv.org, revised Sep 2019.
    9. Yi Yang & Wei Qian & Hui Zou, 2018. "Insurance Premium Prediction via Gradient Tree-Boosted Tweedie Compound Poisson Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 36(3), pages 456-470, July.
    10. David Scollnik, 2001. "Actuarial Modeling with MCMC and BUGs," North American Actuarial Journal, Taylor & Francis Journals, vol. 5(2), pages 96-124.
    11. Kevin Kuo, 2019. "DeepTriangle: A Deep Learning Approach to Loss Reserving," Risks, MDPI, vol. 7(3), pages 1-12, September.
    12. Greg Taylor, 2019. "Loss Reserving Models: Granular and Machine Learning Forms," Risks, MDPI, vol. 7(3), pages 1-18, July.
    13. Mario V. Wüthrich, 2018. "Machine learning in individual claims reserving," Scandinavian Actuarial Journal, Taylor & Francis Journals, vol. 2018(6), pages 465-480, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christopher Blier-Wong & Hélène Cossette & Luc Lamontagne & Etienne Marceau, 2020. "Machine Learning in P&C Insurance: A Review for Pricing and Reserving," Risks, MDPI, vol. 9(1), pages 1-26, December.
    2. Stephan M. Bischofberger, 2020. "In-Sample Hazard Forecasting Based on Survival Models with Operational Time," Risks, MDPI, vol. 8(1), pages 1-17, January.
    3. Łukasz Delong & Mario V. Wüthrich, 2020. "Neural Networks for the Joint Development of Individual Payments and Claim Incurred," Risks, MDPI, vol. 8(2), pages 1-34, April.
    4. Greg Taylor, 2019. "Risks Special Issue on “Granular Models and Machine Learning Models”," Risks, MDPI, vol. 8(1), pages 1-2, December.
    5. Kevin Kuo & Daniel Lupton, 2020. "Towards Explainability of Machine Learning Models in Insurance Pricing," Papers 2003.10674, arXiv.org.
    6. Avanzi, Benjamin & Taylor, Greg & Wang, Melantha & Wong, Bernard, 2021. "SynthETIC: An individual insurance claim simulator with feature control," Insurance: Mathematics and Economics, Elsevier, vol. 100(C), pages 296-308.
    7. Payandeh Najafabadi Amir T. & MohammadPour Saeed, 2018. "A k-Inflated Negative Binomial Mixture Regression Model: Application to Rate–Making Systems," Asia-Pacific Journal of Risk and Insurance, De Gruyter, vol. 12(2), pages 1-31, July.
    8. Francis Duval & Mathieu Pigeon, 2019. "Individual Loss Reserving Using a Gradient Boosting-Based Approach," Risks, MDPI, vol. 7(3), pages 1-18, July.
    9. Benjamin Avanzi & Yanfeng Li & Bernard Wong & Alan Xian, 2022. "Ensemble distributional forecasting for insurance loss reserving," Papers 2206.08541, arXiv.org, revised Jun 2024.
    10. Xu, Shuzhe & Zhang, Chuanlong & Hong, Don, 2022. "BERT-based NLP techniques for classification and severity modeling in basic warranty data study," Insurance: Mathematics and Economics, Elsevier, vol. 107(C), pages 57-67.
    11. Mihaela Covrig & Iulian Mircea & Gheorghita Zbaganu & Alexandru Coser & Alexandru Tindeche, 2015. "Using R In Generalized Linear Models," Romanian Statistical Review, Romanian Statistical Review, vol. 63(3), pages 33-45, September.
    12. Zhao, Xiaobing & Zhou, Xian, 2012. "Copula models for insurance claim numbers with excess zeros and time-dependence," Insurance: Mathematics and Economics, Elsevier, vol. 50(1), pages 191-199.
    13. Zhiyu Quan & Changyue Hu & Panyi Dong & Emiliano A. Valdez, 2024. "Improving Business Insurance Loss Models by Leveraging InsurTech Innovation," Papers 2401.16723, arXiv.org.
    14. Marjan Qazvini, 2019. "On the Validation of Claims with Excess Zeros in Liability Insurance: A Comparative Study," Risks, MDPI, vol. 7(3), pages 1-17, June.
    15. Xuejun Jiang & Yunxian Li & Aijun Yang & Ruowei Zhou, 2020. "Bayesian semiparametric quantile regression modeling for estimating earthquake fatality risk," Empirical Economics, Springer, vol. 58(5), pages 2085-2103, May.
    16. Zhengmin Duan & Yonglian Chang & Qi Wang & Tianyao Chen & Qing Zhao, 2018. "A Logistic Regression Based Auto Insurance Rate-Making Model Designed for the Insurance Rate Reform," IJFS, MDPI, vol. 6(1), pages 1-16, February.
    17. Tzougas, George & Hoon, W. L. & Lim, J. M., 2019. "The negative binomial-inverse Gaussian regression model with an application to insurance ratemaking," LSE Research Online Documents on Economics 101728, London School of Economics and Political Science, LSE Library.
    18. Lee, Woojoo & Kim, Jeonghwan & Ahn, Jae Youn, 2020. "The Poisson random effect model for experience ratemaking: Limitations and alternative solutions," Insurance: Mathematics and Economics, Elsevier, vol. 91(C), pages 26-36.
    19. Shengkun Xie & Anna T. Lawniczak, 2018. "Estimating Major Risk Factor Relativities in Rate Filings Using Generalized Linear Models," IJFS, MDPI, vol. 6(4), pages 1-14, October.
    20. Shengkun Xie & Rebecca Luo, 2022. "Measuring Variable Importance in Generalized Linear Models for Modeling Size of Loss Distributions," Mathematics, MDPI, vol. 10(10), pages 1-19, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jrisks:v:8:y:2020:i:1:p:19-:d:322684. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.