IDEAS home Printed from https://ideas.repec.org/a/gam/jrisks/v8y2020i1p19-d322684.html
   My bibliography  Save this article

Delta Boosting Implementation of Negative Binomial Regression in Actuarial Pricing

Author

Listed:
  • Simon CK Lee

    (Department of Statistics and Actuarial Science, The University of Hong Kong, Pokfulam Road, Hong Kong)

Abstract

This study proposes an efficacious approach to analyze the over-dispersed insurance frequency data as it is imperative for the insurers to have decisive informative insights for precisely underwriting and pricing insurance products, retaining existing customer base and gaining an edge in the highly competitive retail insurance market. The delta boosting implementation of the negative binomial regression, both by one-parameter estimation and a novel two-parameter estimation, was tested on the empirical data. Accurate parameter estimation of the negative binomial regression is complicated with considerations of incomplete insurance exposures, negative convexity, and co-linearity. The issues mainly originate from the unique nature of insurance operations and the adoption of distribution outside the exponential family. We studied how the issues could significantly impact the quality of estimation. In addition to a novel approach to simultaneously estimate two parameters in regression through boosting, we further enrich the study by proposing an alteration of the base algorithm to address the problems. The algorithm was able to withstand the competition against popular regression methodologies in a real-life dataset. Common diagnostics were applied to compare the performance of the relevant candidates, leading to our conclusion to move from light-tail Poisson to negative binomial for over-dispersed data, from generalized linear model (GLM) to boosting for non-linear and interaction patterns, from one-parameter to two-parameter estimation to reflect more closely the reality.

Suggested Citation

  • Simon CK Lee, 2020. "Delta Boosting Implementation of Negative Binomial Regression in Actuarial Pricing," Risks, MDPI, vol. 8(1), pages 1-21, February.
  • Handle: RePEc:gam:jrisks:v:8:y:2020:i:1:p:19-:d:322684
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-9091/8/1/19/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-9091/8/1/19/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Kevin Kuo, 2019. "DeepTriangle: A Deep Learning Approach to Loss Reserving," Risks, MDPI, vol. 7(3), pages 1-12, September.
    2. David Mihaela & Jemna Dănuţ-Vasile, 2015. "Modeling the Frequency of Auto Insurance Claims by Means of Poisson and Negative Binomial Models," Scientific Annals of Economics and Business, Sciendo, vol. 62(2), pages 151-168, July.
    3. de Jong,Piet & Heller,Gillian Z., 2008. "Generalized Linear Models for Insurance Data," Cambridge Books, Cambridge University Press, number 9780521879149.
    4. Kevin Kuo, 2018. "DeepTriangle: A Deep Learning Approach to Loss Reserving," Papers 1804.09253, arXiv.org, revised Sep 2019.
    5. Yi Yang & Wei Qian & Hui Zou, 2018. "Insurance Premium Prediction via Gradient Tree-Boosted Tweedie Compound Poisson Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 36(3), pages 456-470, July.
    6. Greg Taylor, 2019. "Loss Reserving Models: Granular and Machine Learning Forms," Risks, MDPI, vol. 7(3), pages 1-18, July.
    7. Maximilien Baudry & Christian Y. Robert, 2019. "A machine learning approach for individual claims reserving in insurance," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 35(5), pages 1127-1155, September.
    8. Yip, Karen C.H. & Yau, Kelvin K.W., 2005. "On modeling claim frequency data in general insurance with extra zeros," Insurance: Mathematics and Economics, Elsevier, vol. 36(2), pages 153-163, April.
    9. David Scollnik, 2001. "Actuarial Modeling with MCMC and BUGs," North American Actuarial Journal, Taylor & Francis Journals, vol. 5(2), pages 96-124.
    10. Mario V. Wüthrich, 2018. "Machine learning in individual claims reserving," Scandinavian Actuarial Journal, Taylor & Francis Journals, vol. 2018(6), pages 465-480, July.
    11. Simon C. K. Lee & Sheldon Lin, 2018. "Delta Boosting Machine with Application to General Insurance," North American Actuarial Journal, Taylor & Francis Journals, vol. 22(3), pages 405-425, July.
    12. Jean‐Philippe Boucher & Michel Denuit & Montserrat Guillen, 2009. "Number of Accidents or Number of Claims? An Approach with Zero‐Inflated Poisson Models for Panel Data," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 76(4), pages 821-846, December.
    13. Jozef L. Teugels & Petra Vynckier, 1996. "The structure distribution in a mixed Poisson process," International Journal of Stochastic Analysis, Hindawi, vol. 9, pages 1-8, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Stephan M. Bischofberger, 2020. "In-Sample Hazard Forecasting Based on Survival Models with Operational Time," Risks, MDPI, vol. 8(1), pages 1-17, January.
    2. Christopher Blier-Wong & Hélène Cossette & Luc Lamontagne & Etienne Marceau, 2020. "Machine Learning in P&C Insurance: A Review for Pricing and Reserving," Risks, MDPI, vol. 9(1), pages 1-26, December.
    3. Łukasz Delong & Mario V. Wüthrich, 2020. "Neural Networks for the Joint Development of Individual Payments and Claim Incurred," Risks, MDPI, vol. 8(2), pages 1-34, April.
    4. Greg Taylor, 2019. "Risks Special Issue on “Granular Models and Machine Learning Models”," Risks, MDPI, vol. 8(1), pages 1-2, December.
    5. Kevin Kuo & Daniel Lupton, 2020. "Towards Explainability of Machine Learning Models in Insurance Pricing," Papers 2003.10674, arXiv.org.
    6. Tzougas, George & Hoon, W. L. & Lim, J. M., 2019. "The negative binomial-inverse Gaussian regression model with an application to insurance ratemaking," LSE Research Online Documents on Economics 101728, London School of Economics and Political Science, LSE Library.
    7. Avanzi, Benjamin & Taylor, Greg & Wang, Melantha & Wong, Bernard, 2021. "SynthETIC: An individual insurance claim simulator with feature control," Insurance: Mathematics and Economics, Elsevier, vol. 100(C), pages 296-308.
    8. Lee, Woojoo & Kim, Jeonghwan & Ahn, Jae Youn, 2020. "The Poisson random effect model for experience ratemaking: Limitations and alternative solutions," Insurance: Mathematics and Economics, Elsevier, vol. 91(C), pages 26-36.
    9. Payandeh Najafabadi Amir T. & MohammadPour Saeed, 2018. "A k-Inflated Negative Binomial Mixture Regression Model: Application to Rate–Making Systems," Asia-Pacific Journal of Risk and Insurance, De Gruyter, vol. 12(2), pages 1-31, July.
    10. Shengkun Xie & Anna T. Lawniczak, 2018. "Estimating Major Risk Factor Relativities in Rate Filings Using Generalized Linear Models," IJFS, MDPI, vol. 6(4), pages 1-14, October.
    11. Francis Duval & Mathieu Pigeon, 2019. "Individual Loss Reserving Using a Gradient Boosting-Based Approach," Risks, MDPI, vol. 7(3), pages 1-18, July.
    12. Shengkun Xie & Rebecca Luo, 2022. "Measuring Variable Importance in Generalized Linear Models for Modeling Size of Loss Distributions," Mathematics, MDPI, vol. 10(10), pages 1-19, May.
    13. Tan, Chong It, 2016. "Varying transition rules in bonus–malus systems: From rules specification to determination of optimal relativities," Insurance: Mathematics and Economics, Elsevier, vol. 68(C), pages 134-140.
    14. Benjamin Avanzi & Yanfeng Li & Bernard Wong & Alan Xian, 2022. "Ensemble distributional forecasting for insurance loss reserving," Papers 2206.08541, arXiv.org, revised Feb 2024.
    15. Muhammed Taher Al-Mudafer & Benjamin Avanzi & Greg Taylor & Bernard Wong, 2021. "Stochastic loss reserving with mixture density neural networks," Papers 2108.07924, arXiv.org.
    16. Mihaela COVRIG & Dumitru BADEA, 2017. "Some Generalized Linear Models for the Estimation of the Mean Frequency of Claims in Motor Insurance," ECONOMIC COMPUTATION AND ECONOMIC CYBERNETICS STUDIES AND RESEARCH, Faculty of Economic Cybernetics, Statistics and Informatics, vol. 51(4), pages 91-107.
    17. Freek Holvoet & Katrien Antonio & Roel Henckaerts, 2023. "Neural networks for insurance pricing with frequency and severity data: a benchmark study from data preprocessing to technical tariff," Papers 2310.12671, arXiv.org, revised Oct 2023.
    18. George Tzougas, 2020. "EM Estimation for the Poisson-Inverse Gamma Regression Model with Varying Dispersion: An Application to Insurance Ratemaking," Risks, MDPI, vol. 8(3), pages 1-23, September.
    19. Xu, Shuzhe & Zhang, Chuanlong & Hong, Don, 2022. "BERT-based NLP techniques for classification and severity modeling in basic warranty data study," Insurance: Mathematics and Economics, Elsevier, vol. 107(C), pages 57-67.
    20. Mihaela Covrig & Iulian Mircea & Gheorghita Zbaganu & Alexandru Coser & Alexandru Tindeche, 2015. "Using R In Generalized Linear Models," Romanian Statistical Review, Romanian Statistical Review, vol. 63(3), pages 33-45, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jrisks:v:8:y:2020:i:1:p:19-:d:322684. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.