IDEAS home Printed from https://ideas.repec.org/p/aiz/louvad/2025024.html

Wasserstein boosting trees algorithm for count data, with application to claim frequencies in motor insurance

Author

Listed:
  • Denuit, Michel

    (Université catholique de Louvain, LIDAM/ISBA, Belgium)

  • Michaelides, Marie

    (Heriot-Watt University)

  • Trufin, Julien

    (ULB)

  • Verelst, Harrison

    (Detralytics)

Abstract

This paper proposes a variant of the well-known boosting trees algorithm to estimate conditional distributions. Since regression trees partition observations into subgroups, the corresponding empirical distributions can be used to define the splitting criterion. Precisely, the parametric approach using Poisson deviance is replaced with a non-parametric one maximizing probabilistic distances between empirical distributions in child nodes. Proceeding inthis way, the actuary obtains an estimated conditional distribution for the response, from which a conditional mean can be derived as well as any other quantity of interest in risk management. The numerical performances of the proposed method are assessed with simulated data while a case study demonstrates its usefulness for insurance applications.

Suggested Citation

  • Denuit, Michel & Michaelides, Marie & Trufin, Julien & Verelst, Harrison, 2025. "Wasserstein boosting trees algorithm for count data, with application to claim frequencies in motor insurance," LIDAM Discussion Papers ISBA 2025024, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
  • Handle: RePEc:aiz:louvad:2025024
    as

    Download full text from publisher

    File URL: https://dial.uclouvain.be/pr/boreal/en/object/boreal%3A308509/datastream/PDF_01/view
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Delong, Łukasz & Kozak, Anna, 2023. "The use of autoencoders for training neural networks with mixed categorical and numerical features," ASTIN Bulletin, Cambridge University Press, vol. 53(2), pages 213-232, May.
    2. Denuit, Michel & Hainaut, Donatien & Trufin, Julien, 2020. "Effective Statistical Learning Methods for Actuaries II : Tree-Based Methods and Extensions," LIDAM Reprints ISBA 2020035, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Freek Holvoet & Katrien Antonio & Roel Henckaerts, 2023. "Neural networks for insurance pricing with frequency and severity data: a benchmark study from data preprocessing to technical tariff," Papers 2310.12671, arXiv.org, revised Jan 2025.
    2. Ronald Richman & Salvatore Scognamiglio & Mario V. Wuthrich, 2024. "The Credibility Transformer," Papers 2409.16653, arXiv.org.
    3. Denuit, Michel & Trufin, Julien & Verdebout, Thomas, 2021. "Testing for more positive expectation dependence with application to model comparison," Insurance: Mathematics and Economics, Elsevier, vol. 101(PB), pages 163-172.
    4. Muhammed Taher Al-Mudafer & Benjamin Avanzi & Greg Taylor & Bernard Wong, 2021. "Stochastic loss reserving with mixture density neural networks," Papers 2108.07924, arXiv.org.
    5. Freek Holvoet & Christopher Blier-Wong & Katrien Antonio, 2025. "A multi-view contrastive learning framework for spatial embeddings in risk modelling," Papers 2511.17954, arXiv.org.
    6. Yves Staudt & Joël Wagner, 2021. "Assessing the Performance of Random Forests for Modeling Claim Severity in Collision Car Insurance," Risks, MDPI, vol. 9(3), pages 1-28, March.
    7. Trufin, Julien & Denuit, Michel, 2021. "Boosting cost-complexity pruned trees On Tweedie responses: the ABT machine," LIDAM Discussion Papers ISBA 2021015, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    8. Hainaut, Donatien & Trufin, Julien & Denuit, Michel, 2021. "Response versus gradient boosting trees, GLMs and neural networks under Tweedie loss and log-link," LIDAM Discussion Papers ISBA 2021012, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    9. Zuleyka Díaz Martínez & José Fernández Menéndez & Luis Javier García Villalba, 2023. "Tariff Analysis in Automobile Insurance: Is It Time to Switch from Generalized Linear Models to Generalized Additive Models?," Mathematics, MDPI, vol. 11(18), pages 1-16, September.
    10. Alexandre Brouste & Christophe Dutang & Tom Rohmer, 2022. "A Closed-form Alternative Estimator for GLM with Categorical Explanatory Variables," Post-Print hal-03689206, HAL.
    11. Willame, Gireg & Trufin, Julien & Denuit, Michel, 2023. "Boosted Poisson regression trees: A guide to the BT package in R," LIDAM Discussion Papers ISBA 2023008, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    12. Denuit, Michel & Trufin, Julien & Verdebout, Thomas, 2022. "Boosting on the responses with Tweedie loss functions," LIDAM Discussion Papers ISBA 2022039, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    13. Denuit, Michel & Trufin, Julien & Verdebout, Thomas, 2021. "Testing for more positive expectation dependence with application to model comparison," LIDAM Discussion Papers ISBA 2021021, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:aiz:louvad:2025024. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Nadja Peiffer (email available below). General contact details of provider: https://edirc.repec.org/data/isuclbe.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.