
Combining Predictions of Auto Insurance Claims

Authors
  • Chenglong Ye

    (Dr. Bing Zhang Department of Statistics, University of Kentucky, 317 Multidisciplinary Science Building, 725 Rose St., Lexington, KY 40536, USA)

  • Lin Zhang

    (First American Financial, Santa Ana, CA 92707, USA)

  • Mingxuan Han

    (School of Computing, University of Utah, Salt Lake City, UT 84112, USA)

  • Yanjia Yu

    (School of Statistics, University of Minnesota, Minneapolis, MN 55455, USA)

  • Bingxin Zhao

    (School of Statistics, University of Minnesota, Minneapolis, MN 55455, USA)

  • Yuhong Yang

    (School of Statistics, University of Minnesota, Minneapolis, MN 55455, USA)

Abstract

This paper aims to better predict highly skewed auto insurance claims by combining candidate predictions. We analyze a version of the Kangaroo Auto Insurance company data and study the effects of combining different methods using five measures of prediction accuracy. The results show the following. First, when there is an outstanding (in terms of Gini Index) prediction among the candidates, the “forecast combination puzzle” phenomenon disappears. The simple average method performs much worse than the more sophisticated model combination methods, indicating that combining different methods could help us avoid performance degradation. Second, the choice of the prediction accuracy measure is crucial in defining the best candidate prediction for “low frequency and high severity” (LFHS) data. For example, mean square error (MSE) does not distinguish well between model combination methods, as the values are close. Third, the performances of different model combination methods can differ drastically. We propose using a new model combination method, named ARM-Tweedie, for such LFHS data; it benefits from an optimal rate of convergence and exhibits a desirable performance in several measures for the Kangaroo data. Fourth, overall, model combination methods improve the prediction accuracy for auto insurance claim costs. In particular, Adaptive Regression by Mixing (ARM), ARM-Tweedie, and constrained Linear Regression can improve forecast performance when there are only weak learners or when no dominant learner exists.
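The contrast between simple averaging and likelihood-based weighting can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the paper's ARM-Tweedie procedure: the weights come from a single-split, Gaussian-likelihood simplification of ARM, each model's variance is estimated from its own held-out residuals, and the claim amounts and the three candidate predictions (`good`, `constant`, `noisy`) are made-up toy numbers rather than the Kangaroo data.

```python
import numpy as np


def simple_average(preds):
    """Equal-weight combination; preds has shape (n_models, n_obs)."""
    return preds.mean(axis=0)


def arm_style_weights(preds_holdout, y_holdout):
    """Exponential weights from held-out Gaussian log-likelihoods.

    Simplified single-split stand-in for ARM (an assumption made for
    illustration). Each model's variance is plugged in from its own
    held-out residuals.
    """
    resid = y_holdout - preds_holdout                  # (n_models, n_obs)
    n = y_holdout.size
    sigma2 = np.maximum((resid ** 2).mean(axis=1), 1e-12)
    loglik = -0.5 * n * np.log(sigma2) - (resid ** 2).sum(axis=1) / (2.0 * sigma2)
    loglik -= loglik.max()                             # avoid underflow in exp
    w = np.exp(loglik)
    return w / w.sum()


def mse(y, pred):
    """Mean squared error between observed and predicted claims."""
    return float(np.mean((y - pred) ** 2))


# Hypothetical "low frequency, high severity" claims: mostly zeros, a few large.
y_hold = np.array([0.0, 0.0, 300.0, 0.0, 0.0, 4000.0])
y_eval = np.array([0.0, 0.0, 0.0, 500.0, 0.0, 6000.0])

# Rows: one strong candidate ("good") and two weak ones ("constant", "noisy").
preds_hold = np.array([
    [20.0, 10.0, 250.0, 30.0, 10.0, 3500.0],
    [700.0] * 6,
    [900.0, 400.0, 600.0, 800.0, 500.0, 700.0],
])
preds_eval = np.array([
    [10.0, 30.0, 20.0, 400.0, 10.0, 5000.0],
    [700.0] * 6,
    [600.0, 900.0, 500.0, 700.0, 800.0, 400.0],
])

weights = arm_style_weights(preds_hold, y_hold)        # learned on the hold-out split
mse_avg = mse(y_eval, simple_average(preds_eval))      # equal weights
mse_arm = mse(y_eval, weights @ preds_eval)            # likelihood-based weights

print("weights:", np.round(weights, 4))
print("MSE simple average:", round(mse_avg, 1))
print("MSE ARM-style:", round(mse_arm, 1))
```

With one clearly dominant candidate, the likelihood-based weights concentrate on it, while the simple average is pulled toward the weak predictions. A Tweedie likelihood in place of the Gaussian one would be closer in spirit to the method the abstract names, but that variant is not shown here.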

Suggested Citation

  • Chenglong Ye & Lin Zhang & Mingxuan Han & Yanjia Yu & Bingxin Zhao & Yuhong Yang, 2022. "Combining Predictions of Auto Insurance Claims," Econometrics, MDPI, vol. 10(2), pages 1-15, April.
  • Handle: RePEc:gam:jecnmx:v:10:y:2022:i:2:p:19-:d:791038

    Download full text from publisher

    File URL: https://www.mdpi.com/2225-1146/10/2/19/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2225-1146/10/2/19/
    Download Restriction: no

    References listed on IDEAS

    1. Yang, Yuhong, 2004. "Combining Forecasting Procedures: Some Theoretical Results," Econometric Theory, Cambridge University Press, vol. 20(1), pages 176-222, February.
    2. Edward W. (Jed) Frees & Glenn Meyers & A. David Cummings, 2014. "Insurance Ratemaking and a Gini Index," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 81(2), pages 335-366, June.
    3. Yang Y., 2001. "Adaptive Regression by Mixing," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 574-588, June.
    4. de Jong,Piet & Heller,Gillian Z., 2008. "Generalized Linear Models for Insurance Data," Cambridge Books, Cambridge University Press, number 9780521879149, August.
    5. Frees, Edward W. & Valdez, Emiliano A., 2008. "Hierarchical Insurance Claims Modeling," Journal of the American Statistical Association, American Statistical Association, vol. 103(484), pages 1457-1469.
    6. Xinyu Zhang & Dalei Yu & Guohua Zou & Hua Liang, 2016. "Optimal Model Averaging Estimation for Generalized Linear Models and Generalized Linear Mixed-Effects Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1775-1790, October.
    7. Bailey, Robert A. & Simon, LeRoy J., 1960. "Two Studies in Automobile Insurance Ratemaking," ASTIN Bulletin, Cambridge University Press, vol. 1(4), pages 192-217, December.

    Citations



    Cited by:

    1. Tim J. Boonen & Xinyue Fan & Zixiao Quan, 2025. "Fairness-Aware Insurance Pricing: A Multi-Objective Optimization Approach," Papers 2512.24747, arXiv.org.
    2. Jaiswal, Rachana & Gupta, Shashank & Tiwari, Aviral Kumar, 2024. "Big data and machine learning-based decision support system to reshape the vaticination of insurance claims," Technological Forecasting and Social Change, Elsevier, vol. 209(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Deprez, Laurens & Antonio, Katrien & Boute, Robert, 2021. "Pricing service maintenance contracts using predictive analytics," European Journal of Operational Research, Elsevier, vol. 290(2), pages 530-545.
    2. Wei Qian & Craig A. Rolling & Gang Cheng & Yuhong Yang, 2019. "On the Forecast Combination Puzzle," Econometrics, MDPI, vol. 7(3), pages 1-26, September.
    3. Yu, Dalei & Tang, Nian-Sheng & Shi, Yang, 2025. "Adaptively aggregated forecast for exponential family panel model," International Journal of Forecasting, Elsevier, vol. 41(2), pages 733-747.
    4. Qingfeng Liu & Qingsong Yao & Guoqing Zhao, 2020. "Model averaging estimation for conditional volatility models with an application to stock market volatility forecast," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(5), pages 841-863, August.
    5. Zou, Hui & Yang, Yuhong, 2004. "Combining time series models for forecasting," International Journal of Forecasting, Elsevier, vol. 20(1), pages 69-84.
    6. Aivars Spilbergs & Andris Fomins & Māris Krastiņš, 2022. "Multivariate Modelling of Motor Third Party Liability Insurance Claims," European Journal of Business Science and Technology, Mendel University in Brno, Faculty of Business and Economics, vol. 8(1), pages 5-18.
    7. Kaiwen Wang & Jiehui Ding & Kristen R. Lidwell & Scott Manski & Gee Y. Lee & Emilio Xavier Esposito, 2019. "Treatment Level and Store Level Analyses of Healthcare Data," Risks, MDPI, vol. 7(2), pages 1-22, April.
    8. Haili Zhang & Guohua Zou, 2020. "Cross-Validation Model Averaging for Generalized Functional Linear Model," Econometrics, MDPI, vol. 8(1), pages 1-35, February.
    9. Liao, Jun & Wan, Alan T.K. & He, Shuyuan & Zou, Guohua, 2022. "Optimal model averaging for multivariate regression models," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    10. Wei, Yuting & Wang, Qihua, 2021. "Cross-validation-based model averaging in linear models with response missing at random," Statistics & Probability Letters, Elsevier, vol. 171(C).
    11. Fang, Fang & Li, Jialiang & Xia, Xiaochao, 2022. "Semiparametric model averaging prediction for dichotomous response," Journal of Econometrics, Elsevier, vol. 229(2), pages 219-245.
    12. Richardson, Robert & Hartman, Brian, 2018. "Bayesian nonparametric regression models for modeling and predicting healthcare claims," Insurance: Mathematics and Economics, Elsevier, vol. 83(C), pages 1-8.
    13. Wei Qian & Craig A. Rolling & Gang Cheng & Yuhong Yang, 2015. "On the Forecast Combination Puzzle," Papers 1505.00475, arXiv.org.
    14. Song Liu & Yuhong Yang, 2012. "Combining models in longitudinal data analysis," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 64(2), pages 233-254, April.
    15. Chen Zhuo & Yang Yuhong, 2007. "Time Series Models for Forecasting: Testing or Combining?," Studies in Nonlinear Dynamics & Econometrics, De Gruyter, vol. 11(1), pages 1-37, March.
    16. Zhang, Xinyu & Liu, Chu-An, 2023. "Model averaging prediction by K-fold cross-validation," Journal of Econometrics, Elsevier, vol. 235(1), pages 280-301.
    17. Wei, Xiaoqiao & Yang, Yuhong, 2012. "Robust forecast combinations," Journal of Econometrics, Elsevier, vol. 166(2), pages 224-236.
    18. Lin, Tzu-Chi & Liu, Chu-An, 2025. "Model averaging prediction for possibly nonstationary autoregressions," Journal of Econometrics, Elsevier, vol. 249(PB).
    19. Yuqing Zhang & Neil Walton, 2019. "Adaptive Pricing in Insurance: Generalized Linear Models and Gaussian Process Regression Approaches," Papers 1907.05381, arXiv.org.
    20. Yuting Wei & Qihua Wang & Wei Liu, 2021. "Model averaging for linear models with responses missing at random," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(3), pages 535-553, June.

