IDEAS home Printed from https://ideas.repec.org/a/eee/insuma/v99y2021icp42-62.html
   My bibliography  Save this article

Incorporating statistical clustering methods into mortality models to improve forecasting performances

Author

Listed:
  • Tsai, Cary Chi-Liang
  • Cheng, Echo Sihan

Abstract

Statistical clustering is a procedure of classifying a set of objects such that objects in the same class (called cluster) are more homogeneous, with respect to some features or characteristics of objects, to each other than to those in any other classes. In this paper, we apply four clustering approaches to improving forecasting performances of the Lee–Carter and CBD models. First, each of four clustering methods (Ward’s hierarchical clustering, divisive hierarchical clustering, K-means clustering, and Gaussian mixture model clustering) is adopted to determine, based on some characteristics of mortality rates, the number and partition of age clusters from the whole study ages 25-84. Next, we forecast 10-year and 20-year mortality rates for each of the age clusters using the Lee–Carter and CBD models, respectively. Finally, numerical illustrations are given with two R packages “NbClust” and “mclust” for clustering. Mortality data for both genders of the US and the UK are obtained from the Human Mortality Database, and the MAPE (mean absolute percentage error) measure is adopted to evaluate forecasting performance. Comparisons of MAPE values are made with and without clustering, which demonstrate that all the proposed clustering methods can improve forecasting performances of the Lee–Carter and CBD models.

Suggested Citation

  • Tsai, Cary Chi-Liang & Cheng, Echo Sihan, 2021. "Incorporating statistical clustering methods into mortality models to improve forecasting performances," Insurance: Mathematics and Economics, Elsevier, vol. 99(C), pages 42-62.
  • Handle: RePEc:eee:insuma:v:99:y:2021:i:c:p:42-62
    DOI: 10.1016/j.insmatheco.2021.03.005
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S016766872100038X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.insmatheco.2021.03.005?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hua Chen & Samuel H. Cox, 2009. "Modeling Mortality With Jumps: Applications to Mortality Securitization," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 76(3), pages 727-751, September.
    2. Andrew Cairns & David Blake & Kevin Dowd & Guy Coughlan & David Epstein & Alen Ong & Igor Balevich, 2009. "A Quantitative Comparison of Stochastic Mortality Models Using Data From England and Wales and the United States," North American Actuarial Journal, Taylor & Francis Journals, vol. 13(1), pages 1-35.
    3. Tzuling Lin & Cary Chi‐liang Tsai, 2015. "A Simple Linear Regression Approach to Modeling and Forecasting Mortality Rates," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 34(7), pages 543-559, November.
    4. Hyndman, Rob J. & Booth, Heather, 2008. "Stochastic population forecasts using functional data models for mortality, fertility and migration," International Journal of Forecasting, Elsevier, vol. 24(3), pages 323-342.
    5. Stephen Johnson, 1967. "Hierarchical clustering schemes," Psychometrika, Springer;The Psychometric Society, vol. 32(3), pages 241-254, September.
    6. Renshaw, A.E. & Haberman, S., 2006. "A cohort-based extension to the Lee-Carter model for mortality reduction factors," Insurance: Mathematics and Economics, Elsevier, vol. 38(3), pages 556-570, June.
    7. Carter, Lawrence R. & Lee, Ronald D., 1992. "Modeling and forecasting US sex differentials in mortality," International Journal of Forecasting, Elsevier, vol. 8(3), pages 393-411, November.
    8. Wai-Sum Chan & Johnny Li & Jackie Li, 2014. "The CBD Mortality Indexes: Modeling and Applications," North American Actuarial Journal, Taylor & Francis Journals, vol. 18(1), pages 38-58.
    9. Haberman, Steven & Renshaw, Arthur, 2009. "On age-period-cohort parametric mortality rate projections," Insurance: Mathematics and Economics, Elsevier, vol. 45(2), pages 255-270, October.
    10. Bock, Hans H., 1996. "Probabilistic models in cluster analysis," Computational Statistics & Data Analysis, Elsevier, vol. 23(1), pages 5-28, November.
    11. Li, Han & O’Hare, Colin & Zhang, Xibin, 2015. "A semiparametric panel approach to mortality modeling," Insurance: Mathematics and Economics, Elsevier, vol. 61(C), pages 264-270.
    12. Andrew J. G. Cairns & David Blake & Kevin Dowd, 2006. "A Two‐Factor Model for Stochastic Mortality with Parameter Uncertainty: Theory and Calibration," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 73(4), pages 687-718, December.
    13. Cary Chi-Liang Tsai & Shuai Yang, 2015. "A Linear Regression Approach to Modeling Mortality Rates of Different Forms," North American Actuarial Journal, Taylor & Francis Journals, vol. 19(1), pages 1-23, January.
    14. Li, Johnny Siu-Hang & Hardy, Mary R. & Tan, Ken Seng, 2009. "Uncertainty in Mortality Forecasting: An Extension to the Classical Lee-Carter Approach," ASTIN Bulletin, Cambridge University Press, vol. 39(1), pages 137-164, May.
    15. Cox, Samuel H. & Lin, Yijia & Pedersen, Hal, 2010. "Mortality risk modeling: Applications to insurance securitization," Insurance: Mathematics and Economics, Elsevier, vol. 46(1), pages 242-253, February.
    16. Charrad, Malika & Ghazzali, Nadia & Boiteau, Véronique & Niknafs, Azam, 2014. "NbClust: An R Package for Determining the Relevant Number of Clusters in a Data Set," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 61(i06).
    17. Gabor J. Szekely & Maria L. Rizzo, 2005. "Hierarchical Clustering via Joint Between-Within Distances: Extending Ward's Minimum Variance Method," Journal of Classification, Springer;The Classification Society, vol. 22(2), pages 151-183, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Blake, David & Cairns, Andrew J.G., 2021. "Longevity risk and capital markets: The 2019-20 update," Insurance: Mathematics and Economics, Elsevier, vol. 99(C), pages 395-439.
    2. Blake, David & El Karoui, Nicole & Loisel, Stéphane & MacMinn, Richard, 2018. "Longevity risk and capital markets: The 2015–16 update," Insurance: Mathematics and Economics, Elsevier, vol. 78(C), pages 157-173.
    3. Tsai, Cary Chi-Liang & Kim, Seyeon, 2022. "Model mortality rates using property and casualty insurance reserving methods," Insurance: Mathematics and Economics, Elsevier, vol. 106(C), pages 326-340.
    4. Li, Johnny Siu-Hang, 2010. "Pricing longevity risk with the parametric bootstrap: A maximum entropy approach," Insurance: Mathematics and Economics, Elsevier, vol. 47(2), pages 176-186, October.
    5. Basellini, Ugofilippo & Camarda, Carlo Giovanni & Booth, Heather, 2023. "Thirty years on: A review of the Lee–Carter method for forecasting mortality," International Journal of Forecasting, Elsevier, vol. 39(3), pages 1033-1049.
    6. Hunt, Andrew & Blake, David, 2015. "Modelling longevity bonds: Analysing the Swiss Re Kortis bond," Insurance: Mathematics and Economics, Elsevier, vol. 63(C), pages 12-29.
    7. Wang, Chou-Wen & Huang, Hong-Chih & Hong, De-Chuan, 2013. "A feasible natural hedging strategy for insurance companies," Insurance: Mathematics and Economics, Elsevier, vol. 52(3), pages 532-541.
    8. Benchimol, Andrés Gustavo & Albarrán Lozano, Irene & Marín Díazaraque, Juan Miguel & Alonso, Pablo J., 2015. "Hierarchical Lee-Carter model estimation through data cloning applied to demographically linked countries," DES - Working Papers. Statistics and Econometrics. WS ws1510, Universidad Carlos III de Madrid. Departamento de Estadística.
    9. James Risk & Michael Ludkovski, 2015. "Statistical Emulators for Pricing and Hedging Longevity Risk Products," Papers 1508.00310, arXiv.org, revised Sep 2015.
    10. Li, J.S.H. & Ng, A.C.Y. & Chan, W.S., 2013. "Stochastic life table forecasting: A time-simultaneous fan chart application," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 93(C), pages 98-107.
    11. Li, Hong & Lu, Yang, 2017. "Coherent Forecasting Of Mortality Rates: A Sparse Vector-Autoregression Approach," ASTIN Bulletin, Cambridge University Press, vol. 47(2), pages 563-600, May.
    12. Marie-Pier Bergeron-Boucher & Søren Kjærgaard & James E. Oeppen & James W. Vaupel, 2019. "The impact of the choice of life table statistics when forecasting mortality," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 41(43), pages 1235-1268.
    13. Liu, Yanxin & Li, Johnny Siu-Hang, 2015. "The age pattern of transitory mortality jumps and its impact on the pricing of catastrophic mortality bonds," Insurance: Mathematics and Economics, Elsevier, vol. 64(C), pages 135-150.
    14. Apostolos Bozikas & Georgios Pitselis, 2019. "Credible Regression Approaches to Forecast Mortality for Populations with Limited Data," Risks, MDPI, vol. 7(1), pages 1-22, February.
    15. Li, Johnny Siu-Hang & Zhou, Rui & Hardy, Mary, 2015. "A step-by-step guide to building two-population stochastic mortality models," Insurance: Mathematics and Economics, Elsevier, vol. 63(C), pages 121-134.
    16. Benchimol, Andrés, 2017. "Proyección de mortalidad en España mediante mixturas de modelos y análisis del impacto económico del riesgo de longevidad /Mortality Projection in Spain through Mixtures of Models and Analysis of the ," Estudios de Economia Aplicada, Estudios de Economia Aplicada, vol. 35, pages 341-366, Mayo.
    17. Basellini, Ugofilippo & Camarda, Carlo Giovanni & Booth, Heather, 2022. "Thirty years on: A review of the Lee-Carter method for forecasting mortality," SocArXiv 8u34d, Center for Open Science.
    18. Lin, Tzuling & Tsai, Cary Chi-Liang, 2013. "On the mortality/longevity risk hedging with mortality immunization," Insurance: Mathematics and Economics, Elsevier, vol. 53(3), pages 580-596.
    19. Li, Han & O’Hare, Colin, 2017. "Semi-parametric extensions of the Cairns–Blake–Dowd model: A one-dimensional kernel smoothing approach," Insurance: Mathematics and Economics, Elsevier, vol. 77(C), pages 166-176.
    20. Risk, J. & Ludkovski, M., 2016. "Statistical emulators for pricing and hedging longevity risk products," Insurance: Mathematics and Economics, Elsevier, vol. 68(C), pages 45-60.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:insuma:v:99:y:2021:i:c:p:42-62. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/505554 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.