Author
Listed:
- Rina Wang
- Haolun Shi
- Jiguo Cao
Abstract
The generalized linear model (GLM) is a popular modeling choice for pricing non-life insurance policies. However, high-cardinality categorical insurance data presents significant challenges for these GLM rate-making models. Additionally, insurance regulators often require rating territories, which are clusters of insurance policies’ geographic locations for setting insurance rates, to meet certain standards. For instance, (1) the credibility standard ensures that the number of policies in a territory is large enough to be credible and representative, (2) the contiguity standard requires the locations in each territory to be geographically adjacent to promote a logical and practical spatial grouping, and (3) the cardinality standard specifies an acceptable range for the number of territories in a geographic area. To address these challenges, this article proposes a nested GLM framework for non-life insurance rate-making applications. In this framework, neural network models with categorical embedding layers are constructed to model the residual deviance from simple GLMs, using high-cardinality categorical variables as input. Low-dimensionly features extracted from the neural network model effectively translate categorical variables into meaningful numerical representations, capturing their effects on the initial model’s residuals. The features corresponding to the location-related variable are further converted into a contiguous territory rating variable via spatially constrained clustering models. By incorporating outcomes from these models, the nested GLM not only satisfies regulatory requirements but also enhances the model’s predictive power, while maintaining the interpretability from the (generalized) linear form. The construction of a nested Poisson GLM is presented in this article. Its performance is demonstrated using a real-life Brazil auto insurance data to model claim frequency.
Suggested Citation
Rina Wang & Haolun Shi & Jiguo Cao, 2025.
"A Nested GLM Framework with Neural Network Encoding and Spatially Constrained Clustering in Non-Life Insurance Ratemaking,"
North American Actuarial Journal, Taylor & Francis Journals, vol. 29(3), pages 645-661, July.
Handle:
RePEc:taf:uaajxx:v:29:y:2025:i:3:p:645-661
DOI: 10.1080/10920277.2024.2442416
Download full text from publisher
As the access to this document is restricted, you may want to
for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:uaajxx:v:29:y:2025:i:3:p:645-661. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/uaaj .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.