IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2511.17954.html

A multi-view contrastive learning framework for spatial embeddings in risk modelling

Author

Listed:
  • Freek Holvoet
  • Christopher Blier-Wong
  • Katrien Antonio

Abstract

Incorporating spatial information, particularly those influenced by climate, weather, and demographic factors, is crucial for improving underwriting precision and enhancing risk management in insurance. However, spatial data are often unstructured, high-dimensional, and difficult to integrate into predictive models. Embedding methods are needed to convert spatial data into meaningful representations for modelling tasks. We propose a novel multi-view contrastive learning framework for generating spatial embeddings that combine information from multiple spatial data sources. To train the model, we construct a spatial dataset that merges satellite imagery and OpenStreetMap features across Europe. The framework aligns these spatial views with coordinate-based encodings, producing low-dimensional embeddings that capture both spatial structure and contextual similarity. Once trained, the model generates embeddings directly from latitude-longitude pairs, enabling any dataset with coordinates to be enriched with meaningful spatial features without requiring access to the original spatial inputs. In a case study on French real estate prices, we compare models trained on raw coordinates against those using our spatial embeddings as inputs. The embeddings consistently improve predictive accuracy across generalised linear, additive, and boosting models, while providing interpretable spatial effects and demonstrating transferability to unseen regions.

Suggested Citation

  • Freek Holvoet & Christopher Blier-Wong & Katrien Antonio, 2025. "A multi-view contrastive learning framework for spatial embeddings in risk modelling," Papers 2511.17954, arXiv.org.
  • Handle: RePEc:arx:papers:2511.17954
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2511.17954
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Susanne Gschlößl & Claudia Czado, 2007. "Spatial modelling of claim frequency and claim size in non-life insurance," Scandinavian Actuarial Journal, Taylor & Francis Journals, vol. 2007(3), pages 202-225.
    2. Peng Shi & Kun Shi, 2023. "Non-Life Insurance Risk Classification Using Categorical Embedding," North American Actuarial Journal, Taylor & Francis Journals, vol. 27(3), pages 579-601, July.
    3. Richman, Ronald, 2021. "AI in actuarial science – a review of recent advances – part 2," Annals of Actuarial Science, Cambridge University Press, vol. 15(2), pages 230-258, July.
    4. Taylor, Greg, 2001. "Geographic Premium Rating by Whittaker Spatial Smoothing," ASTIN Bulletin, Cambridge University Press, vol. 31(1), pages 147-160, May.
    5. Boskov, M. & Verrall, R. J., 1994. "Premium Rating by Geographic Area Using Spatial Models," ASTIN Bulletin, Cambridge University Press, vol. 24(1), pages 131-143, May.
    6. Roel Henckaerts & Katrien Antonio & Maxime Clijsters & Roel Verbelen, 2018. "A data driven binning strategy for the construction of insurance tariff classes," Scandinavian Actuarial Journal, Taylor & Francis Journals, vol. 2018(8), pages 681-705, September.
    7. Xeni Dimakos & Arnoldo Di Rattalma, 2002. "Bayesian Premium Rating with Latent Structure," Scandinavian Actuarial Journal, Taylor & Francis Journals, vol. 2002(3), pages 162-184.
    8. Delong, Łukasz & Kozak, Anna, 2023. "The use of autoencoders for training neural networks with mixed categorical and numerical features," ASTIN Bulletin, Cambridge University Press, vol. 53(2), pages 213-232, May.
    9. Freek Holvoet & Katrien Antonio & Roel Henckaerts, 2023. "Neural networks for insurance pricing with frequency and severity data: a benchmark study from data preprocessing to technical tariff," Papers 2310.12671, arXiv.org, revised Jan 2025.
    10. Richman, Ronald, 2021. "AI in actuarial science – a review of recent advances – part 1," Annals of Actuarial Science, Cambridge University Press, vol. 15(2), pages 207-229, July.
    11. Shi, Peng & Shi, Kun, 2017. "Territorial Risk Classification Using Spatially Dependent Frequency-Severity Models," ASTIN Bulletin, Cambridge University Press, vol. 47(2), pages 437-465, May.
    12. Freek Holvoet & Katrien Antonio & Roel Henckaerts, 2025. "Neural Networks for Insurance Pricing with Frequency and Severity Data: A Benchmark Study from Data Preprocessing to Technical Tariff," North American Actuarial Journal, Taylor & Francis Journals, vol. 29(3), pages 519-562, July.
    13. Gao, Lisa & Shi, Peng, 2022. "Leveraging high-resolution weather information to predict hail damage claims: A spatial point process for replicated point patterns," Insurance: Mathematics and Economics, Elsevier, vol. 107(C), pages 161-179.
    14. Blier-Wong, Christopher & Cossette, Hélène & Lamontagne, Luc & Marceau, Etienne, 2022. "Geographic Ratemaking With Spatial Embeddings," ASTIN Bulletin, Cambridge University Press, vol. 52(1), pages 1-31, January.
    15. Avanzi, Benjamin & Taylor, Greg & Wang, Melantha & Wong, Bernard, 2024. "Machine Learning with High-Cardinality Categorical Features in Actuarial Applications," ASTIN Bulletin, Cambridge University Press, vol. 54(2), pages 213-238, May.
    16. Roel Henckaerts & Marie-Pier Côté & Katrien Antonio & Roel Verbelen, 2021. "Boosting Insights in Insurance Tariff Plans with Tree-Based Machine Learning Methods," North American Actuarial Journal, Taylor & Francis Journals, vol. 25(2), pages 255-285, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jaiswal, Rachana & Gupta, Shashank & Tiwari, Aviral Kumar, 2024. "Big data and machine learning-based decision support system to reshape the vaticination of insurance claims," Technological Forecasting and Social Change, Elsevier, vol. 209(C).
    2. Simon Hatzesberger & Iris Nonneman, 2025. "Advanced Applications of Generative AI in Actuarial Science: Case Studies Beyond ChatGPT," Papers 2506.18942, arXiv.org.
    3. Yang Qiao & Chou-Wen Wang & Wenjun Zhu, 2024. "Machine learning in long-term mortality forecasting," The Geneva Papers on Risk and Insurance - Issues and Practice, Palgrave Macmillan;The Geneva Association, vol. 49(2), pages 340-362, April.
    4. Freek Holvoet & Katrien Antonio & Roel Henckaerts, 2023. "Neural networks for insurance pricing with frequency and severity data: a benchmark study from data preprocessing to technical tariff," Papers 2310.12671, arXiv.org, revised Jan 2025.
    5. Benjamin Avanzi & Greg Taylor & Melantha Wang & Bernard Wong, 2023. "Machine Learning with High-Cardinality Categorical Features in Actuarial Applications," Papers 2301.12710, arXiv.org.
    6. Li, Li & Li, Han & Panagiotelis, Anastasios, 2025. "Boosting domain-specific models with shrinkage: An application in mortality forecasting," International Journal of Forecasting, Elsevier, vol. 41(1), pages 191-207.
    7. Giorgio Alfredo Spedicato & Christophe Dutang & Quentin Guibert, 2025. "Adjusting Manual Rates to Own Experience: Comparing the Credibility Approach to Machine Learning," Post-Print hal-04821310, HAL.
    8. Aleksandar Arandjelovi'c & Julia Eisenberg, 2024. "Optimal risk mitigation by deep reinsurance," Papers 2408.06168, arXiv.org, revised Nov 2025.
    9. Jamotton, Charlotte & Hainaut, Donatien, 2024. "Latent Dirichlet Allocation for structured insurance data," LIDAM Discussion Papers ISBA 2024008, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    10. Diego Zappa & Gian Paolo Clemente & Francesco Della Corte & Nino Savelli, 2023. "Editorial on the Special Issue on Insurance: complexity, risks and its connection with social sciences," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(2), pages 125-130, December.
    11. Zhu, Felix & Dong, Yumo & Huang, Fei, 2025. "Data-rich economic forecasting for actuarial applications," Insurance: Mathematics and Economics, Elsevier, vol. 124(C).
    12. Katrien Antonio & Christophe Dutang & Andreas Tsanakas, 2021. "Editorial," Post-Print hal-04748464, HAL.
    13. Yves L. Grize, 2015. "Applications of Statistics in the Field of General Insurance: An Overview," International Statistical Review, International Statistical Institute, vol. 83(1), pages 135-159, April.
    14. Francesca Perla & Salvatore Scognamiglio, 2023. "Locally-coherent multi-population mortality modelling via neural networks," Decisions in Economics and Finance, Springer;Associazione per la Matematica, vol. 46(1), pages 157-176, June.
    15. Hung-Tsung Hsiao & Chou-Wen Wang & I.-Chien Liu & Ko-Lun Kung, 2024. "Mortality improvement neural-network models with autoregressive effects," The Geneva Papers on Risk and Insurance - Issues and Practice, Palgrave Macmillan;The Geneva Association, vol. 49(2), pages 363-383, April.
    16. Corsaro, Stefania & Marino, Zelda & Scognamiglio, Salvatore, 2024. "Quantile mortality modelling of multiple populations via neural networks," Insurance: Mathematics and Economics, Elsevier, vol. 116(C), pages 114-133.
    17. Zhang, Yaojun & Ji, Lanpeng & Aivaliotis, Georgios & Taylor, Charles, 2024. "Bayesian CART models for insurance claims frequency," Insurance: Mathematics and Economics, Elsevier, vol. 114(C), pages 108-131.
    18. Tadayon, Vahid & Ghanbarzadeh, Mitra, 2024. "Spatial copula-based modeling of claim frequency and claim size in third-party car insurance: A Poisson-mixed approach for predictive analysis," Insurance: Mathematics and Economics, Elsevier, vol. 119(C), pages 119-129.
    19. Yu Wang, Muhammad Asraf bin Abdullah, 2024. "Time Series Analysis and Optimization of the Prediction Model of Agricultural Insurance Loss Ratio," Research on World Agricultural Economy, Nan Yang Academy of Sciences Pte Ltd (NASS), vol. 5(4), November.
    20. Ronald Richman & Salvatore Scognamiglio & Mario V. Wuthrich, 2024. "The Credibility Transformer," Papers 2409.16653, arXiv.org.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2511.17954. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.