IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2601.00478.html

Multimodal Insights into Credit Risk Modelling: Integrating Climate and Text Data for Default Prediction

Author

Listed:
  • Zongxiao Wu
  • Ran Liu
  • Jiang Dai
  • Dan Luo

Abstract

Credit risk assessment increasingly relies on diverse sources of information beyond traditional structured financial data, particularly for micro and small enterprises (mSEs) with limited financial histories. This study proposes a multimodal framework that integrates structured credit variables, climate panel data, and unstructured textual narratives within a unified learning architecture. Specifically, we use long short-term memory (LSTM), the gated recurrent unit (GRU), and transformer models to analyse the interplay between these data modalities. The empirical results demonstrate that unimodal models based on climate or text data outperform those relying solely on structured data, while the integration of multiple data modalities yields significant improvements in credit default prediction. Using SHAP-based explainability methods, we find that physical climate risks play an important role in default prediction, with water-logging by rain emerging as the most influential factor. Overall, this study demonstrates the potential of multimodal approaches in AI-enabled decision-making, which provides robust tools for credit risk assessment while contributing to the broader integration of environmental and textual insights into predictive analytics.

Suggested Citation

  • Zongxiao Wu & Ran Liu & Jiang Dai & Dan Luo, 2026. "Multimodal Insights into Credit Risk Modelling: Integrating Climate and Text Data for Default Prediction," Papers 2601.00478, arXiv.org.
  • Handle: RePEc:arx:papers:2601.00478
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2601.00478
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Kamesh Korangi & Christophe Mues & Cristi'an Bravo, 2021. "A transformer-based model for default prediction in mid-cap corporate markets," Papers 2111.09902, arXiv.org, revised Apr 2023.
    2. Anderson, Raymond, 2007. "The Credit Scoring Toolkit: Theory and Practice for Retail Credit Risk Management and Decision Automation," OUP Catalogue, Oxford University Press, number 9780199226405.
    3. Stevenson, Matthew & Mues, Christophe & Bravo, Cristián, 2021. "The value of text for small business default prediction: A Deep Learning approach," European Journal of Operational Research, Elsevier, vol. 295(2), pages 758-771.
    4. Duc Duy Nguyen & Steven Ongena & Shusen Qi & Vathunyoo Sila, 2022. "Climate Change Risk and the Cost of Mortgage Credit [Does climate change affect real estate prices? Only if you believe in it]," Review of Finance, European Finance Association, vol. 26(6), pages 1509-1549.
    5. Benjamin Collier & Ani L. Katchova & Jerry R. Skees, 2011. "Loan portfolio performance and El Niño, an intervention analysis," Agricultural Finance Review, Emerald Group Publishing Limited, vol. 71(1), pages 98-119, May.
    6. Calomiris, Charles W. & Larrain, Mauricio & Liberti, José & Sturgess, Jason, 2017. "How collateral laws shape lending and sectoral activity," Journal of Financial Economics, Elsevier, vol. 123(1), pages 163-188.
    7. Emilio Gutierrez & David Jaume & Martín Tobal, 2023. "Do Credit Supply Shocks Affect Employment in Middle-Income Countries?," American Economic Journal: Economic Policy, American Economic Association, vol. 15(4), pages 1-36, November.
    8. Addoum, Jawad M. & Ng, David T. & Ortiz-Bobea, Ariel, 2023. "Temperature shocks and industry earnings news," Journal of Financial Economics, Elsevier, vol. 150(1), pages 1-45.
    9. Kriebel, Johannes & Stitz, Lennart, 2022. "Credit default prediction from user-generated text in peer-to-peer lending using deep learning," European Journal of Operational Research, Elsevier, vol. 302(1), pages 309-323.
    10. Babak Abedin & Christian Meske & Iris Junglas & Fethi Rabhi & Hamid R. Motahari-Nezhad, 2022. "Designing and Managing Human-AI Interactions," Information Systems Frontiers, Springer, vol. 24(3), pages 691-697, June.
    11. Korangi, Kamesh & Mues, Christophe & Bravo, Cristián, 2023. "A transformer-based model for default prediction in mid-cap corporate markets," European Journal of Operational Research, Elsevier, vol. 308(1), pages 306-320.
    12. Benjamin Collier & Ani L. Katchova & Jerry R. Skees, 2011. "Loan portfolio performance and El Niño, an intervention analysis," Agricultural Finance Review, Emerald Group Publishing Limited, vol. 71(1), pages 98-119, May.
    13. Gregory Lane, 2024. "Adapting to Climate Risk With Guaranteed Credit: Evidence From Bangladesh," Econometrica, Econometric Society, vol. 92(2), pages 355-386, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Baesens, Bart & Smedts, Kristien, 2025. "Boosting credit risk models," The British Accounting Review, Elsevier, vol. 57(4).
    2. Schwab, Brandon & Kriebel, Johannes, 2026. "Mitigating adversarial attacks on transformer models in credit scoring," European Journal of Operational Research, Elsevier, vol. 328(1), pages 309-323.
    3. Katsafados, Apostolos G. & Leledakis, George N. & Pyrgiotakis, Emmanouil G. & Androutsopoulos, Ion & Fergadiotis, Manos, 2024. "Machine learning in bank merger prediction: A text-based approach," European Journal of Operational Research, Elsevier, vol. 312(2), pages 783-797.
    4. Brei, Michael & Mohan, Preeya & Perez Barahona, Agustin & Strobl, Eric, 2024. "Transmission of natural disasters to the banking sector: Evidence from thirty years of tropical storms in the Caribbean," Journal of International Money and Finance, Elsevier, vol. 141(C).
    5. Aguilar-Gomez, Sandra & Gutierrez, Emilio & Heres, David & Jaume, David & Tobal, Martin, 2024. "Thermal stress and financial distress: Extreme temperatures and firms’ loan defaults in Mexico," Journal of Development Economics, Elsevier, vol. 168(C).
    6. Sahab Zandi & Kamesh Korangi & Juan C. Moreno-Paredes & Mar'ia 'Oskarsd'ottir & Christophe Mues & Cristi'an Bravo, 2025. "A Multimodal Approach to SME Credit Scoring Integrating Transaction and Ownership Networks," Papers 2510.09407, arXiv.org.
    7. Jiaji Wang & Qianting Ma & Chao Wang & Tianxiang Sheng, 2024. "Climate change and credit risk in rural financial institutions: A study based on transition risk," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 45(6), pages 4208-4226, September.
    8. Weng, Futian & Zhu, Miao & Buckle, Mike & Hajek, Petr & Abedin, Mohammad Zoynul, 2025. "Class imbalance Bayesian model averaging for consumer loan default prediction: The role of soft credit information," Research in International Business and Finance, Elsevier, vol. 74(C).
    9. Francisco E. Ilabaca & Robert Mann & Philip Mulder, 2024. "Global Banks and Natural Disasters," Working Papers 24-05, Office of Financial Research, US Department of the Treasury.
    10. Yi Lu & Aifan Ling & Chaoqun Wang & Yaxin Xu, 2025. "Why Bonds Fail Differently? Explainable Multimodal Learning for Multi-Class Default Prediction," Papers 2509.10802, arXiv.org.
    11. Das, Ronnie & Ahmed, Wasim & Sharma, Kshitij & Hardey, Mariann & Dwivedi, Yogesh K. & Zhang, Ziqi & Apostolidis, Chrysostomos & Filieri, Raffaele, 2024. "Towards the development of an explainable e-commerce fake review index: An attribute analytics approach," European Journal of Operational Research, Elsevier, vol. 317(2), pages 382-400.
    12. Ma, Xuejiao & Che, Tianqi & Jiang, Qichuan, 2025. "A three-stage prediction model for firm default risk: An integration of text sentiment analysis," Omega, Elsevier, vol. 131(C).
    13. Mahsa Tavakoli & Rohitash Chandra & Fengrui Tian & Cristi'an Bravo, 2023. "Multi-Modal Deep Learning for Credit Rating Prediction Using Text and Numerical Data Streams," Papers 2304.10740, arXiv.org, revised Nov 2024.
    14. Vairetti, Carla & Aránguiz, Ignacio & Maldonado, Sebastián & Karmy, Juan Pablo & Leal, Alonso, 2024. "Analytics-driven complaint prioritisation via deep learning and multicriteria decision-making," European Journal of Operational Research, Elsevier, vol. 312(3), pages 1108-1118.
    15. Mario Sanz-Guerrero & Javier Arroyo, 2024. "Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending," Papers 2401.16458, arXiv.org, revised Mar 2025.
    16. Ma, Xi-Ao & Liu, Haibo & Liu, Yi & Zhang, Justin Zuopeng, 2025. "Multi-label feature selection considering label importance-weighted relevance and label-dependency redundancy," European Journal of Operational Research, Elsevier, vol. 322(1), pages 215-236.
    17. Wu, Zongxiao & Dong, Yizhe & Li, Yaoyiran & Shi, Baofeng, 2025. "Unleashing the power of text for credit default prediction: Comparing human-written and generative AI-refined texts," European Journal of Operational Research, Elsevier, vol. 326(3), pages 691-706.
    18. Shi, Yong & Qu, Yi & Chen, Zhensong & Mi, Yunlong & Wang, Yunong, 2024. "Improved credit risk prediction based on an integrated graph representation learning approach with graph transformation," European Journal of Operational Research, Elsevier, vol. 315(2), pages 786-801.
    19. Maarouf, Abdurahman & Feuerriegel, Stefan & Pröllochs, Nicolas, 2025. "A fused large language model for predicting startup success," European Journal of Operational Research, Elsevier, vol. 322(1), pages 198-214.
    20. Shanyan Lai, 2025. "Asset Pricing in Pre-trained Transformer," Papers 2505.01575, arXiv.org, revised May 2025.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2601.00478. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.