IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v326y2025i3p691-706.html

Unleashing the power of text for credit default prediction: Comparing human-written and generative AI-refined texts

Author

Listed:
  • Wu, Zongxiao
  • Dong, Yizhe
  • Li, Yaoyiran
  • Shi, Baofeng

Abstract

This study explores the integration of a representative large language model, ChatGPT, into lending decision-making with a focus on credit default prediction. Specifically, we use ChatGPT to analyse and interpret loan assessments written by loan officers and generate refined versions of these texts. Our comparative analysis reveals significant differences between generative artificial intelligence (AI)-refined and human-written texts in terms of text length, semantic similarity, and linguistic representations. Using deep learning techniques, we show that incorporating unstructured text data, particularly ChatGPT-refined texts, alongside conventional structured data significantly enhances credit default predictions. Furthermore, we demonstrate how the contents of both human-written and ChatGPT-refined assessments contribute to the models’ prediction and show that the effect of essential words is highly context-dependent. Moreover, we find that ChatGPT’s analysis of borrower delinquency contributes the most to improving predictive accuracy. We also evaluate the business impact of the models based on human-written and ChatGPT-refined texts, and find that, in most cases, the latter yields higher profitability than the former. This study provides valuable insights into the transformative potential of generative AI in financial services.

Suggested Citation

  • Wu, Zongxiao & Dong, Yizhe & Li, Yaoyiran & Shi, Baofeng, 2025. "Unleashing the power of text for credit default prediction: Comparing human-written and generative AI-refined texts," European Journal of Operational Research, Elsevier, vol. 326(3), pages 691-706.
  • Handle: RePEc:eee:ejores:v:326:y:2025:i:3:p:691-706
    DOI: 10.1016/j.ejor.2025.04.032
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221725003170
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2025.04.032?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Jiang, Cuiqing & Wang, Zhao & Zhao, Huimin, 2019. "A prediction-driven mixture cure model and its application in credit scoring," European Journal of Operational Research, Elsevier, vol. 277(1), pages 20-31.
    2. Boyang Chen & Zongxiao Wu & Ruoran Zhao, 2023. "From fiction to fact: the growing role of generative AI in business and finance," Journal of Chinese Economic and Business Studies, Taylor & Francis Journals, vol. 21(4), pages 471-496, October.
    3. Fitzpatrick, Trevor & Mues, Christophe, 2021. "How can lenders prosper? Comparing machine learning approaches to identify profitable peer-to-peer loan investments," European Journal of Operational Research, Elsevier, vol. 294(2), pages 711-722.
    4. Ching-Nam Hang & Pei-Duo Yu & Roberto Morabito & Chee-Wei Tan, 2024. "Large Language Models Meet Next-Generation Networking Technologies: A Review," Future Internet, MDPI, vol. 16(10), pages 1-29, October.
    5. Zhi Da & Joseph Engelberg & Pengjie Gao, 2015. "Editor's Choice The Sum of All FEARS Investor Sentiment and Asset Prices," The Review of Financial Studies, Society for Financial Studies, vol. 28(1), pages 1-32.
    6. Kriebel, Johannes & Stitz, Lennart, 2022. "Credit default prediction from user-generated text in peer-to-peer lending using deep learning," European Journal of Operational Research, Elsevier, vol. 302(1), pages 309-323.
    7. Yufei Xia & Lingyun He & Yinguo Li & Nana Liu & Yanlin Ding, 2020. "Predicting loan default in peer‐to‐peer lending using narrative data," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(2), pages 260-280, March.
    8. Shijie Wu & Ozan Irsoy & Steven Lu & Vadim Dabravolski & Mark Dredze & Sebastian Gehrmann & Prabhanjan Kambadur & David Rosenberg & Gideon Mann, 2023. "BloombergGPT: A Large Language Model for Finance," Papers 2303.17564, arXiv.org, revised Dec 2023.
    9. Kozodoi, Nikita & Lessmann, Stefan & Alamgir, Morteza & Moreira-Matias, Luis & Papakonstantinou, Konstantinos, 2025. "Fighting sampling bias: A framework for training and evaluating credit scoring models," European Journal of Operational Research, Elsevier, vol. 324(2), pages 616-628.
    10. Rajkamal Iyer & Asim Ijaz Khwaja & Erzo F. P. Luttmer & Kelly Shue, 2016. "Screening Peers Softly: Inferring the Quality of Small Borrowers," Management Science, INFORMS, vol. 62(6), pages 1554-1577, June.
    11. Dowling, Michael & Lucey, Brian, 2023. "ChatGPT for (Finance) research: The Bananarama Conjecture," Finance Research Letters, Elsevier, vol. 53(C).
    12. Stevenson, Matthew & Mues, Christophe & Bravo, Cristián, 2021. "The value of text for small business default prediction: A Deep Learning approach," European Journal of Operational Research, Elsevier, vol. 295(2), pages 758-771.
    13. William J. Bazley & Henrik Cronqvist & Milica Mormann, 2021. "Visual Finance: The Pervasive Effects of Red on Investor Behavior," Management Science, INFORMS, vol. 67(9), pages 5616-5641, September.
    14. Katsafados, Apostolos G. & Leledakis, George N. & Pyrgiotakis, Emmanouil G. & Androutsopoulos, Ion & Fergadiotis, Manos, 2024. "Machine learning in bank merger prediction: A text-based approach," European Journal of Operational Research, Elsevier, vol. 312(2), pages 783-797.
    15. Mai, Feng & Tian, Shaonan & Lee, Chihoon & Ma, Ling, 2019. "Deep learning models for bankruptcy prediction using textual disclosures," European Journal of Operational Research, Elsevier, vol. 274(2), pages 743-758.
    16. Cuiqing Jiang & Zhao Wang & Ruiya Wang & Yong Ding, 2018. "Loan default prediction by combining soft information extracted from descriptive text in online peer-to-peer lending," Annals of Operations Research, Springer, vol. 266(1), pages 511-529, July.
    17. Maarouf, Abdurahman & Feuerriegel, Stefan & Pröllochs, Nicolas, 2025. "A fused large language model for predicting startup success," European Journal of Operational Research, Elsevier, vol. 322(1), pages 198-214.
    18. Chen, Yujia & Calabrese, Raffaella & Martin-Barragan, Belen, 2024. "Interpretable machine learning for imbalanced credit scoring datasets," European Journal of Operational Research, Elsevier, vol. 312(1), pages 357-372.
    19. Mukherjee, Abhiroop & Panayotov, George & Shon, Janghoon, 2021. "Eye in the sky: Private satellites and government macro data," Journal of Financial Economics, Elsevier, vol. 141(1), pages 234-254.
    20. Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.
    21. Wenzlaff, Karsten & Spaeth, Sebastian, 2022. "Smarter than humans? Validating how OpenAI's ChatGPT model explains crowdfunding, alternative finance and community finance," WiSo-HH Working Paper Series 75, University of Hamburg, Faculty of Business, Economics and Social Sciences, WISO Research Laboratory.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Akhil Bhardwaj, 2025. "Artificial Intelligence and the limits of reason: a framework for responsible use in public and private sectors," Humanities and Social Sciences Communications, Palgrave Macmillan, vol. 12(1), pages 1-7, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Schwab, Brandon & Kriebel, Johannes, 2026. "Mitigating adversarial attacks on transformer models in credit scoring," European Journal of Operational Research, Elsevier, vol. 328(1), pages 309-323.
    2. Kriebel, Johannes & Stitz, Lennart, 2022. "Credit default prediction from user-generated text in peer-to-peer lending using deep learning," European Journal of Operational Research, Elsevier, vol. 302(1), pages 309-323.
    3. Wang, Weiqing & Chen, Yuxi & Wang, Liukai & Xiong, Yu, 2025. "Developing the value of legal judgments of supply chain finance for credit risk prediction through novel ACWGAN-GPSA approach," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 196(C).
    4. Mahsa Tavakoli & Rohitash Chandra & Fengrui Tian & Cristi'an Bravo, 2023. "Multi-Modal Deep Learning for Credit Rating Prediction Using Text and Numerical Data Streams," Papers 2304.10740, arXiv.org, revised Nov 2024.
    5. Shi, Yong & Qu, Yi & Chen, Zhensong & Mi, Yunlong & Wang, Yunong, 2024. "Improved credit risk prediction based on an integrated graph representation learning approach with graph transformation," European Journal of Operational Research, Elsevier, vol. 315(2), pages 786-801.
    6. De Bock, Koen W. & Coussement, Kristof & Caigny, Arno De & Słowiński, Roman & Baesens, Bart & Boute, Robert N. & Choi, Tsan-Ming & Delen, Dursun & Kraus, Mathias & Lessmann, Stefan & Maldonado, Sebast, 2024. "Explainable AI for Operational Research: A defining framework, methods, applications, and a research agenda," European Journal of Operational Research, Elsevier, vol. 317(2), pages 249-272.
    7. Koen W. de Bock & Kristof Coussement & Arno De Caigny & Roman Slowiński & Bart Baesens & Robert N Boute & Tsan-Ming Choi & Dursun Delen & Mathias Kraus & Stefan Lessmann & Sebastián Maldonado & David , 2023. "Explainable AI for Operational Research: A Defining Framework, Methods, Applications, and a Research Agenda," Post-Print hal-04219546, HAL.
    8. Nisha Arora & Pankaj Deep Kaur, 2026. "A comparative review on AI empowered CRA process of banking and P2P lending models," Operational Research, Springer, vol. 26(2), pages 1-64, June.
    9. Weng, Futian & Zhu, Miao & Buckle, Mike & Hajek, Petr & Abedin, Mohammad Zoynul, 2025. "Class imbalance Bayesian model averaging for consumer loan default prediction: The role of soft credit information," Research in International Business and Finance, Elsevier, vol. 74(C).
    10. Yi Lu & Aifan Ling & Chaoqun Wang & Yaxin Xu, 2025. "Why Bonds Fail Differently? Explainable Multimodal Learning for Multi-Class Default Prediction," Papers 2509.10802, arXiv.org.
    11. Christopher Gerling & Stefan Lessmann, 2023. "Multimodal Document Analytics for Banking Process Automation," Papers 2307.11845, arXiv.org, revised Nov 2023.
    12. Das, Ronnie & Ahmed, Wasim & Sharma, Kshitij & Hardey, Mariann & Dwivedi, Yogesh K. & Zhang, Ziqi & Apostolidis, Chrysostomos & Filieri, Raffaele, 2024. "Towards the development of an explainable e-commerce fake review index: An attribute analytics approach," European Journal of Operational Research, Elsevier, vol. 317(2), pages 382-400.
    13. Mario Sanz-Guerrero & Javier Arroyo, 2024. "Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending," Papers 2401.16458, arXiv.org, revised Mar 2025.
    14. Xia, Yufei & Han, Zhiyin & Li, Yawen & He, Lingyun, 2025. "Credit scoring model for fintech lending: An integration of large language models and FocalPoly loss," International Journal of Forecasting, Elsevier, vol. 41(3), pages 894-919.
    15. Baesens, Bart & Smedts, Kristien, 2025. "Boosting credit risk models," The British Accounting Review, Elsevier, vol. 57(4).
    16. Maarouf, Abdurahman & Feuerriegel, Stefan & Pröllochs, Nicolas, 2025. "A fused large language model for predicting startup success," European Journal of Operational Research, Elsevier, vol. 322(1), pages 198-214.
    17. Li, Zhiyong & Li, Aimin & Bellotti, Anthony & Yao, Xiao, 2023. "The profitability of online loans: A competing risks analysis on default and prepayment," European Journal of Operational Research, Elsevier, vol. 306(2), pages 968-985.
    18. Xuhui Wang & Lifang Zhang & Jianzhou Wang & Zhenkun Liu & Xinsong Niu, 2026. "Profit-oriented loan default prediction for the financial industry: a fusion framework with interpretability," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 12(1), pages 1-33, December.
    19. Dong, Mengming Michael & Stratopoulos, Theophanis C. & Wang, Victor Xiaoqi, 2024. "A scoping review of ChatGPT research in accounting and finance," International Journal of Accounting Information Systems, Elsevier, vol. 55(C).
    20. Sultan Amed & Tanmay Sen & Sayantan Banerjee, 2026. "FSL-BDP: Federated Survival Learning with Bayesian Differential Privacy for Credit Risk Modeling," Papers 2601.11134, arXiv.org.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:326:y:2025:i:3:p:691-706. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.