IDEAS home Printed from https://ideas.repec.org/a/wly/jforec/v44y2025i7p2132-2150.html

A Two‐Stage Interpretable Model to Explain Classifier in Credit Risk Prediction

Author

Listed:
  • Lu Wang
  • Zecheng Yu
  • Jingling Ma
  • Xiaofang Chen
  • Chong Wu

Abstract

In the financial sector, credit risk represents a critical issue, and accurate prediction is essential for mitigating financial risk and ensuring economic stability. Although artificial intelligence methods can achieve satisfactory accuracy, explaining their predictive results poses a significant challenge, thereby prompting research on interpretability. Current research primarily focuses on individual interpretability methods and seldom investigates the combined application of multiple approaches. To address the limitations of existing research, this study proposes a two‐stage interpretability model that integrates SHAP and counterfactual explanations. In the first stage, SHAP is employed to analyze feature importance, categorizing features into subsets according to their positive or negative impact on predicted outcomes. In the second stage, a genetic algorithm generates counterfactual explanations by considering feature importance and applying perturbations in various directions based on predefined subsets, thereby accurately identifying counterfactual samples that can modify predicted outcomes. We conducted experiments on the German credit datasets, HMEQ datasets, and the Taiwan Default of Credit Card Clients dataset using SVM, XGB, MLP, and LSTM as base classifiers, respectively. The experimental results indicate that the frequency of feature changes in the counterfactual explanations generated closely aligns with the feature importance derived from the SHAP method. Under the evaluation metrics of effectiveness and sparsity, the performance demonstrates improvements over both basic counterfactual explanation methods and prototype‐based counterfactuals. Furthermore, this study offers recommendations based on features derived from SHAP analysis results and counterfactual explanations to reduce the risk of classification as a default.

Suggested Citation

  • Lu Wang & Zecheng Yu & Jingling Ma & Xiaofang Chen & Chong Wu, 2025. "A Two‐Stage Interpretable Model to Explain Classifier in Credit Risk Prediction," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 44(7), pages 2132-2150, November.
  • Handle: RePEc:wly:jforec:v:44:y:2025:i:7:p:2132-2150
    DOI: 10.1002/for.3288
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/for.3288
    Download Restriction: no

    File URL: https://libkey.io/10.1002/for.3288?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Yang Liu & Fei Huang & Lili Ma & Qingguo Zeng & Jiale Shi, 2024. "Credit scoring prediction leveraging interpretable ensemble learning," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(2), pages 286-308, March.
    2. Petter Eilif de Lange & Borger Melsom & Christian Bakke Vennerød & Sjur Westgaard, 2022. "Explainable AI for Credit Assessment in Banks," JRFM, MDPI, vol. 15(12), pages 1-23, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yang Liu, 2024. "Analyzing the effect of user‐generated content on studio performance: A combined approach," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 45(4), pages 2228-2248, June.
    2. Lee, Min-Jae & Choi, Sun-Yong, 2025. "The impact of financial statement indicators on bank credit ratings: Insights from machine learning and SHAP techniques," Finance Research Letters, Elsevier, vol. 85(PA).
    3. M. K. Nallakaruppan & Himakshi Chaturvedi & Veena Grover & Balamurugan Balusamy & Praveen Jaraut & Jitendra Bahadur & V. P. Meena & Ibrahim A. Hameed, 2024. "Credit Risk Assessment and Financial Decision Support Using Explainable Artificial Intelligence," Risks, MDPI, vol. 12(10), pages 1-18, October.
    4. Krishna Neupane & Prem Sapkota & Ujjwal Prajapati, 2026. "Beyond the Numbers: Causal Effects of Financial Report Sentiment on Bank Profitability," Papers 2602.17851, arXiv.org.
    5. Bauer, Kevin & Gill, Andrej & Langenbucher, Katja & Franke, Lucia, 2025. "Institutionalizing explainability: On credit scoring, AI, and consumer agency," SAFE White Paper Series 116, Leibniz Institute for Financial Research SAFE.
    6. Tingting Cheng & Jiachen Cong & Fei Liu & Xuanbin Yang, 2025. "Binary Response Forecasting under a Factor-Augmented Framework," Papers 2507.16462, arXiv.org.
    7. Song, Cen & Ma, Xiaoqian & Ardizzone, Catherine & Zhuang, Jun, 2024. "The adverse impact of flight delays on passenger satisfaction: An innovative prediction model utilizing wide & deep learning," Journal of Air Transport Management, Elsevier, vol. 114(C).
    8. Felix Haag & Konstantin Hopf & Thorsten Staake, 2026. "Validating Explainer Methods: A Functionally Grounded Approach for Numerical Forecasting," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 45(2), pages 819-836, March.
    9. Nils-Gunnar Birkeland Abrahamsen & Emil Nylén-Forthun & Mats Møller & Petter Eilif de Lange & Morten Risstad, 2024. "Financial Distress Prediction in the Nordics: Early Warnings from Machine Learning Models," JRFM, MDPI, vol. 17(10), pages 1-23, September.
    10. XianZhu Shao & YongQiang Du & LuoFei Liang & Xue Xu & Zhiyi Lu, 2026. "A Dynamic Cost‐Adjusted AdaCost Model for Credit Prediction of Smallholder Farmers," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 45(3), pages 997-1019, April.
    11. Ali, Hassnian & Zafar, Muhammad Bilal & Aysan, Ahmet Faruk, 2025. "Generative AI in finance: Replicability, methodological contingencies, and future research directions," Finance Research Letters, Elsevier, vol. 86(PF).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:jforec:v:44:y:2025:i:7:p:2132-2150. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www3.interscience.wiley.com/cgi-bin/jhome/2966 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.