IDEAS home Printed from https://ideas.repec.org/a/wly/jforec/v44y2025i7p2067-2088.html

Default Prediction Framework With Optimal Feature Set and Matching Ratio

Author

Listed:
  • Guotai Chi
  • Fengshan Bai
  • Hongping Tan
  • Ying Zhou

Abstract

We propose a default prediction framework that incorporates imbalance handling and feature selection. For imbalance handling, we determine the optimal ratio of non‐default to default firms by minimizing the Type‐II error of the majority voting deep fully connected network (MV‐DFCN) model. For feature selection, we design a two‐stage process that first eliminates highly correlated and redundant features, and then refines the feature set using backward selection. Experimental results show that the DFCN model within the proposed framework outperforms baseline models in terms of G‐Mean and AUC and achieves the lowest Type‐II error rate. Furthermore, the framework outperforms eight baseline combinations of imbalance handling and feature selection strategies. Additionally, SHAP values are used to assess feature contributions, and nine features with statistically significant impacts are identified.

Suggested Citation

  • Guotai Chi & Fengshan Bai & Hongping Tan & Ying Zhou, 2025. "Default Prediction Framework With Optimal Feature Set and Matching Ratio," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 44(7), pages 2067-2088, November.
  • Handle: RePEc:wly:jforec:v:44:y:2025:i:7:p:2067-2088
    DOI: 10.1002/for.3284
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/for.3284
    Download Restriction: no

    File URL: https://libkey.io/10.1002/for.3284?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Kamesh Korangi & Christophe Mues & Cristi'an Bravo, 2021. "A transformer-based model for default prediction in mid-cap corporate markets," Papers 2111.09902, arXiv.org, revised Apr 2023.
    2. Minkwan Ahn & Samuel B. Bonsall & Andrew Buskirk, 2019. "Do managers withhold bad news from credit rating agencies?," Review of Accounting Studies, Springer, vol. 24(3), pages 972-1021, September.
    3. Alnoor Bhimani & Mohamed Azzim Gulamhussen & Samuel Rocha Lopes, 2014. "Owner liability and financial reporting information as predictors of firm default in bank loans," Review of Accounting Studies, Springer, vol. 19(2), pages 769-804, June.
    4. Geng, Ruibin & Bose, Indranil & Chen, Xi, 2015. "Prediction of financial distress: An empirical study of listed Chinese companies using data mining," European Journal of Operational Research, Elsevier, vol. 241(1), pages 236-247.
    5. Ming Xu & Chu Zhang, 2009. "Bankruptcy prediction: the case of Japanese listed companies," Review of Accounting Studies, Springer, vol. 14(4), pages 534-558, December.
    6. Korangi, Kamesh & Mues, Christophe & Bravo, Cristián, 2023. "A transformer-based model for default prediction in mid-cap corporate markets," European Journal of Operational Research, Elsevier, vol. 308(1), pages 306-320.
    7. Flavio Bazzana & Marco Bee & Ahmed Almustfa Hussin Adam Khatir, 2024. "Machine learning techniques for default prediction: an application to small Italian companies," Risk Management, Palgrave Macmillan, vol. 26(1), pages 1-23, February.
    8. Bart Baesens & Rudy Setiono & Christophe Mues & Jan Vanthienen, 2003. "Using Neural Network Rule Extraction and Decision Tables for Credit-Risk Evaluation," Management Science, INFORMS, vol. 49(3), pages 312-329, March.
    9. Kyle D. Chen & Warren H. Hausman, 2000. "Technical Note: Mathematical Properties of the Optimal Product Line Selection Problem Using Choice-Based Conjoint Analysis," Management Science, INFORMS, vol. 46(2), pages 327-332, February.
    10. Giordani, Paolo & Jacobson, Tor & Schedvin, Erik von & Villani, Mattias, 2014. "Taking the Twists into Account: Predicting Firm Bankruptcy Risk with Splines of Financial Ratios," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 49(4), pages 1071-1099, August.
    11. du Jardin, Philippe, 2021. "Forecasting corporate failure using ensemble of self-organizing neural networks," European Journal of Operational Research, Elsevier, vol. 288(3), pages 869-885.
    12. Li, Chunyu & Lou, Chenxin & Luo, Dan & Xing, Kai, 2021. "Chinese corporate distress prediction using LASSO: The role of earnings management," International Review of Financial Analysis, Elsevier, vol. 76(C).
    13. Zhu, You & Zhou, Li & Xie, Chi & Wang, Gang-Jin & Nguyen, Truong V., 2019. "Forecasting SMEs' credit risk in supply chain finance with an enhanced hybrid ensemble machine learning approach," International Journal of Production Economics, Elsevier, vol. 211(C), pages 22-33.
    14. Juha-Pekka Kallunki & Elina Pyykkö, 2013. "Do defaulting CEOs and directors increase the likelihood of financial distress of the firm?," Review of Accounting Studies, Springer, vol. 18(1), pages 228-260, March.
    15. Juan Laborda & Seyong Ryoo, 2021. "Feature Selection in a Credit Scoring Model," Mathematics, MDPI, vol. 9(7), pages 1-22, March.
    16. Ohlson, Ja, 1980. "Financial Ratios And The Probabilistic Prediction Of Bankruptcy," Journal of Accounting Research, John Wiley & Sons, Ltd., vol. 18(1), pages 109-131.
    17. Sigurdur Ólafsson & Jaekyung Yang, 2005. "Intelligent Partitioning for Feature Selection," INFORMS Journal on Computing, INFORMS, vol. 17(3), pages 339-355, August.
    18. Stewart Jones, 2017. "Corporate bankruptcy prediction: a high dimensional analysis," Review of Accounting Studies, Springer, vol. 22(3), pages 1366-1422, September.
    19. Elizabeth Gutierrez & Jake Krupa & Miguel Minutti-Meza & Maria Vulcheva, 2020. "Do going concern opinions provide incremental information to predict corporate defaults?," Review of Accounting Studies, Springer, vol. 25(4), pages 1344-1381, December.
    20. Emmanuel O. Ogundimu, 2019. "Prediction of default probability by using statistical models for rare events," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 182(4), pages 1143-1162, October.
    21. Yan Zhang, 2018. "Assessing Fair Lending Risks Using Race/Ethnicity Proxies," Management Science, INFORMS, vol. 64(1), pages 178-197, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Correia, Maria, 2025. "Accounting and corporate failure: the evolving role of accounting information in bankruptcy prediction," LSE Research Online Documents on Economics 128340, London School of Economics and Political Science, LSE Library.
    2. Jiaqi Kuang & Zihao Guo & Jinghan Wang & Yezhen Wang & Kaiwen Zhang, 2026. "Financial distress prediction using signatures: evidence from Chinese listed firms," Risk Management, Palgrave Macmillan, vol. 28(1), pages 1-27, February.
    3. Thao Tran & Ngoc Hong Nguyen & Binh Thien Le & Nam Thanh Vu & Duc Hong Vo, 2023. "Examining financial distress of the Vietnamese listed firms using accounting-based models," PLOS ONE, Public Library of Science, vol. 18(5), pages 1-17, May.
    4. Ken Li, 2024. "Liquidity ratios and corporate failures," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 64(1), pages 1111-1134, March.
    5. Ma, Xuejiao & Che, Tianqi & Jiang, Qichuan, 2025. "A three-stage prediction model for firm default risk: An integration of text sentiment analysis," Omega, Elsevier, vol. 131(C).
    6. Papík, Mário & Papíková, Lenka, 2025. "The possibilities of using AutoML in bankruptcy prediction: Case of Slovakia," Technological Forecasting and Social Change, Elsevier, vol. 215(C).
    7. Lin, Yu-Cheng & Padliansyah, Roni & Lu, Yu-Hsin & Liu, Wen-Rang, 2025. "Bankruptcy prediction: Integration of convolutional neural networks and explainable artificial intelligence techniques," International Journal of Accounting Information Systems, Elsevier, vol. 56(C).
    8. Christoph Braunsberger & Ewald Aschauer, 2025. "Corporate Failure Prediction: A Literature Review of Altman Z-Score and Machine Learning Models Within a Technology Adoption Framework," JRFM, MDPI, vol. 18(8), pages 1-32, August.
    9. Jiaming Liu & Chengzhang Li & Peng Ouyang & Jiajia Liu & Chong Wu, 2023. "Interpreting the prediction results of the tree‐based gradient boosting models for financial distress prediction with an explainable machine learning approach," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(5), pages 1112-1137, August.
    10. Katsafados, Apostolos G. & Leledakis, George N. & Pyrgiotakis, Emmanouil G. & Androutsopoulos, Ion & Fergadiotis, Manos, 2024. "Machine learning in bank merger prediction: A text-based approach," European Journal of Operational Research, Elsevier, vol. 312(2), pages 783-797.
    11. Sami Ben Jabeur & Nicolae Stef & Pedro Carmona, 2023. "Bankruptcy Prediction using the XGBoost Algorithm and Variable Importance Feature Engineering," Computational Economics, Springer;Society for Computational Economics, vol. 61(2), pages 715-741, February.
    12. Mohammad Shamsu Uddin & Guotai Chi & Mazin A. M. Al Janabi & Tabassum Habib & Kunpeng Yuan, 2022. "Modeling credit risk with a multi‐stage hybrid model: An alternative statistical approach," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(7), pages 1386-1415, November.
    13. Alberto Tron & Maurizio Dallocchio & Salvatore Ferri & Federico Colantoni, 2023. "Corporate governance and financial distress: lessons learned from an unconventional approach," Journal of Management & Governance, Springer;Accademia Italiana di Economia Aziendale (AIDEA), vol. 27(2), pages 425-456, June.
    14. Zhao, Qi & Xu, Weijun & Ji, Yucheng, 2023. "Predicting financial distress of Chinese listed companies using machine learning: To what extent does textual disclosure matter?," International Review of Financial Analysis, Elsevier, vol. 89(C).
    15. Li, Chunyu & Lou, Chenxin & Luo, Dan & Xing, Kai, 2021. "Chinese corporate distress prediction using LASSO: The role of earnings management," International Review of Financial Analysis, Elsevier, vol. 76(C).
    16. Philippe Jardin, 2025. "Designing Ensemble-Based Models Using Neural Networks and Temporal Financial Profiles to Forecast Firms’ Financial Failure," Computational Economics, Springer;Society for Computational Economics, vol. 65(1), pages 149-209, January.
    17. Zhou, Fanyin & Fu, Lijun & Li, Zhiyong & Xu, Jiawei, 2022. "The recurrence of financial distress: A survival analysis," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1100-1115.
    18. Adriana Csikosova & Maria Janoskova & Katarina Culkova, 2020. "Application of Discriminant Analysis for Avoiding the Risk of Quarry Operation Failure," JRFM, MDPI, vol. 13(10), pages 1-14, September.
    19. Yulian Zhang & Shigeyuki Hamori, 2020. "Forecasting Crude Oil Market Crashes Using Machine Learning Technologies," Energies, MDPI, vol. 13(10), pages 1-14, May.
    20. Youssef Zizi & Mohamed Oudgou & Abdeslam El Moudden, 2020. "Determinants and Predictors of SMEs’ Financial Failure: A Logistic Regression Approach," Risks, MDPI, vol. 8(4), pages 1-21, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:jforec:v:44:y:2025:i:7:p:2067-2088. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www3.interscience.wiley.com/cgi-bin/jhome/2966 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.