Author
Listed:
- Ya-Han Hu
(National Central University)
- Chih-Fong Tsai
(National Central University)
- Pei-Ting Wang
(National Central University)
Abstract
Financial distress prediction (FDP) is a critical task for financial institutions and is typically framed as a class imbalance learning problem. To address this challenge, this paper proposes two ensemble-based strategies: the homogeneous and heterogeneous approaches, which combine multiple data re-sampling algorithms to generate diverse re-balanced training sets for classifier construction. Experimental results on seven FDP datasets demonstrate that the heterogeneous approach, which integrates under-, over-, and hybrid sampling methods with their optimal imbalance ratio settings, achieves superior performance in terms of AUC, particularly when applied with the LightGBM and XGBoost classifiers. Regarding Type I error, the heterogeneous combinations consistently outperform the homogeneous and other baseline approaches across various classifiers. The generalizability of the proposed methods is further validated using 37 additional class-imbalanced datasets from different domains, where the heterogeneous approach again shows the most robust performance. These findings suggest that the proposed models can serve as effective decision support tools for financial institutions to enhance credit risk evaluation and lending strategies. From a policy perspective, adopting such predictive frameworks can improve financial stability by reducing exposure to high-risk loans and enabling more accurate early warning systems for economic distress.
Suggested Citation
Ya-Han Hu & Chih-Fong Tsai & Pei-Ting Wang, 2025.
"Combining multiple data resampling methods and classifier ensembles for better financial distress prediction: homogeneous and heterogeneous approaches,"
Annals of Operations Research, Springer, vol. 353(2), pages 793-814, October.
Handle:
RePEc:spr:annopr:v:353:y:2025:i:2:d:10.1007_s10479-025-06706-5
DOI: 10.1007/s10479-025-06706-5
Download full text from publisher
As the access to this document is restricted, you may want to
for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:annopr:v:353:y:2025:i:2:d:10.1007_s10479-025-06706-5. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.