IDEAS home Printed from https://ideas.repec.org/a/ids/injbaf/v15y2025i5p1-32.html

Influence of sampling methods on bankruptcy prediction: normal vs. abnormal economic conditions

Author

Listed:
  • Asif M. Huq
  • Wonder Mahembe

Abstract

Bankruptcy prediction research has largely emphasised model performance through feature selection and algorithm optimisation, while the equally important challenge of class imbalance remains underexplored. Most studies also focus on publicly listed firms, reflecting the accessibility of standardised data. Our study makes a novel and valuable contribution by leveraging a large-scale dataset of private firms - an economically significant yet understudied segment. Using 2,039,222 firm-year observations from 430,800 private firms between 2012 and 2021, we evaluate four machine learning models, five sampling techniques, and two distinct economic periods. Results show that sampling choice strongly influences accuracy and feature relevance, depending on macroeconomic conditions. Importantly, simple interpretable models built on theoretically grounded features (e.g., Altman, 1968) achieve robust predictions, challenging prevailing reliance on complex methods, while Extreme Gradient Boosting (XGBoost) consistently outperforms alternatives. By focusing on private firms, the study provides unique insights and underscores methodological choices crucial for reliable bankruptcy prediction.

Suggested Citation

  • Asif M. Huq & Wonder Mahembe, 2025. "Influence of sampling methods on bankruptcy prediction: normal vs. abnormal economic conditions," International Journal of Banking, Accounting and Finance, Inderscience Enterprises Ltd, vol. 15(5), pages 1-32.
  • Handle: RePEc:ids:injbaf:v:15:y:2025:i:5:p:1-32
    as

    Download full text from publisher

    File URL: http://www.inderscience.com/link.php?id=149819
    Download Restriction: Open Access
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ids:injbaf:v:15:y:2025:i:5:p:1-32. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sarah Parker (email available below). General contact details of provider: http://www.inderscience.com/browse/index.php?journalID=277 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.