IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1911.09858.html

Investigating bankruptcy prediction models in the presence of extreme class imbalance and multiple stages of economy

Author

Listed:
  • Sheikh Rabiul Islam
  • William Eberle
  • Sheikh K. Ghafoor
  • Sid C. Bundy
  • Douglas A. Talbert
  • Ambareen Siraj

Abstract

In the area of credit risk analytics, current Bankruptcy Prediction Models (BPMs) struggle with (a) the limited availability of comprehensive, real-world datasets and (b) extreme class imbalance in the data (i.e., very few samples for the minority class), which degrades the performance of the prediction model. Moreover, little research has compared the relative performance of well-known BPMs on public datasets while addressing the class imbalance problem. In this work, we apply eight classes of well-known BPMs, as suggested by a review of decades of literature, to a new public dataset, the Freddie Mac Single-Family Loan-Level Dataset, resampling the minority class (i.e., adding synthetic minority samples) to tackle class imbalance. We also apply some recent AI techniques (e.g., tree-based ensemble techniques) that demonstrate potentially better results on models trained with resampled data. In addition, from the analysis of 19 years (1999-2017) of data, we discover that models behave differently when presented with sudden changes in the economy (e.g., a global financial crisis) that result in abrupt fluctuations in the national default rate. In summary, this study should aid practitioners and researchers in choosing an appropriate model for data that contains class imbalance and spans various economic stages.
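The abstract describes resampling as adding synthetic minority samples but does not say which method is used. As a minimal sketch only, the following shows one common approach, SMOTE-style interpolation between a minority sample and one of its nearest minority neighbours. The function name, the parameters `k` and `seed`, and the toy loan features are all hypothetical and are not taken from the paper.

```python
import random


def synthetic_minority_samples(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    point and one of its k nearest minority neighbours (SMOTE-style)."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority points by squared Euclidean distance.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])
        # Pick a random point on the segment between base and neighbour.
        t = rng.random()
        synthetic.append(tuple(a + t * (b - a) for a, b in zip(base, neighbour)))
    return synthetic


# Hypothetical toy data: (credit_score, loan_to_value) for defaulted loans.
defaults = [(620.0, 95.0), (640.0, 90.0), (600.0, 97.0), (655.0, 88.0)]
new_rows = synthetic_minority_samples(defaults, n_new=6)
balanced_minority = defaults + new_rows
print(len(balanced_minority))
```

Because every synthetic row is a convex combination of two real minority rows, it stays within the range of the observed minority features. In practice such resampling is usually applied to the training split only, so that evaluation data contains no synthetic samples.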

Suggested Citation

  • Sheikh Rabiul Islam & William Eberle & Sheikh K. Ghafoor & Sid C. Bundy & Douglas A. Talbert & Ambareen Siraj, 2019. "Investigating bankruptcy prediction models in the presence of extreme class imbalance and multiple stages of economy," Papers 1911.09858, arXiv.org.
  • Handle: RePEc:arx:papers:1911.09858

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1911.09858
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Fitzpatrick, Trevor & Mues, Christophe, 2016. "An empirical comparison of classification algorithms for mortgage default prediction: evidence from a distressed mortgage market," European Journal of Operational Research, Elsevier, vol. 249(2), pages 427-439.
    2. Bhattacharya, Arnab & Wilson, Simon P. & Soyer, Refik, 2019. "A Bayesian approach to modeling mortgage default and prepayment," European Journal of Operational Research, Elsevier, vol. 274(3), pages 1112-1124.
    3. Dimitras, A. I. & Slowinski, R. & Susmaga, R. & Zopounidis, C., 1999. "Business failure prediction using rough sets," European Journal of Operational Research, Elsevier, vol. 114(2), pages 263-280, April.
    4. Beaver, W. H., 1966. "Financial Ratios As Predictors Of Failure," Journal of Accounting Research, Wiley Blackwell, vol. 4, pages 71-111.
    5. von Furstenberg, George M, 1969. "Default Risk on FHA-Insured Home Mortgages as a Function of the Terms of Financing: A Quantitative Analysis," Journal of Finance, American Finance Association, vol. 24(3), pages 459-477, June.
    6. Constantin Zopounidis & Michael Doumpos, 1999. "Business failure prediction using the UTADIS multicriteria analysis method," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 50(11), pages 1138-1148, November.
    7. Alireza Hooman & Govindan Marthandan & Wan Fadzilah Wan Yusoff & Mohana Omid & Sasan Karamizadeh, 2016. "Statistical and data mining methods in credit scoring," Journal of Developing Areas, Tennessee State University, College of Business, vol. 50(5), pages 371-381, Special I.
    8. Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
    9. Sheikh Rabiul Islam & Sheikh Khaled Ghafoor & William Eberle, 2018. "Mining Illegal Insider Trading of Stocks: A Proactive Approach," Papers 1807.00939, arXiv.org, revised Nov 2018.
    10. Edward I. Altman, 1968. "The Prediction Of Corporate Bankruptcy: A Discriminant Analysis," Journal of Finance, American Finance Association, vol. 23(1), pages 193-194, March.
    11. Ohlson, J. A., 1980. "Financial Ratios And The Probabilistic Prediction Of Bankruptcy," Journal of Accounting Research, Wiley Blackwell, vol. 18(1), pages 109-131.
    12. Beaver, W. H., 1966. "Financial Ratios As Predictors Of Failure - Reply," Journal of Accounting Research, Wiley Blackwell, vol. 4, pages 123-127.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhou, Fanyin & Fu, Lijun & Li, Zhiyong & Xu, Jiawei, 2022. "The recurrence of financial distress: A survival analysis," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1100-1115.
    2. Haoming Wang & Xiangdong Liu, 2021. "Undersampling bankruptcy prediction: Taiwan bankruptcy data," PLOS ONE, Public Library of Science, vol. 16(7), pages 1-17, July.
    3. Thomas E. Mckee, 2000. "Developing a bankruptcy prediction model via rough sets theory," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 9(3), pages 159-173, September.
    4. Jie Sun, 2012. "Integration Of Random Sample Selection, Support Vector Machines And Ensembles For Financial Risk Forecasting With An Empirical Analysis On The Necessity Of Feature Selection," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 19(4), pages 229-246, October.
    5. Fayçal Mraihi, 2016. "Distressed Company Prediction Using Logistic Regression: Tunisian’s Case," Quarterly Journal of Business Studies, Research Academy of Social Sciences, vol. 2(1), pages 34-54.
    6. Salwa Kessioui & Michalis Doumpos & Constantin Zopounidis, 2023. "A Bibliometric Overview of the State-of-the-Art in Bankruptcy Prediction Methods and Applications," World Scientific Book Chapters, in: Emilios Galariotis & Alexandros Garefalakis & Christos Lemonakis & Marios Menexiadis & Constantin Zo (ed.), Governance and Financial Performance Current Trends and Perspectives, chapter 6, pages 123-153, World Scientific Publishing Co. Pte. Ltd.
    7. Nikolaos Daskalakis & Nikolaos Aggelakis & John Filos, 2022. "Applying, Updating and Comparing Bankruptcy Forecasting Models. The Case of Greece," Journal of Accounting and Management Information Systems, Faculty of Accounting and Management Information Systems, The Bucharest University of Economic Studies, vol. 21(3), pages 335-354, September.
    8. Li, Hui & Sun, Jie, 2012. "Forecasting business failure: The use of nearest-neighbour support vectors and correcting imbalanced samples – Evidence from the Chinese hotel industry," Tourism Management, Elsevier, vol. 33(3), pages 622-634.
    9. Şaban Çelik, 2013. "Micro Credit Risk Metrics: A Comprehensive Review," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 20(4), pages 233-272, October.
    10. Zeineb Affes & Rania Hentati-Kaffel, 2019. "Predicting US Banks Bankruptcy: Logit Versus Canonical Discriminant Analysis," Computational Economics, Springer;Society for Computational Economics, vol. 54(1), pages 199-244, June.
    11. Fayçal Mraihi & Inane Kanzari & Mohamed Tahar Rajhi, 2015. "Development of a Prediction Model of Failure in Tunisian Companies: Comparison between Logistic Regression and Support Vector Machines," International Journal of Empirical Finance, Research Academy of Social Sciences, vol. 4(3), pages 184-205.
    12. Bhanu Pratap Singh & Alok Kumar Mishra, 2016. "Re-estimation and comparisons of alternative accounting based bankruptcy prediction models for Indian companies," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 2(1), pages 1-28, December.
    13. Antonio Davila & George Foster & Xiaobin He & Carlos Shimizu, 2015. "The rise and fall of startups: Creation and destruction of revenue and jobs by young companies," Australian Journal of Management, Australian School of Business, vol. 40(1), pages 6-35, February.
    14. Li, Chunyu & Lou, Chenxin & Luo, Dan & Xing, Kai, 2021. "Chinese corporate distress prediction using LASSO: The role of earnings management," International Review of Financial Analysis, Elsevier, vol. 76(C).
    15. Guido Max Mantovani & Gregory Gadzinski, 2022. "How to Rate the Financial Performance of Private Companies? A Tailored Integrated Rating Methodology Applied to North-Eastern Italian Districts," JRFM, MDPI, vol. 15(11), pages 1-18, October.
    16. Enrico Supino & Nicola Piras, 2022. "Le performance dei modelli di credit scoring in contesti di forte instabilità macroeconomica: il ruolo delle Reti Neurali Artificiali," MANAGEMENT CONTROL, FrancoAngeli Editore, vol. 2022(2), pages 41-61.
    17. Adriana Csikosova & Maria Janoskova & Katarina Culkova, 2020. "Application of Discriminant Analysis for Avoiding the Risk of Quarry Operation Failure," JRFM, MDPI, vol. 13(10), pages 1-14, September.
    18. Trueck, Stefan & Rachev, Svetlozar T., 2008. "Rating Based Modeling of Credit Risk," Elsevier Monographs, Elsevier, edition 1, number 9780123736833.
    19. Le, Hong Hanh & Viviani, Jean-Laurent, 2018. "Predicting bank failure: An improvement by implementing a machine-learning approach to classical financial ratios," Research in International Business and Finance, Elsevier, vol. 44(C), pages 16-25.
    20. Ahsan Habib & Mabel D' Costa & Hedy Jiaying Huang & Md. Borhan Uddin Bhuiyan & Li Sun, 2020. "Determinants and consequences of financial distress: review of the empirical literature," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 60(S1), pages 1023-1075, April.
