
Bankruptcy Prediction using the XGBoost Algorithm and Variable Importance Feature Engineering

Author

Listed:
  • Sami Ben Jabeur

    (Sciences and Humanities Confluence Research Center - UCLY, ESDES)

  • Nicolae Stef

    (Université Bourgogne Franche-Comté)

  • Pedro Carmona

    (University of Valencia)

Abstract

The emergence of big data, information technology, and social media provides an enormous amount of information about firms’ current financial health. Faced with this abundance of data, decision makers must identify the crucial information needed to build an effective, workable prediction model that produces high-quality estimates. Feature selection can be used to retain the significant variables without lowering classification performance. In addition, one of the main goals of bankruptcy prediction is to identify the model specification with the strongest explanatory power. Building on this premise, an improved XGBoost algorithm based on feature importance selection (FS-XGBoost) is proposed. FS-XGBoost is compared with seven machine learning algorithms based on three well-known feature selection methods that are frequently used in bankruptcy prediction: stepwise discriminant analysis, stepwise logistic regression, and partial least squares discriminant analysis (PLS-DA). Our experimental results confirm that FS-XGBoost provides more accurate predictions, outperforming traditional feature selection methods.
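
The abstract does not spell out how FS-XGBoost ranks variables, so the following is only a minimal sketch of the general idea of importance-based feature selection with gradient-boosted trees. It is not the authors' implementation and does not use the XGBoost library: it boosts one-split trees (stumps) in pure Python, scores each split with an XGBoost-style second-order gain (up to a constant factor), and keeps the k features with the largest accumulated gain. The synthetic data, the function names (`make_data`, `best_split`, `gain_importance`, `select_top_k`) and all parameter values are assumptions made for illustration.

```python
import math
import random


def make_data(n=400, seed=0):
    """Synthetic 'firms' (assumption): features 0 and 1 drive failure, 2-4 are noise."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        row = [rng.gauss(0, 1) for _ in range(5)]
        score = 2.0 * row[0] - 1.5 * row[1] + rng.gauss(0, 0.5)
        X.append(row)
        y.append(1 if score > 0 else 0)
    return X, y


def best_split(X, grad, hess, lam=1.0):
    """Return (gain, feature, threshold) of the single best split, or (0.0, None, None)."""
    n, d = len(X), len(X[0])
    G, H = sum(grad), sum(hess)
    best_gain, best_j, best_thr = 0.0, None, None
    for j in range(d):
        order = sorted(range(n), key=lambda i: X[i][j])
        gl = hl = 0.0
        for pos in range(n - 1):
            i, nxt = order[pos], order[pos + 1]
            gl += grad[i]
            hl += hess[i]
            if X[i][j] == X[nxt][j]:
                continue  # cannot split between identical values
            gr, hr = G - gl, H - hl
            # Second-order gain of splitting versus keeping one leaf.
            gain = gl * gl / (hl + lam) + gr * gr / (hr + lam) - G * G / (H + lam)
            if gain > best_gain:
                best_gain, best_j, best_thr = gain, j, (X[i][j] + X[nxt][j]) / 2
    return best_gain, best_j, best_thr


def gain_importance(X, y, rounds=20, lr=0.3, lam=1.0):
    """Boost stumps on the logistic loss; return each feature's share of total gain."""
    n, d = len(X), len(X[0])
    margin = [0.0] * n
    importance = [0.0] * d
    for _ in range(rounds):
        prob = [1.0 / (1.0 + math.exp(-m)) for m in margin]
        grad = [p - t for p, t in zip(prob, y)]
        hess = [p * (1.0 - p) for p in prob]
        gain, j, thr = best_split(X, grad, hess, lam)
        if j is None:
            break  # no split improves the objective
        importance[j] += gain
        left = [i for i in range(n) if X[i][j] <= thr]
        right = [i for i in range(n) if X[i][j] > thr]
        for side in (left, right):
            # Regularised Newton step for the leaf weight.
            w = -sum(grad[i] for i in side) / (sum(hess[i] for i in side) + lam)
            for i in side:
                margin[i] += lr * w
    total = sum(importance) or 1.0
    return [v / total for v in importance]


def select_top_k(importance, k):
    """Indices of the k most important features, in ascending index order."""
    ranked = sorted(range(len(importance)), key=lambda j: importance[j], reverse=True)
    return sorted(ranked[:k])


if __name__ == "__main__":
    X, y = make_data()
    imp = gain_importance(X, y)
    keep = select_top_k(imp, k=2)
    X_reduced = [[row[j] for j in keep] for row in X]
    print("importance shares:", [round(v, 3) for v in imp])
    print("selected features:", keep, "reduced width:", len(X_reduced[0]))
```

In a workflow of this kind, the reduced matrix would then be used to refit the final classifier, and the number of retained features k would be chosen by validation rather than fixed in advance.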

Suggested Citation

  • Sami Ben Jabeur & Nicolae Stef & Pedro Carmona, 2023. "Bankruptcy Prediction using the XGBoost Algorithm and Variable Importance Feature Engineering," Computational Economics, Springer;Society for Computational Economics, vol. 61(2), pages 715-741, February.
  • Handle: RePEc:kap:compec:v:61:y:2023:i:2:d:10.1007_s10614-021-10227-1
    DOI: 10.1007/s10614-021-10227-1

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10614-021-10227-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.



    Citations



    Cited by:

    1. Hoang Hiep Nguyen & Jean-Laurent Viviani & Sami Ben Jabeur, 2023. "Bankruptcy prediction using machine learning and Shapley additive explanations," Post-Print hal-04223161, HAL.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jabeur, Sami Ben & Gharib, Cheima & Mefteh-Wali, Salma & Arfi, Wissal Ben, 2021. "CatBoost model and artificial intelligence techniques for corporate failure prediction," Technological Forecasting and Social Change, Elsevier, vol. 166(C).
    2. Ben Jabeur, Sami & Serret, Vanessa, 2023. "Bankruptcy prediction using fuzzy convolutional neural networks," Research in International Business and Finance, Elsevier, vol. 64(C).
    3. ben Jabeur, Sami & Mefteh-Wali, Salma & Carmona, Pedro, 2021. "The impact of institutional and macroeconomic conditions on aggregate business bankruptcy," Structural Change and Economic Dynamics, Elsevier, vol. 59(C), pages 108-119.
    4. Sami Ben Jabeur & Youssef Fahmi, 2018. "Forecasting financial distress for French firms: a comparative study," Empirical Economics, Springer, vol. 54(3), pages 1173-1186, May.
    5. Mohammad Mahdi Mousavi & Jamal Ouenniche, 2018. "Multi-criteria ranking of corporate distress prediction models: empirical evaluation and methodological contributions," Annals of Operations Research, Springer, vol. 271(2), pages 853-886, December.
    6. Zhou, Fanyin & Fu, Lijun & Li, Zhiyong & Xu, Jiawei, 2022. "The recurrence of financial distress: A survival analysis," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1100-1115.
    7. Serrano-Cinca, Carlos & Gutiérrez-Nieto, Begoña & Bernate-Valbuena, Martha, 2019. "The use of accounting anomalies indicators to predict business failure," European Management Journal, Elsevier, vol. 37(3), pages 353-375.
    8. Mohammad Mahdi Mousavi & Jamal Ouenniche & Kaoru Tone, 2023. "A dynamic performance evaluation of distress prediction models," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(4), pages 756-784, July.
    9. Yu Zhao & Huaming Du & Qing Li & Fuzhen Zhuang & Ji Liu & Gang Kou, 2022. "A Comprehensive Survey on Enterprise Financial Risk Analysis from Big Data Perspective," Papers 2211.14997, arXiv.org, revised May 2023.
    10. Elena Gregova & Katarina Valaskova & Peter Adamko & Milos Tumpach & Jaroslav Jaros, 2020. "Predicting Financial Distress of Slovak Enterprises: Comparison of Selected Traditional and Learning Algorithms Methods," Sustainability, MDPI, vol. 12(10), pages 1-17, May.
    11. Ben Jabeur, Sami, 2017. "Bankruptcy prediction using Partial Least Squares Logistic Regression," Journal of Retailing and Consumer Services, Elsevier, vol. 36(C), pages 197-202.
    12. Sami Ben Jabeur & Rabi Belhaj Hassine & Salma Mefteh‐Wali, 2021. "Firm financial performance during the financial crisis: A French case study," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 26(2), pages 2800-2812, April.
    13. Yi Cao & Xiaoquan Liu & Jia Zhai & Shan Hua, 2022. "A two‐stage Bayesian network model for corporate bankruptcy prediction," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 27(1), pages 455-472, January.
    14. Hyeongjun Kim & Hoon Cho & Doojin Ryu, 2020. "Corporate Default Predictions Using Machine Learning: Literature Review," Sustainability, MDPI, vol. 12(16), pages 1-11, August.
    15. Stef, Nicolae & Zenou, Emmanuel, 2021. "Management-to-staff ratio and a firm's exit," Journal of Business Research, Elsevier, vol. 125(C), pages 252-260.
    16. Alberto Tron & Maurizio Dallocchio & Salvatore Ferri & Federico Colantoni, 2023. "Corporate governance and financial distress: lessons learned from an unconventional approach," Journal of Management & Governance, Springer;Accademia Italiana di Economia Aziendale (AIDEA), vol. 27(2), pages 425-456, June.
    17. Zeineb Affes & Rania Hentati-Kaffel, 2019. "Predicting US Banks Bankruptcy: Logit Versus Canonical Discriminant Analysis," Computational Economics, Springer;Society for Computational Economics, vol. 54(1), pages 199-244, June.
    18. Eric Séverin & David Veganzones, 2021. "Can earnings management information improve bankruptcy prediction models?," Annals of Operations Research, Springer, vol. 306(1), pages 247-272, November.
    19. Mai, Feng & Tian, Shaonan & Lee, Chihoon & Ma, Ling, 2019. "Deep learning models for bankruptcy prediction using textual disclosures," European Journal of Operational Research, Elsevier, vol. 274(2), pages 743-758.
    20. Noora Alzayed & Rasol Eskandari & Hassan Yazdifar, 2023. "Bank failure prediction: corporate governance and financial indicators," Review of Quantitative Finance and Accounting, Springer, vol. 61(2), pages 601-631, August.
