IDEAS home Printed from https://ideas.repec.org/a/eee/jbrese/v130y2021icp200-209.html
   My bibliography  Save this article

Combining feature selection, instance selection, and ensemble classification techniques for improved financial distress prediction

Author

Listed:
  • Tsai, Chih-Fong
  • Sue, Kuen-Liang
  • Hu, Ya-Han
  • Chiu, Andy

Abstract

Bankruptcy prediction and credit scoring are major problems in financial distress prediction. Studies have shown that prediction models can be made more effective by performing data preprocessing procedures. Moreover, classifier ensembles are likely to outperform single classifiers. Although feature selection, instance selection, and classifier ensembles are known to affect final prediction results, their combined effects on bankruptcy prediction and credit scoring problems have not been fully explored. This study compares the performance of three feature selection algorithms, three instance selection algorithms, four classification algorithms, and two ensemble learning techniques. The results obtained using five bankruptcy prediction and five credit scoring datasets indicate that by carefully considering the combination of these three factors, better prediction models can be developed than by considering only one related factor.

Suggested Citation

  • Tsai, Chih-Fong & Sue, Kuen-Liang & Hu, Ya-Han & Chiu, Andy, 2021. "Combining feature selection, instance selection, and ensemble classification techniques for improved financial distress prediction," Journal of Business Research, Elsevier, vol. 130(C), pages 200-209.
  • Handle: RePEc:eee:jbrese:v:130:y:2021:i:c:p:200-209
    DOI: 10.1016/j.jbusres.2021.03.018
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0148296321001776
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jbusres.2021.03.018?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Maldonado, Sebastián & Pérez, Juan & Bravo, Cristián, 2017. "Cost-based feature selection for Support Vector Machines: An application in credit scoring," European Journal of Operational Research, Elsevier, vol. 261(2), pages 656-665.
    2. Ligang Zhou & Kin Keung Lai, 2017. "AdaBoost Models for Corporate Bankruptcy Prediction with Missing Data," Computational Economics, Springer;Society for Computational Economics, vol. 50(1), pages 69-94, June.
    3. Xiaobo Tang & Shixuan Li & Mingliang Tan & Wenxuan Shi, 2020. "Incorporating textual and management factors into financial distress prediction: A comparative study of machine learning methods," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(5), pages 769-787, August.
    4. Pérez-Martín, A. & Pérez-Torregrosa, A. & Vaca, M., 2018. "Big Data techniques to measure credit banking risk in home equity loans," Journal of Business Research, Elsevier, vol. 89(C), pages 448-454.
    5. Climent, Francisco & Momparler, Alexandre & Carmona, Pedro, 2019. "Anticipating bank distress in the Eurozone: An Extreme Gradient Boosting approach," Journal of Business Research, Elsevier, vol. 101(C), pages 885-896.
    6. Ravi Kumar, P. & Ravi, V., 2007. "Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review," European Journal of Operational Research, Elsevier, vol. 180(1), pages 1-28, July.
    7. Finlay, Steven, 2011. "Multiple classifier architectures and their application to credit risk assessment," European Journal of Operational Research, Elsevier, vol. 210(2), pages 368-378, April.
    8. Liang, Deron & Tsai, Chih-Fong & Lu, Hung-Yuan (Richard) & Chang, Li-Shin, 2020. "Combining corporate governance indicators with stacking ensembles for financial distress prediction," Journal of Business Research, Elsevier, vol. 120(C), pages 137-146.
    9. Rokach, Lior, 2009. "Taxonomy for characterizing ensemble methods in classification tasks: A review and annotated bibliography," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 4046-4072, October.
    10. Thomas, Lyn C., 2000. "A survey of credit and behavioural scoring: forecasting financial risk of lending to consumers," International Journal of Forecasting, Elsevier, vol. 16(2), pages 149-172.
    11. Liang, Deron & Lu, Chia-Chi & Tsai, Chih-Fong & Shih, Guan-An, 2016. "Financial ratios and corporate governance indicators in bankruptcy prediction: A comprehensive study," European Journal of Operational Research, Elsevier, vol. 252(2), pages 561-572.
    12. Delen, Dursun & Cogdell, Douglas & Kasap, Nihat, 2012. "A comparative analysis of data mining methods in predicting NCAA bowl outcomes," International Journal of Forecasting, Elsevier, vol. 28(2), pages 543-552.
    13. Kyoung-jae Kim & Kichun Lee & Hyunchul Ahn, 2018. "Predicting Corporate Financial Sustainability Using Novel Business Analytics," Sustainability, MDPI, vol. 11(1), pages 1-17, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Feng, Yi & Yin, Yunqiang & Wang, Dujuan & Dhamotharan, Lalitha, 2022. "A dynamic ensemble selection method for bank telemarketing sales prediction," Journal of Business Research, Elsevier, vol. 139(C), pages 368-382.
    2. Zhao, Shuping & Xu, Kai & Wang, Zhao & Liang, Changyong & Lu, Wenxing & Chen, Bo, 2022. "Financial distress prediction by combining sentiment tone features," Economic Modelling, Elsevier, vol. 106(C).
    3. Lapshin, Viktor & Anton, Markov, 2022. "MCMC-based credit rating aggregation algorithm to tackle data insufficiency," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 68, pages 50-72.
    4. Lenka Papíková & Mário Papík, 2022. "Effects of classification, feature selection, and resampling methods on bankruptcy prediction of small and medium‐sized enterprises," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 29(4), pages 254-281, October.
    5. Jiaming Liu & Chengzhang Li & Peng Ouyang & Jiajia Liu & Chong Wu, 2023. "Interpreting the prediction results of the tree‐based gradient boosting models for financial distress prediction with an explainable machine learning approach," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(5), pages 1112-1137, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sami Ben Jabeur & Nicolae Stef & Pedro Carmona, 2023. "Bankruptcy Prediction using the XGBoost Algorithm and Variable Importance Feature Engineering," Computational Economics, Springer;Society for Computational Economics, vol. 61(2), pages 715-741, February.
    2. Jabeur, Sami Ben & Gharib, Cheima & Mefteh-Wali, Salma & Arfi, Wissal Ben, 2021. "CatBoost model and artificial intelligence techniques for corporate failure prediction," Technological Forecasting and Social Change, Elsevier, vol. 166(C).
    3. Huei-Wen Teng & Michael Lee, 2019. "Estimation Procedures of Using Five Alternative Machine Learning Methods for Predicting Credit Card Default," Review of Pacific Basin Financial Markets and Policies (RPBFMP), World Scientific Publishing Co. Pte. Ltd., vol. 22(03), pages 1-27, September.
    4. Doumpos, Michalis & Andriosopoulos, Kostas & Galariotis, Emilios & Makridou, Georgia & Zopounidis, Constantin, 2017. "Corporate failure prediction in the European energy sector: A multicriteria approach and the effect of country characteristics," European Journal of Operational Research, Elsevier, vol. 262(1), pages 347-360.
    5. Ben Jabeur, Sami & Serret, Vanessa, 2023. "Bankruptcy prediction using fuzzy convolutional neural networks," Research in International Business and Finance, Elsevier, vol. 64(C).
    6. Almaskati, Nawaf & Bird, Ron & Yeung, Danny & Lu, Yue, 2021. "A horse race of models and estimation methods for predicting bankruptcy," Advances in accounting, Elsevier, vol. 52(C).
    7. Liang, Deron & Tsai, Chih-Fong & Lu, Hung-Yuan (Richard) & Chang, Li-Shin, 2020. "Combining corporate governance indicators with stacking ensembles for financial distress prediction," Journal of Business Research, Elsevier, vol. 120(C), pages 137-146.
    8. Serrano-Cinca, Carlos & Gutiérrez-Nieto, Begoña & Bernate-Valbuena, Martha, 2019. "The use of accounting anomalies indicators to predict business failure," European Management Journal, Elsevier, vol. 37(3), pages 353-375.
    9. du Jardin, Philippe, 2021. "Forecasting corporate failure using ensemble of self-organizing neural networks," European Journal of Operational Research, Elsevier, vol. 288(3), pages 869-885.
    10. Hyeongjun Kim & Hoon Cho & Doojin Ryu, 2020. "Corporate Default Predictions Using Machine Learning: Literature Review," Sustainability, MDPI, vol. 12(16), pages 1-11, August.
    11. Mai, Feng & Tian, Shaonan & Lee, Chihoon & Ma, Ling, 2019. "Deep learning models for bankruptcy prediction using textual disclosures," European Journal of Operational Research, Elsevier, vol. 274(2), pages 743-758.
    12. Carlos Serrano-Cinca & Yolanda Fuertes-Call鮠 & Bego uti鲲ez-Nieto & Beatriz Cuellar-Fernᮤez, 2014. "Path modelling to bankruptcy: causes and symptoms of the banking crisis," Applied Economics, Taylor & Francis Journals, vol. 46(31), pages 3798-3811, November.
    13. Jiang, Cuiqing & Zhou, Yiru & Chen, Bo, 2023. "Mining semantic features in patent text for financial distress prediction," Technological Forecasting and Social Change, Elsevier, vol. 190(C).
    14. Noora Alzayed & Rasol Eskandari & Hassan Yazdifar, 2023. "Bank failure prediction: corporate governance and financial indicators," Review of Quantitative Finance and Accounting, Springer, vol. 61(2), pages 601-631, August.
    15. Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
    16. María Escribano-Navas & German Gemar, 2021. "Gender and Bankruptcy: A Hotel Survival Econometric Analysis," Sustainability, MDPI, vol. 13(12), pages 1-13, June.
    17. Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.
    18. Jingjing Long & Cuiqing Jiang & Stanko Dimitrov & Zhao Wang, 2022. "Clues from networks: quantifying relational risk for credit risk evaluation of SMEs," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-41, December.
    19. Tomasz Korol, 2019. "Dynamic Bankruptcy Prediction Models for European Enterprises," JRFM, MDPI, vol. 12(4), pages 1-15, December.
    20. Pham, Xuan T.T. & Ho, Tin H., 2021. "Using boosting algorithms to predict bank failure: An untold story," International Review of Economics & Finance, Elsevier, vol. 76(C), pages 40-54.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jbrese:v:130:y:2021:i:c:p:200-209. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jbusres .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.