IDEAS home Printed from https://ideas.repec.org/a/bit/bsrysr/v16y2025i1p198-218n1010.html
   My bibliography  Save this article

Default Prediction in the Finance Industry Based on Ensemble Learning: Combining Machine Learning and Deep Learning

Author

Listed:
  • Hoanh-Su Le

    (University of Economics and Law, Ho Chi Minh City, Vietnam, Vietnam National University, Ho Chi Minh City, Vietnam)

  • Phong Le Quang Chan

    (University of Economics and Law, Ho Chi Minh City, Vietnam, Vietnam National University, Ho Chi Minh City, Vietnam)

  • Vinh Truong Cong

    (University of Economics and Law, Ho Chi Minh City, Vietnam, Vietnam National University, Ho Chi Minh City, Vietnam)

  • Nhat Ho Mai Minh

    (University of Economics and Law, Ho Chi Minh City, Vietnam, Vietnam National University, Ho Chi Minh City, Vietnam)

  • Lee Jong-Hwa

    (Dong-Eui University, Busan City, South Korea)

Abstract

Background Financial institutions face significant challenges in predicting loan defaults, which directly impact the non-performing loan (NPL) rate. Incorrect predictions can lead to misinformed decisions and substantial financial losses. Objectives This study aims to enhance default prediction by employing advanced ensemble learning techniques in machine learning and deep learning. Methods/Approach Instead of relying on transformation, fine-tuning, or single algorithm models, this research focuses on combining multiple models using voting and stacking techniques, particularly highlighting a stacking model combining Light Gradient Boosting Machine (LGBM) and Artificial Neural Networks (ANN). Results The ensemble learning methods, especially the LGBM-LSTM and XGB-LSTM stacking models, showed higher precision in identifying borrowers who defaulted, while the LGBM-LSTM and XGB-LSTM voting models excelled in recall and achieved an F1-score 0.1% higher. Both the stacking and voting models attained AUC values close to 90%, indicating strong overall classification performance. Conclusions The findings not only contribute to the fields of lending and peer-to-peer financial operations but also offer crucial insights that aid financial organizations in making well-informed decisions regarding loan processing and management.

Suggested Citation

  • Hoanh-Su Le & Phong Le Quang Chan & Vinh Truong Cong & Nhat Ho Mai Minh & Lee Jong-Hwa, 2025. "Default Prediction in the Finance Industry Based on Ensemble Learning: Combining Machine Learning and Deep Learning," Business Systems Research, Sciendo, vol. 16(1), pages 198-218.
  • Handle: RePEc:bit:bsrysr:v:16:y:2025:i:1:p:198-218:n:1010
    DOI: 10.2478/bsrj-2025-0010
    as

    Download full text from publisher

    File URL: https://doi.org/10.2478/bsrj-2025-0010
    Download Restriction: no

    File URL: https://libkey.io/10.2478/bsrj-2025-0010?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Yufei Xia & Lingyun He & Yinguo Li & Nana Liu & Yanlin Ding, 2020. "Predicting loan default in peer‐to‐peer lending using narrative data," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(2), pages 260-280, March.
    2. Hyeongjun Kim & Hoon Cho & Doojin Ryu, 2020. "Corporate Default Predictions Using Machine Learning: Literature Review," Sustainability, MDPI, vol. 12(16), pages 1-11, August.
    3. Aida Krichene Abdelmoula, 2015. "Bank Credit Risk Analysis with K-Nearest-Neighbor Classifier: Case of Tunisian Banks," Journal of Accounting and Management Information Systems, Faculty of Accounting and Management Information Systems, The Bucharest University of Economic Studies, vol. 14(1), pages 79-106, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Modina, Michele & Pietrovito, Filomena & Gallucci, Carmen & Formisano, Vincenzo, 2023. "Predicting SMEs’ default risk: Evidence from bank-firm relationship data," The Quarterly Review of Economics and Finance, Elsevier, vol. 89(C), pages 254-268.
    2. Zhao, Shuping & Xu, Kai & Wang, Zhao & Liang, Changyong & Lu, Wenxing & Chen, Bo, 2022. "Financial distress prediction by combining sentiment tone features," Economic Modelling, Elsevier, vol. 106(C).
    3. Mario Sanz-Guerrero & Javier Arroyo, 2024. "Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending," Papers 2401.16458, arXiv.org, revised Mar 2025.
    4. Ligang Zhou & Chao Ma, 2023. "A Comparison of Different Rules on Loans Evaluation in Peer-to-Peer Lending by Gradient Boosting Models Under Moving Windows with Two Timestamps," Computational Economics, Springer;Society for Computational Economics, vol. 62(4), pages 1481-1504, December.
    5. Anton Gerunov, 2023. "Modern Approaches To Forecasting Firm Default Rates Over The Short To Medium Term: An Application To A Panel Of Polish Companies," Yearbook of the Faculty of Economics and Business Administration, Sofia University, Faculty of Economics and Business Administration, Sofia University St Kliment Ohridski - Bulgaria, vol. 22(1), pages 5-15, October.
    6. Dariusz Sala & Kostiantyn Pavlov & Olena Pavlova & Anton Demchuk & Liubomur Matiichuk & Dariusz Cichoń, 2023. "Determining of the Bankrupt Contingency as the Level Estimation Method of Western Ukraine Gas Distribution Enterprises’ Competence Capacity," Energies, MDPI, vol. 16(4), pages 1-13, February.
    7. Tian, Geran & Wang, Xiaowen & Wu, Weixing, 2021. "Borrow low, lend high: Credit arbitrage by sophisticated investors," Pacific-Basin Finance Journal, Elsevier, vol. 67(C).
    8. Lin Li, 2023. "Investigating risk assessment in post-pandemic household cryptocurrency investments: an explainable machine learning approach," Journal of Asset Management, Palgrave Macmillan, vol. 24(4), pages 255-267, July.
    9. Yasemin Lheureux, 2024. "Predictive insights: leveraging Twitter sentiments and machine learning for environmental, social and governance controversy prediction," Journal of Computational Social Science, Springer, vol. 7(1), pages 23-44, April.
    10. Fazlollah Soleymani & Houman Masnavi & Stanford Shateyi, 2020. "Classifying a Lending Portfolio of Loans with Dynamic Updates via a Machine Learning Technique," Mathematics, MDPI, vol. 9(1), pages 1-15, December.
    11. Helmut Wasserbacher & Martin Spindler, 2024. "Credit Ratings: Heterogeneous Effect on Capital Structure," Papers 2406.18936, arXiv.org.
    12. Mohammad Mahbobi & Salman Kimiagari & Marriappan Vasudevan, 2023. "Credit risk classification: an integrated predictive accuracy algorithm using artificial and deep neural networks," Annals of Operations Research, Springer, vol. 330(1), pages 609-637, November.
    13. Stefano Filomeni & Udichibarna Bose & Anastasios Megaritis & Athanasios Triantafyllou, 2024. "Can market information outperform hard and soft information in predicting corporate defaults?," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 29(3), pages 3567-3592, July.
    14. Juryon Paik & Kwangho Ko, 2024. "Seasonal Analysis and Risk Management Strategies for Credit Guarantee Funds: A Case Study from Republic of Korea," Stats, MDPI, vol. 8(1), pages 1-35, December.
    15. Alessandro Bitetto & Stefano Filomeni & Michele Modina, 2021. "Understanding corporate default using Random Forest: The role of accounting and market information," DEM Working Papers Series 205, University of Pavia, Department of Economics and Management.
    16. Bitetto, Alessandro & Cerchiello, Paola & Mertzanis, Charilaos, 2023. "Measuring financial soundness around the world: A machine learning approach," International Review of Financial Analysis, Elsevier, vol. 85(C).
    17. Hyunwoo Woo & So Young Sohn, 2022. "A credit scoring model based on the Myers–Briggs type indicator in online peer-to-peer lending," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-19, December.
    18. Pejman Peykani & Mostafa Sargolzaei & Negin Sanadgol & Amir Takaloo & Hamidreza Kamyabfar, 2023. "The application of structural and machine learning models to predict the default risk of listed companies in the Iranian capital market," PLOS ONE, Public Library of Science, vol. 18(11), pages 1-24, November.
    19. Xia, Yufei & Zhao, Junhao & He, Lingyun & Li, Yinguo & Yang, Xiaoli, 2021. "Forecasting loss given default for peer-to-peer loans via heterogeneous stacking ensemble approach," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1590-1613.
    20. Kui Wang & Jie Wan & Gang Li & Hao Sun, 2022. "A Hybrid Algorithm-Level Ensemble Model for Imbalanced Credit Default Prediction in the Energy Industry," Energies, MDPI, vol. 15(14), pages 1-18, July.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    JEL classification:

    • G17 - Financial Economics - - General Financial Markets - - - Financial Forecasting and Simulation
    • G21 - Financial Economics - - Financial Institutions and Services - - - Banks; Other Depository Institutions; Micro Finance Institutions; Mortgages
    • G30 - Financial Economics - - Corporate Finance and Governance - - - General
    • O32 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Management of Technological Innovation and R&D

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bit:bsrysr:v:16:y:2025:i:1:p:198-218:n:1010. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.sciendo.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.