Printed from https://ideas.repec.org/a/gam/jjrfmx/v16y2023i12p496-d1288822.html

Machine Learning for Enhanced Credit Risk Assessment: An Empirical Approach

Author

Listed:
  • Nicolas Suhadolnik

    (Institute of Mathematics and Computer Science, University of Sao Paulo, Sao Carlos 13566-590, Brazil;
    Regional Bank for Development of the South Region, Curitiba 80030-900, Brazil)

  • Jo Ueyama

    (Institute of Mathematics and Computer Science, University of Sao Paulo, Sao Carlos 13566-590, Brazil)

  • Sergio Da Silva

    (Graduate Program in Economics, Federal University of Santa Catarina, Florianopolis 88049-970, Brazil)

Abstract

Financial institutions and regulators increasingly rely on large-scale data analysis, particularly machine learning, for credit decisions. This paper assesses ten machine learning algorithms using a dataset of over 2.5 million observations from a financial institution. We also summarize key statistical and machine learning models in credit scoring and review current research findings. Our results indicate that ensemble models, particularly XGBoost, outperform traditional algorithms such as logistic regression in credit classification. Researchers and experts in credit risk can use this work as a practical reference, as it covers data processing, exploratory data analysis, modeling, and evaluation metrics.

Suggested Citation

  • Nicolas Suhadolnik & Jo Ueyama & Sergio Da Silva, 2023. "Machine Learning for Enhanced Credit Risk Assessment: An Empirical Approach," JRFM, MDPI, vol. 16(12), pages 1-21, November.
  • Handle: RePEc:gam:jjrfmx:v:16:y:2023:i:12:p:496-:d:1288822

    Download full text from publisher

    File URL: https://www.mdpi.com/1911-8074/16/12/496/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1911-8074/16/12/496/
    Download Restriction: no

    References listed on IDEAS

    1. Hussein A. Abdou & John Pointon, 2011. "Credit Scoring, Statistical Techniques And Evaluation Criteria: A Review Of The Literature," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 18(2-3), pages 59-88, April.
    2. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    3. Tobias Berg & Valentin Burg & Ana Gombović & Manju Puri, 2020. "On the Rise of FinTechs: Credit Scoring Using Digital Footprints," The Review of Financial Studies, Society for Financial Studies, vol. 33(7), pages 2845-2897.
    4. Finlay, Steven, 2011. "Multiple classifier architectures and their application to credit risk assessment," European Journal of Operational Research, Elsevier, vol. 210(2), pages 368-378, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Doumpos, Michalis & Zopounidis, Constantin & Gounopoulos, Dimitrios & Platanakis, Emmanouil & Zhang, Wenke, 2023. "Operational research and artificial intelligence methods in banking," European Journal of Operational Research, Elsevier, vol. 306(1), pages 1-16.
    2. Bakalli, Gaetan & Guerrier, Stéphane & Scaillet, Olivier, 2023. "A penalized two-pass regression to predict stock returns with time-varying risk premia," Journal of Econometrics, Elsevier, vol. 237(2).
    3. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    4. Tobias Götze & Marc Gürtler & Eileen Witowski, 2020. "Improving CAT bond pricing models via machine learning," Journal of Asset Management, Palgrave Macmillan, vol. 21(5), pages 428-446, September.
    5. Wen, Danyan & Liu, Li & Wang, Yudong & Zhang, Yaojie, 2022. "Forecasting crude oil market returns: Enhanced moving average technical indicators," Resources Policy, Elsevier, vol. 76(C).
    6. Daníelsson, Jón & Macrae, Robert & Uthemann, Andreas, 2022. "Artificial intelligence and systemic risk," Journal of Banking & Finance, Elsevier, vol. 140(C).
    7. Christian Fieberg & Daniel Metko & Thorsten Poddig & Thomas Loy, 2023. "Machine learning techniques for cross-sectional equity returns’ prediction," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 45(1), pages 289-323, March.
    8. Obaid, Khaled & Pukthuanthong, Kuntara, 2022. "A picture is worth a thousand words: Measuring investor sentiment by combining machine learning and photos from news," Journal of Financial Economics, Elsevier, vol. 144(1), pages 273-297.
    9. Lee, Ji Hyung & Shi, Zhentao & Gao, Zhan, 2022. "On LASSO for predictive regression," Journal of Econometrics, Elsevier, vol. 229(2), pages 322-349.
    10. Sigrist, Fabio & Leuenberger, Nicola, 2023. "Machine learning for corporate default risk: Multi-period prediction, frailty correlation, loan portfolios, and tail probabilities," European Journal of Operational Research, Elsevier, vol. 305(3), pages 1390-1406.
    11. Jiaju Miao & Pawel Polak, 2023. "Online Ensemble of Models for Optimal Predictive Performance with Applications to Sector Rotation Strategy," Papers 2304.09947, arXiv.org.
    12. Zhao, Albert Bo & Cheng, Tingting, 2022. "Stock return prediction: Stacking a variety of models," Journal of Empirical Finance, Elsevier, vol. 67(C), pages 288-317.
    13. Yuan Liao & Xinjie Ma & Andreas Neuhierl & Zhentao Shi, 2023. "Economic Forecasts Using Many Noises," Papers 2312.05593, arXiv.org, revised Dec 2023.
    14. Bolin Mao & Chenhui Chu & Yuta Nakashima & Hajime Nagahara, 2022. "Efficient Market Hypothesis Test with Stock Tweets and Natural Language Processing Models," KIER Working Papers 1082, Kyoto University, Institute of Economic Research.
    15. Back, Kerry & Crotty, Kevin & Kazempour, Seyed Mohammad, 2022. "Validity, tightness, and forecasting power of risk premium bounds," Journal of Financial Economics, Elsevier, vol. 144(3), pages 732-760.
    16. Celso Brunetti & Marc Joëts & Valérie Mignon, 2023. "Reasons Behind Words: OPEC Narratives and the Oil Market," Working Papers hal-04196053, HAL.
    17. Shi, Qi, 2023. "The RP-PCA factors and stock return predictability: An aligned approach," The North American Journal of Economics and Finance, Elsevier, vol. 64(C).
    18. Hector F. Calvo-Pardo & Tullio Mancini & Jose Olmo, 2020. "Neural Network Models for Empirical Finance," JRFM, MDPI, vol. 13(11), pages 1-22, October.
    19. Esfandiar Maasoumi & Jianqiu Wang & Zhuo Wang & Ke Wu, 2024. "Identifying factors via automatic debiased machine learning," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(3), pages 438-461, April.
    20. Doron Avramov & Si Cheng & Lior Metzker, 2023. "Machine Learning vs. Economic Restrictions: Evidence from Stock Return Predictability," Management Science, INFORMS, vol. 69(5), pages 2587-2619, May.
