IDEAS home Printed from https://ideas.repec.org/a/spr/fininn/v8y2022i1d10.1186_s40854-022-00366-1.html
   My bibliography  Save this article

Measuring the model risk-adjusted performance of machine learning algorithms in credit default prediction

Author

Listed:
  • Andrés Alonso Robisco

    (Banco de España)

  • José Manuel Carbó Martínez

    (Banco de España)

Abstract

Implementing new machine learning (ML) algorithms for credit default prediction is associated with better predictive performance; however, it also generates new model risks, particularly concerning the supervisory validation process. Recent industry surveys often mention that uncertainty about how supervisors might assess these risks could be a barrier to innovation. In this study, we propose a new framework to quantify model risk-adjustments to compare the performance of several ML methods. To address this challenge, we first harness the internal ratings-based approach to identify up to 13 risk components that we classify into 3 main categories—statistics, technology, and market conduct. Second, to evaluate the importance of each risk category, we collect a series of regulatory documents related to three potential use cases—regulatory capital, credit scoring, or provisioning—and we compute the weight of each category according to the intensity of their mentions, using natural language processing and a risk terminology based on expert knowledge. Finally, we test our framework using popular ML models in credit risk, and a publicly available database, to quantify some proxies of a subset of risk factors that we deem representative. We measure the statistical risk according to the number of hyperparameters and the stability of the predictions. The technological risk is assessed through the transparency of the algorithm and the latency of the ML training method, while the market conduct risk is quantified by the time it takes to run a post hoc technique (SHapley Additive exPlanations) to interpret the output.

Suggested Citation

  • Andrés Alonso Robisco & José Manuel Carbó Martínez, 2022. "Measuring the model risk-adjusted performance of machine learning algorithms in credit default prediction," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-35, December.
  • Handle: RePEc:spr:fininn:v:8:y:2022:i:1:d:10.1186_s40854-022-00366-1
    DOI: 10.1186/s40854-022-00366-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1186/s40854-022-00366-1
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1186/s40854-022-00366-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Iñaki Aldasoro & Leonardo Gambacorta & Paolo Giudici & Thomas Leach, 2023. "Operational and Cyber Risks in the Financial Sector," International Journal of Central Banking, International Journal of Central Banking, vol. 19(5), pages 340-402, December.
    2. Dominique Guegan & Bertrand Hassani, 2018. "Regulatory learning: How to supervise machine learning models? An application to credit scoring," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-01835213, HAL.
    3. Yiping Huang & Ms. Longmei Zhang & Zhenhua Li & Han Qiu & Tao Sun & Xue Wang, 2020. "Fintech Credit Risk Assessment for SMEs: Evidence from China," IMF Working Papers 2020/193, International Monetary Fund.
    4. Sarlin, Peter, 2013. "On policymakers’ loss functions and the evaluation of early warning systems," Economics Letters, Elsevier, vol. 119(1), pages 1-7.
    5. Will Dobbie & Andres Liberman & Daniel Paravisini & Vikram Pathania, 2021. "Measuring Bias in Consumer Lending [Loan Prospecting and the Loss of Soft Information]," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 88(6), pages 2799-2832.
    6. Stefania Albanesi & Domonkos F. Vamossy, 2019. "Predicting Consumer Default: A Deep Learning Approach," NBER Working Papers 26165, National Bureau of Economic Research, Inc.
    7. Farkas, Walter & Fringuellotti, Fulvia & Tunaru, Radu, 2020. "A cost-benefit analysis of capital requirements adjusted for model risk," Journal of Corporate Finance, Elsevier, vol. 65(C).
    8. Andrés Alonso & José Manuel Marqués, 2019. "Financial innovation for a sustainable economy," Occasional Papers 1916, Banco de España.
    9. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    10. Dominique Guegan & Bertrand Hassani, 2018. "Regulatory learning: How to supervise machine learning models? An application to credit scoring," Post-Print halshs-01835213, HAL.
    11. Majid Bazarbash, 2019. "FinTech in Financial Inclusion: Machine Learning Applications in Assessing Credit Risk," IMF Working Papers 2019/109, International Monetary Fund.
    12. Andrés Alonso & José Manuel Carbó, 2021. "Understanding the performance of machine learning models to predict credit default: a novel approach for supervisory evaluation," Working Papers 2105, Banco de España.
    13. Julapa Jagtiani & Catharine Lemieux, 2019. "The roles of alternative data and machine learning in fintech lending: Evidence from the LendingClub consumer platform," Financial Management, Financial Management Association International, vol. 48(4), pages 1009-1029, December.
    14. Masciandaro, Donato & Peia, Oana & Romelli, Davide, 2020. "Banking supervision and external auditors: Theory and empirics," Journal of Financial Stability, Elsevier, vol. 46(C).
    15. Wall, Larry D., 2018. "Some financial regulatory implications of artificial intelligence," Journal of Economics and Business, Elsevier, vol. 100(C), pages 55-63.
    16. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    17. Bartlett, Robert & Morse, Adair & Stanton, Richard & Wallace, Nancy, 2022. "Consumer-lending discrimination in the FinTech Era," Journal of Financial Economics, Elsevier, vol. 143(1), pages 30-56.
    18. Anastasios Petropoulos & Vasilis Siakoulis & Evaggelos Stavroulakis & Aristotelis Klamargias, 2019. "A robust machine learning approach for credit risk analysis of large loan level datasets using deep learning and extreme gradient boosting," IFC Bulletins chapters, in: Bank for International Settlements (ed.), Are post-crisis statistical initiatives completed?, volume 49, Bank for International Settlements.
    19. Bracke, Philippe & Datta, Anupam & Jung, Carsten & Sen, Shayak, 2019. "Machine learning explainability in finance: an application to default risk analysis," Bank of England working papers 816, Bank of England.
    20. Khandani, Amir E. & Kim, Adlar J. & Lo, Andrew W., 2010. "Consumer credit-risk models via machine-learning algorithms," Journal of Banking & Finance, Elsevier, vol. 34(11), pages 2767-2787, November.
    21. Justin Sirignano & Rama Cont, 2019. "Universal features of price formation in financial markets: perspectives from deep learning," Quantitative Finance, Taylor & Francis Journals, vol. 19(9), pages 1449-1459, September.
    22. Ángel Iván Moreno & Teresa Caminero, 2020. "Application of text mining to the analysis of climate-related disclosures," Working Papers 2035, Banco de España.
    23. Jones, Stewart & Johnstone, David & Wilson, Roy, 2015. "An empirical evaluation of the performance of binary classifiers in the prediction of credit ratings changes," Journal of Banking & Finance, Elsevier, vol. 56(C), pages 72-85.
    24. Kerkhof, Jeroen & Melenberg, Bertrand & Schumacher, Hans, 2010. "Model risk and capital reserves," Journal of Banking & Finance, Elsevier, vol. 34(1), pages 267-279, January.
    25. Tarashev, Nikola, 2010. "Measuring portfolio credit risk correctly: Why parameter uncertainty matters," Journal of Banking & Finance, Elsevier, vol. 34(9), pages 2065-2076, September.
    26. Anastasios Petropoulos & Vasilis Siakoulis & Evaggelos Stavroulakis & Aristotelis Klamargias, 2019. "A robust machine learning approach for credit risk analysis of large loan-level datasets using deep learning and extreme gradient boosting," IFC Bulletins chapters, in: Bank for International Settlements (ed.), The use of big data analytics and artificial intelligence in central banking, volume 50, Bank for International Settlements.
    27. Jeremy D. Turiel & Tomaso Aste, 2019. "P2P Loan acceptance and default prediction with Artificial Intelligence," Papers 1907.01800, arXiv.org.
    28. Sigrist, Fabio & Hirnschall, Christoph, 2019. "Grabit: Gradient tree-boosted Tobit models for default prediction," Journal of Banking & Finance, Elsevier, vol. 102(C), pages 177-192.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. González, Marta Ramos & Ureña, Antonio Partal & Fernández-Aguado, Pilar Gómez, 2023. "Forecasting for regulatory credit loss derived from the COVID-19 pandemic: A machine learning approach," Research in International Business and Finance, Elsevier, vol. 64(C).
    2. Alonso-Robisco, Andrés & Carbó, José Manuel, 2022. "Can machine learning models save capital for banks? Evidence from a Spanish credit portfolio," International Review of Financial Analysis, Elsevier, vol. 84(C).
    3. Ryuichiro Hashimoto & Kakeru Miura & Yasunori Yoshizaki, 2023. "Application of Machine Learning to a Credit Rating Classification Model: Techniques for Improving the Explainability of Machine Learning," Bank of Japan Working Paper Series 23-E-6, Bank of Japan.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alonso-Robisco, Andrés & Carbó, José Manuel, 2022. "Can machine learning models save capital for banks? Evidence from a Spanish credit portfolio," International Review of Financial Analysis, Elsevier, vol. 84(C).
    2. Andrés Alonso & José Manuel Carbó, 2021. "Understanding the performance of machine learning models to predict credit default: a novel approach for supervisory evaluation," Working Papers 2105, Banco de España.
    3. Hoang, Daniel & Wiegratz, Kevin, 2022. "Machine learning methods in finance: Recent applications and prospects," Working Paper Series in Economics 158, Karlsruhe Institute of Technology (KIT), Department of Economics and Management.
    4. Buckmann, Marcus & Haldane, Andy & Hüser, Anne-Caroline, 2021. "Comparing minds and machines: implications for financial stability," Bank of England working papers 937, Bank of England.
    5. Goodell, John W. & Kumar, Satish & Lim, Weng Marc & Pattnaik, Debidutta, 2021. "Artificial intelligence and machine learning in finance: Identifying foundations, themes, and research clusters from bibliometric analysis," Journal of Behavioral and Experimental Finance, Elsevier, vol. 32(C).
    6. García-Céspedes, Rubén & Moreno, Manuel, 2022. "The generalized Vasicek credit risk model: A Machine Learning approach," Finance Research Letters, Elsevier, vol. 47(PA).
    7. Nicola Branzoli & Ilaria Supino, 2020. "FinTech credit: a critical review of empirical research," Questioni di Economia e Finanza (Occasional Papers) 549, Bank of Italy, Economic Research and International Relations Area.
    8. Lisa Crosato & Caterina Liberati & Marco Repetto, 2021. "Look Who's Talking: Interpretable Machine Learning for Assessing Italian SMEs Credit Default," Papers 2108.13914, arXiv.org, revised Sep 2021.
    9. Luca Barbaglia & Sebastiano Manzan & Elisa Tosetti, 2023. "Forecasting Loan Default in Europe with Machine Learning," Journal of Financial Econometrics, Oxford University Press, vol. 21(2), pages 569-596.
    10. Susanna Levantesi & Giulia Zacchia, 2021. "Machine Learning and Financial Literacy: An Exploration of Factors Influencing Financial Knowledge in Italy," JRFM, MDPI, vol. 14(3), pages 1-21, March.
    11. Apostolos Ampountolas & Titus Nyarko Nde & Paresh Date & Corina Constantinescu, 2021. "A Machine Learning Approach for Micro-Credit Scoring," Risks, MDPI, vol. 9(3), pages 1-20, March.
    12. Tobias Götze & Marc Gürtler & Eileen Witowski, 2020. "Improving CAT bond pricing models via machine learning," Journal of Asset Management, Palgrave Macmillan, vol. 21(5), pages 428-446, September.
    13. Eghbal Rahimikia & Stefan Zohren & Ser-Huang Poon, 2021. "Realised Volatility Forecasting: Machine Learning via Financial Word Embedding," Papers 2108.00480, arXiv.org, revised Mar 2023.
    14. Doumpos, Michalis & Zopounidis, Constantin & Gounopoulos, Dimitrios & Platanakis, Emmanouil & Zhang, Wenke, 2023. "Operational research and artificial intelligence methods in banking," European Journal of Operational Research, Elsevier, vol. 306(1), pages 1-16.
    15. Adebayo Oshingbesan & Eniola Ajiboye & Peruth Kamashazi & Timothy Mbaka, 2022. "Model-Free Reinforcement Learning for Asset Allocation," Papers 2209.10458, arXiv.org.
    16. Roy Cerqueti & Francesca Pampurini & Annagiulia Pezzola & Anna Grazia Quaranta, 2022. "Dangerous liasons and hot customers for banks," Review of Quantitative Finance and Accounting, Springer, vol. 59(1), pages 65-89, July.
    17. Fraisse, Henri & Laporte, Matthias, 2022. "Return on investment on artificial intelligence: The case of bank capital requirement," Journal of Banking & Finance, Elsevier, vol. 138(C).
    18. Sigrist, Fabio & Leuenberger, Nicola, 2023. "Machine learning for corporate default risk: Multi-period prediction, frailty correlation, loan portfolios, and tail probabilities," European Journal of Operational Research, Elsevier, vol. 305(3), pages 1390-1406.
    19. Aussenegg, Wolfgang & Resch, Florian & Winkler, Gerhard, 2011. "Pitfalls and remedies in testing the calibration quality of rating systems," Journal of Banking & Finance, Elsevier, vol. 35(3), pages 698-708, March.
    20. Nakagawa, Kei & Sakemoto, Ryuta, 2022. "Cryptocurrency network factors and gold," Finance Research Letters, Elsevier, vol. 46(PB).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:fininn:v:8:y:2022:i:1:d:10.1186_s40854-022-00366-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.