IDEAS home Printed from https://ideas.repec.org/p/bdi/wptemi/td_1256_19.html
   My bibliography  Save this paper

Corporate default forecasting with machine learning

Author

Listed:
  • Mirko Moscatelli

    (Bank of Italy)

  • Simone Narizzano

    (Bank of Italy)

  • Fabio Parlapiano

    (Bank of Italy)

  • Gianluca Viggiano

    (Bank of Italy)

Abstract

We analyze the performance of a set of machine learning (ML) models in predicting default risk, using standard statistical models, such as the logistic regression, as a benchmark. When only a limited information set is available, for example in the case of financial indicators, we find that ML models provide substantial gains in discriminatory power and precision compared with statistical models. This advantage diminishes when high quality information, such as credit behavioral indicators obtained from the Credit Register, is also available, and becomes negligible when the dataset is small. We also evaluate the consequences of using an ML-based rating system on the supply of credit and the number of borrowers gaining access to credit. ML models channel a larger share of credit towards safer and larger borrowers and result in lower credit losses for lenders.

Suggested Citation

  • Mirko Moscatelli & Simone Narizzano & Fabio Parlapiano & Gianluca Viggiano, 2019. "Corporate default forecasting with machine learning," Temi di discussione (Economic working papers) 1256, Bank of Italy, Economic Research and International Relations Area.
  • Handle: RePEc:bdi:wptemi:td_1256_19
    as

    Download full text from publisher

    File URL: https://www.bancaditalia.it/pubblicazioni/temi-discussione/2019/2019-1256/en_tema_1256.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    2. Chakraborty, Chiranjit & Joseph, Andreas, 2017. "Machine learning at central banks," Bank of England working papers 674, Bank of England.
    3. Stefania Albanesi & Domonkos F. Vamossy, 2019. "Predicting Consumer Default: A Deep Learning Approach," Working Papers 2019-056, Human Capital and Economic Opportunity Working Group.
    4. Andreas Joseph, 2019. "Parametric inference with universal function approximators," Papers 1903.04209, arXiv.org, revised Oct 2020.
    5. B Baesens & T Van Gestel & S Viaene & M Stepanova & J Suykens & J Vanthienen, 2003. "Benchmarking state-of-the-art classification algorithms for credit scoring," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 54(6), pages 627-635, June.
    6. Khandani, Amir E. & Kim, Adlar J. & Lo, Andrew W., 2010. "Consumer credit-risk models via machine-learning algorithms," Journal of Banking & Finance, Elsevier, vol. 34(11), pages 2767-2787, November.
    7. Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
    8. Dean Fantazzini & Silvia Figini, 2009. "Random Survival Forests Models for SME Credit Risk Measurement," Methodology and Computing in Applied Probability, Springer, vol. 11(1), pages 29-45, March.
    9. François Coppens & Fernando Gonzáles & Gerhard Winkler, 2007. "The performance of credit rating systems in the assessment of collateral used in Eurosystem monetary policy operations," Working Paper Research 118, National Bank of Belgium.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Parisa Golbayani & Ionuc{t} Florescu & Rupak Chatterjee, 2020. "A comparative study of forecasting Corporate Credit Ratings using Neural Networks, Support Vector Machines, and Decision Trees," Papers 2007.06617, arXiv.org.
    2. Dean Fantazzini & Raffaella Calabrese, 2021. "Crypto Exchanges and Credit Risk: Modeling and Forecasting the Probability of Closure," JRFM, MDPI, vol. 14(11), pages 1-23, October.
    3. Agnese Carella & Federica Ciocchetta & Valentina Michelangeli & Federico Maria Signoretti, 2020. "What can we learn about mortgage supply from online data?," Questioni di Economia e Finanza (Occasional Papers) 583, Bank of Italy, Economic Research and International Relations Area.
    4. Andrés Alonso & José Manuel Carbó, 2021. "Understanding the performance of machine learning models to predict credit default: a novel approach for supervisory evaluation," Working Papers 2105, Banco de España.
    5. Falco J. Bargagli-Stoffi & Jan Niederreiter & Massimo Riccaboni, 2020. "Supervised learning for the prediction of firm dynamics," Papers 2009.06413, arXiv.org.
    6. Alessandro Bitetto & Stefano Filomeni & Michele Modina, 2021. "Understanding corporate default using Random Forest: The role of accounting and market information," DEM Working Papers Series 205, University of Pavia, Department of Economics and Management.
    7. Alonso-Robisco, Andrés & Carbó, José Manuel, 2022. "Can machine learning models save capital for banks? Evidence from a Spanish credit portfolio," International Review of Financial Analysis, Elsevier, vol. 84(C).
    8. Alessandro Bitetto & Paola Cerchiello & Stefano Filomeni & Alessandra Tanda & Barbara Tarantino, 2021. "Machine Learning and Credit Risk: Empirical Evidence from SMEs," DEM Working Papers Series 201, University of Pavia, Department of Economics and Management.
    9. Branka Hadji Misheva & Joerg Osterrieder & Ali Hirsa & Onkar Kulkarni & Stephen Fung Lin, 2021. "Explainable AI in Credit Risk Management," Papers 2103.00949, arXiv.org.
    10. Henri Fraisse & Matthias Laporte, 2021. "Return on Investment on AI: The Case of Capital Requirement," Working papers 809, Banque de France.
    11. Salvatore Cardillo & Raffaele Gallo & Francesco Guarino, 2021. "Main challenges and prospects for the European banking sector: a critical review of the ongoing debate," Questioni di Economia e Finanza (Occasional Papers) 634, Bank of Italy, Economic Research and International Relations Area.
    12. Francesco Cusano & Giuseppe Marinelli & Stefano Piermattei, 2021. "Learning from revisions: a tool for detecting potential errors in banks' balance sheet statistical reporting," Questioni di Economia e Finanza (Occasional Papers) 611, Bank of Italy, Economic Research and International Relations Area.
    13. Falco J. Bargagli-Stoffi & Fabio Incerti & Massimo Riccaboni & Armando Rungi, 2023. "Machine Learning for Zombie Hunting: Predicting Distress from Firms' Accounts and Missing Values," Papers 2306.08165, arXiv.org.
    14. Flavio Bazzana & Marco Bee & Ahmed Almustfa Hussin Adam Khatir, 2024. "Machine learning techniques for default prediction: an application to small Italian companies," Risk Management, Palgrave Macmillan, vol. 26(1), pages 1-23, February.
    15. Nicola Branzoli & Ilaria Supino, 2020. "FinTech credit: a critical review of empirical research," Questioni di Economia e Finanza (Occasional Papers) 549, Bank of Italy, Economic Research and International Relations Area.
    16. Giuseppe Cascarino & Mirko Moscatelli & Fabio Parlapiano, 2022. "Explainable Artificial Intelligence: interpreting default forecasting models based on Machine Learning," Questioni di Economia e Finanza (Occasional Papers) 674, Bank of Italy, Economic Research and International Relations Area.
    17. Lisa Crosato & Caterina Liberati & Marco Repetto, 2021. "Look Who's Talking: Interpretable Machine Learning for Assessing Italian SMEs Credit Default," Papers 2108.13914, arXiv.org, revised Sep 2021.
    18. Falco J. Bargagli-Dtoffi & Massimo Riccaboni & Armando Rungi, 2020. "Machine Learning for Zombie Hunting. Firms Failures and Financial Constraints," Working Papers 01/2020, IMT School for Advanced Studies Lucca, revised Jun 2020.
    19. Giuseppe Orlando & Roberta Pelosi, 2020. "Non-Performing Loans for Italian Companies: When Time Matters. An Empirical Research on Estimating Probability to Default and Loss Given Default," IJFS, MDPI, vol. 8(4), pages 1-22, November.
    20. Golbayani, Parisa & Florescu, Ionuţ & Chatterjee, Rupak, 2020. "A comparative study of forecasting corporate credit ratings using neural networks, support vector machines, and decision trees," The North American Journal of Economics and Finance, Elsevier, vol. 54(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Paritosh Navinchandra Jha & Marco Cucculelli, 2021. "A New Model Averaging Approach in Predicting Credit Risk Default," Risks, MDPI, vol. 9(6), pages 1-15, June.
    2. Fraisse, Henri & Laporte, Matthias, 2022. "Return on investment on artificial intelligence: The case of bank capital requirement," Journal of Banking & Finance, Elsevier, vol. 138(C).
    3. Richard Chamboko & Jorge M. Bravo, 2016. "On the modelling of prognosis from delinquency to normal performance on retail consumer loans," Risk Management, Palgrave Macmillan, vol. 18(4), pages 264-287, December.
    4. Neuberg Richard & Hannah Lauren, 2017. "Loan pricing under estimation risk," Statistics & Risk Modeling, De Gruyter, vol. 34(1-2), pages 69-87, June.
    5. Agnese Carella & Federica Ciocchetta & Valentina Michelangeli & Federico Maria Signoretti, 2020. "What can we learn about mortgage supply from online data?," Questioni di Economia e Finanza (Occasional Papers) 583, Bank of Italy, Economic Research and International Relations Area.
    6. Hoang, Daniel & Wiegratz, Kevin, 2022. "Machine learning methods in finance: Recent applications and prospects," Working Paper Series in Economics 158, Karlsruhe Institute of Technology (KIT), Department of Economics and Management.
    7. Natalia Nehrebecka, 2021. "Internal Credit Risk Models and Digital Transformation: What to Prepare for? An Application to Poland," European Research Studies Journal, European Research Studies Journal, vol. 0(Special 3), pages 719-736.
    8. Casado Yusta, Silvia & Nœ–ez Letamendía, Laura & Pacheco Bonrostro, Joaqu’n Antonio, 2018. "Predicting Corporate Failure: The GRASP-LOGIT Model || Predicci—n de la quiebra empresarial: el modelo GRASP-LOGIT," Revista de Métodos Cuantitativos para la Economía y la Empresa = Journal of Quantitative Methods for Economics and Business Administration, Universidad Pablo de Olavide, Department of Quantitative Methods for Economics and Business Administration, vol. 26(1), pages 294-314, Diciembre.
    9. Yiyan Huang & Cheuk Hang Leung & Xing Yan & Qi Wu & Nanbo Peng & Dongdong Wang & Zhixiang Huang, 2020. "The Causal Learning of Retail Delinquency," Papers 2012.09448, arXiv.org.
    10. Tsukahara, Fábio Yasuhiro & Kimura, Herbert & Sobreiro, Vinicius Amorim & Zambrano, Juan Carlos Arismendi, 2016. "Validation of default probability models: A stress testing approach," International Review of Financial Analysis, Elsevier, vol. 47(C), pages 70-85.
    11. David Veganzones, 2022. "Corporate failure prediction using threshold‐based models," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(5), pages 956-979, August.
    12. Giacomo De Giorgi & Costanza Naguib, 2022. "Life after Default: Credit Hardship and its Effects," Diskussionsschriften dp2206, Universitaet Bern, Departement Volkswirtschaft.
    13. Yao-Zhi Xu & Jian-Lin Zhang & Ying Hua & Lin-Yue Wang, 2019. "Dynamic Credit Risk Evaluation Method for E-Commerce Sellers Based on a Hybrid Artificial Intelligence Model," Sustainability, MDPI, vol. 11(19), pages 1-17, October.
    14. Tang, Lingxiao & Cai, Fei & Ouyang, Yao, 2019. "Applying a nonparametric random forest algorithm to assess the credit risk of the energy industry in China," Technological Forecasting and Social Change, Elsevier, vol. 144(C), pages 563-572.
    15. Dinh, Thi Huyen Thanh & Kleimeier, Stefanie, 2007. "A credit scoring model for Vietnam's retail banking market," International Review of Financial Analysis, Elsevier, vol. 16(5), pages 471-495.
    16. Anastasios Petropoulos & Vasilis Siakoulis & Evaggelos Stavroulakis & Aristotelis Klamargias, 2019. "A robust machine learning approach for credit risk analysis of large loan level datasets using deep learning and extreme gradient boosting," IFC Bulletins chapters, in: Bank for International Settlements (ed.), Are post-crisis statistical initiatives completed?, volume 49, Bank for International Settlements.
    17. Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
    18. Anastasios Petropoulos & Vasilis Siakoulis & Evaggelos Stavroulakis & Aristotelis Klamargias, 2019. "A robust machine learning approach for credit risk analysis of large loan-level datasets using deep learning and extreme gradient boosting," IFC Bulletins chapters, in: Bank for International Settlements (ed.), The use of big data analytics and artificial intelligence in central banking, volume 50, Bank for International Settlements.
    19. Anton Gerunov, 2023. "Modern Approaches To Forecasting Firm Default Rates Over The Short To Medium Term: An Application To A Panel Of Polish Companies," Yearbook of the Faculty of Economics and Business Administration, Sofia University, Faculty of Economics and Business Administration, Sofia University St Kliment Ohridski - Bulgaria, vol. 22(1), pages 5-15, October.
    20. Doruk Şen & Cem Çağrı Dönmez & Umman Mahir Yıldırım, 2020. "A Hybrid Bi-level Metaheuristic for Credit Scoring," Information Systems Frontiers, Springer, vol. 22(5), pages 1009-1019, October.

    More about this item

    Keywords

    Credit Scoring; Machine Learning; Random Forest; Gradient Boosting Machine;
    All these keywords.

    JEL classification:

    • G2 - Financial Economics - - Financial Institutions and Services
    • C52 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Evaluation, Validation, and Selection
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bdi:wptemi:td_1256_19. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/bdigvit.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.