IDEAS home Printed from https://ideas.repec.org/a/eee/soceps/v90y2023ics0038012123002586.html
   My bibliography  Save this article

Machine learning and credit risk: Empirical evidence from small- and mid-sized businesses

Author

Listed:
  • Bitetto, Alessandro
  • Cerchiello, Paola
  • Filomeni, Stefano
  • Tanda, Alessandra
  • Tarantino, Barbara

Abstract

In this paper, we compare two different approaches to estimate the credit risk for small- and mid-sized businesses (SMBs), namely a classic parametric approach, by fitting an ordered probit model, and a non-parametric approach, calibrating a machine learning historical random forest (HRF) model. The models are applied to a unique and proprietary dataset comprising granular firm-level quarterly data collected from a European investment bank and an international insurance company on a sample of 464 Italian SMBs over the period 2015–2017. Results show that the HRF approach outperforms the traditional ordered probit model, highlighting how advanced estimation methodologies that use machine learning techniques can be successfully implemented to predict SMB credit risk, i.e. when facing high asymmetries of information. Moreover, by using Shapley values, we are able to assess the relevance of each variable in predicting SMB credit risk.

Suggested Citation

  • Bitetto, Alessandro & Cerchiello, Paola & Filomeni, Stefano & Tanda, Alessandra & Tarantino, Barbara, 2023. "Machine learning and credit risk: Empirical evidence from small- and mid-sized businesses," Socio-Economic Planning Sciences, Elsevier, vol. 90(C).
  • Handle: RePEc:eee:soceps:v:90:y:2023:i:c:s0038012123002586
    DOI: 10.1016/j.seps.2023.101746
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0038012123002586
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.seps.2023.101746?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Stefano Filomeni & Gregory F. Udell & Alberto Zazzaro, 2021. "Hardening soft information: does organizational distance matter?," The European Journal of Finance, Taylor & Francis Journals, vol. 27(9), pages 897-927, June.
    2. Jeffrey M. Wooldridge, 2005. "Simple solutions to the initial conditions problem in dynamic, nonlinear panel data models with unobserved heterogeneity," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 20(1), pages 39-54, January.
    3. Paul Contoyannis & Andrew M. Jones & Nigel Rice, 2004. "The dynamics of health in the British Household Panel Survey," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 19(4), pages 473-503.
    4. Bitetto, Alessandro & Cerchiello, Paola & Mertzanis, Charilaos, 2023. "On the efficient synthesis of short financial time series: A Dynamic Factor Model approach," Finance Research Letters, Elsevier, vol. 53(C).
    5. Gonzalez, F. & Haas, F. & Johannes, R. & Persson, M. & Toledo, L. & Violi, R. & Zins, C. & Wieland, M., 2004. "Market dynamics associated with credit ratings: a literature review," Financial Stability Review, Banque de France, issue 4, pages 53-76, June.
    6. Elizabeth R. Odders-White & Mark J. Ready, 2006. "Credit Ratings and Stock Liquidity," The Review of Financial Studies, Society for Financial Studies, vol. 19(1), pages 119-157.
    7. Francesco Dainelli & Francesco Giunta & Fabrizio Cipollini, 2013. "Determinants of SME credit worthiness under Basel rules: the value of credit history information," PSL Quarterly Review, Economia civile, vol. 66(264), pages 21-47.
    8. Filomeni, Stefano & Udell, Gregory F. & Zazzaro, Alberto, 2020. "Communication frictions in banking organizations: Evidence from credit score lending," Economics Letters, Elsevier, vol. 195(C).
    9. Cucinelli, Doriana & Battista, Maria Luisa Di & Marchese, Malvina & Nieri, Laura, 2018. "Credit risk in European banks: The bright side of the internal ratings based approach," Journal of Banking & Finance, Elsevier, vol. 93(C), pages 213-229.
    10. Stijn Claessens & Jan Krahnen & William Lang, 2005. "The Basel II Reform and Retail Credit Markets," Journal of Financial Services Research, Springer;Western Finance Association, vol. 28(1), pages 5-13, October.
    11. Blöchlinger, Andreas & Leippold, Markus, 2018. "Are Ratings the Worst Form of Credit Assessment Except for All the Others?," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 53(1), pages 299-334, February.
    12. Bitetto, Alessandro & Cerchiello, Paola, 2023. "Initial coin offerings and ESG: Allies or enemies?," Finance Research Letters, Elsevier, vol. 57(C).
    13. Majid Bazarbash, 2019. "FinTech in Financial Inclusion: Machine Learning Applications in Assessing Credit Risk," IMF Working Papers 2019/109, International Monetary Fund.
    14. de Andres, Javier & Landajo, Manuel & Lorca, Pedro, 2005. "Forecasting business profitability by using classification techniques: A comparative analysis based on a Spanish case," European Journal of Operational Research, Elsevier, vol. 167(2), pages 518-542, December.
    15. Dean Fantazzini & Silvia Figini, 2009. "Random Survival Forests Models for SME Credit Risk Measurement," Methodology and Computing in Applied Probability, Springer, vol. 11(1), pages 29-45, March.
    16. Vlado Kysucky & Lars Norden, 2016. "The Benefits of Relationship Lending in a Cross-Country Context: A Meta-Analysis," Management Science, INFORMS, vol. 62(1), pages 90-110, January.
    17. Corazza, Marco & Funari, Stefania & Gusso, Riccardo, 2016. "Creditworthiness evaluation of Italian SMEs at the beginning of the 2007–2008 crisis: An MCDA approach," The North American Journal of Economics and Finance, Elsevier, vol. 38(C), pages 1-26.
    18. Altman, Edward I., 1980. "Commercial Bank Lending: Process, Credit Scoring, and Costs of Errors in Lending," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 15(4), pages 813-832, November.
    19. Mirko Moscatelli & Simone Narizzano & Fabio Parlapiano & Gianluca Viggiano, 2019. "Corporate default forecasting with machine learning," Temi di discussione (Economic working papers) 1256, Bank of Italy, Economic Research and International Relations Area.
    20. Berger, Allen N & Udell, Gregory F, 1995. "Relationship Lending and Lines of Credit in Small Firm Finance," The Journal of Business, University of Chicago Press, vol. 68(3), pages 351-381, July.
    21. Stefano Filomeni & Michele Modina & Elena Tabacco, 2023. "Trade credit and firm investments: empirical evidence from Italian cooperative banks," Review of Quantitative Finance and Accounting, Springer, vol. 60(3), pages 1099-1141, April.
    22. Gonzalez, F. & Haas, F. & Johannes, R. & Persson, M. & Toledo, L. & Violi, R. & Zins, C. & Wieland, M., 2004. "Market dynamics associated with credit ratings: a literature review," Financial Stability Review, Banque de France, issue 4, pages 53-76, June.
    23. Bitetto, Alessandro & Cerchiello, Paola & Mertzanis, Charilaos, 2023. "Measuring financial soundness around the world: A machine learning approach," International Review of Financial Analysis, Elsevier, vol. 85(C).
    24. Edward I. Altman & Gabriele Sabato, 2013. "MODELING CREDIT RISK FOR SMEs: EVIDENCE FROM THE US MARKET," World Scientific Book Chapters, in: Oliviero Roggi & Edward I Altman (ed.), Managing and Measuring Risk Emerging Global Standards and Regulations After the Financial Crisis, chapter 9, pages 251-279, World Scientific Publishing Co. Pte. Ltd..
    25. José María Liberti & Mitchell A. Petersen, 2018. "Information: Hard and Soft," NBER Working Papers 25075, National Bureau of Economic Research, Inc.
    26. William H. Greene & David A. Hensher, 2008. "Modeling Ordered Choices: A Primer and Recent Developments," Working Papers 08-26, New York University, Leonard N. Stern School of Business, Department of Economics.
    27. Greta Falavigna, 2006. "Models for Default Risk Analysis: Focus on Artificial Neural Networks, Model Comparisons, Hybrid Frameworks," CERIS Working Paper 200610, CNR-IRCrES Research Institute on Sustainable Economic Growth - Torino (TO) ITALY - former Institute for Economic Research on Firms and Growth - Moncalieri (TO) ITALY.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alessandro Bitetto & Paola Cerchiello & Stefano Filomeni & Alessandra Tanda & Barbara Tarantino, 2021. "Machine Learning and Credit Risk: Empirical Evidence from SMEs," DEM Working Papers Series 201, University of Pavia, Department of Economics and Management.
    2. Modina, Michele & Pietrovito, Filomena & Gallucci, Carmen & Formisano, Vincenzo, 2023. "Predicting SMEs’ default risk: Evidence from bank-firm relationship data," The Quarterly Review of Economics and Finance, Elsevier, vol. 89(C), pages 254-268.
    3. Mizen, Paul & Tsoukas, Serafeim, 2012. "Forecasting US bond default ratings allowing for previous and initial state dependence in an ordered probit model," International Journal of Forecasting, Elsevier, vol. 28(1), pages 273-287.
    4. Marco Corazza & Giovanni Fasano & Stefania Funari & Riccardo Gusso, 2017. "PSO-based tuning of MURAME parameters for creditworthiness evaluation of Italian SMEs," Working Papers 04, Department of Management, Università Ca' Foscari Venezia.
    5. Lisa Crosato & Caterina Liberati & Marco Repetto, 2021. "Look Who's Talking: Interpretable Machine Learning for Assessing Italian SMEs Credit Default," Papers 2108.13914, arXiv.org, revised Sep 2021.
    6. Zedda, Stefano & Modina, Michele & Gallucci, Carmen, 2024. "Cooperative credit banks and sustainability: Towards a social credit scoring," Research in International Business and Finance, Elsevier, vol. 68(C).
    7. Carro, Jesús M. & Traferri, Alejandra, 2009. "Correcting the bias in the estimation of a dynamic ordered probit with fixed effects of self-assessed health status," UC3M Working papers. Economics we094021, Universidad Carlos III de Madrid. Departamento de Economía.
    8. Dean Fantazzini & Raffaella Calabrese, 2021. "Crypto Exchanges and Credit Risk: Modeling and Forecasting the Probability of Closure," JRFM, MDPI, vol. 14(11), pages 1-23, October.
    9. Francesco Ciampi & Alessandro Giannozzi & Giacomo Marzi & Edward I. Altman, 2021. "Rethinking SME default prediction: a systematic literature review and future perspectives," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(3), pages 2141-2188, March.
    10. Geert Dhaene & Koen Jochmans, 2015. "Split-panel Jackknife Estimation of Fixed-effect Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 82(3), pages 991-1030.
    11. Jarko Fidrmuc & Philipp Schreiber & Martin Siddiqui, 2018. "Intangible Assets and the Determinants of a Single Bank Relation of German SMEs," European Journal of Business Science and Technology, Mendel University in Brno, Faculty of Business and Economics, vol. 4(1), pages 5-30.
    12. Lionel WILNER, 2019. "The Dynamics of Individual Happiness," Working Papers 2019-18, Center for Research in Economics and Statistics.
    13. Hussain, Inayat & Durand, Robert B. & Harris, Mark N., 2021. "Relationship lending: A source of support or a means of exploitation?," Global Finance Journal, Elsevier, vol. 48(C).
    14. Patricia Cubí‐Mollá & Mireia Jofre‐Bonet & Victoria Serra‐Sastre, 2017. "Adaptation to health states: Sick yet better off?," Health Economics, John Wiley & Sons, Ltd., vol. 26(12), pages 1826-1843, December.
    15. Russo, Daniela & Hart, Terry L. & Malaguti, Maria Chiara & Papathanassiou, Chryssa, 2004. "Governance of securities clearing and settlement systems," Occasional Paper Series 21, European Central Bank.
    16. Hernández-Quevedo, Cristina & Jones, Andrew M. & Rice, Nigel, 2008. "Persistence in health limitations: A European comparative analysis," Journal of Health Economics, Elsevier, vol. 27(6), pages 1472-1488, December.
    17. André Geis & Arnaud Mehl & Stefan Wredenborg, 2004. "The international role of the euro - evidence from bonds issued by non-euro area residents," Occasional Paper Series 18, European Central Bank.
    18. Martin Brown & Matthias Hoffmann, 2016. "Relationship Banking in the Residential Mortgage Market? Evidence from Switzerland," Swiss Journal of Economics and Statistics (SJES), Swiss Society of Economics and Statistics (SSES), vol. 152(I), pages 23-48, March.
    19. Ornelas, José Renato Haas & da Silva, Marcos Soares & Van Doornik, Bernardus Ferdinandus Nazar, 2022. "Informational switching costs, bank competition, and the cost of finance," Journal of Banking & Finance, Elsevier, vol. 138(C).
    20. Doris Neuberger & Solvig Räthke, 2009. "Microenterprises and multiple bank relationships: The case of professionals," Small Business Economics, Springer, vol. 32(2), pages 207-229, February.

    More about this item

    Keywords

    Credit rating; SMB; Historical random forest; Machine learning; Relationship banking; Invoice lending;
    All these keywords.

    JEL classification:

    • C52 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Evaluation, Validation, and Selection
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
    • D82 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Asymmetric and Private Information; Mechanism Design
    • D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
    • G21 - Financial Economics - - Financial Institutions and Services - - - Banks; Other Depository Institutions; Micro Finance Institutions; Mortgages
    • G22 - Financial Economics - - Financial Institutions and Services - - - Insurance; Insurance Companies; Actuarial Studies

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:soceps:v:90:y:2023:i:c:s0038012123002586. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/seps .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.