IDEAS home Printed from https://ideas.repec.org/p/pav/demwpp/demwp0201.html
   My bibliography  Save this paper

Machine Learning and Credit Risk: Empirical Evidence from SMEs

Author

Listed:
  • Alessandro Bitetto

    (University of Pavia)

  • Paola Cerchiello

    (University of Pavia)

  • Stefano Filomeni

    (University of Essex)

  • Alessandra Tanda

    (University of Pavia)

  • Barbara Tarantino

    (University of Pavia)

Abstract

In this paper we assess credit risk of SMEs by testing and comparing a classic parametric approach fitting an ordered probit model with a non-parametric one calibrating a machine learning historical random forest (HRF) model. We do so by exploiting a unique and proprietary dataset comprising granular firm-level quarterly data collected from a large European bank and an international insurance company on a sample of 810 Italian small- and medium-sized enterprises (SMEs) over the time period 2015-2017. Our results provide novel evidence that a dynamic Historical Random Forest (HRF) approach outperforms the traditional ordered probit model, highlighting how advanced estimation methodologies that use machine learning techniques can be successfully implemented to predict SME credit risk. Moreover, by using Shapley values for the first time, we are able to assess the relevance of each variable in predicting SME credit risk. Traditionally, credit risk evaluation of informationally-opaque SMEs has relied on soft information-intensive relationship banking. However, the advent of large banking conglomerates and the limits to successfully "harden" and transmit soft information across large banking organizations, challenge the traditional role of relationship banking, urging the need to evaluate SME credit risk by implementing alternative methodologies mostly based on hard information.

Suggested Citation

  • Alessandro Bitetto & Paola Cerchiello & Stefano Filomeni & Alessandra Tanda & Barbara Tarantino, 2021. "Machine Learning and Credit Risk: Empirical Evidence from SMEs," DEM Working Papers Series 201, University of Pavia, Department of Economics and Management.
  • Handle: RePEc:pav:demwpp:demwp0201
    as

    Download full text from publisher

    File URL: http://dem-web.unipv.it/web/docs/dipeco/quad/ps/RePEc/pav/demwpp/DEMWP0201.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. ., 2004. "Market dynamics associated with credit ratings: a literature review," Financial Stability Review, Banque de France, issue 4, pages 77-93, June.
    2. Jeffrey M. Wooldridge, 2005. "Simple solutions to the initial conditions problem in dynamic, nonlinear panel data models with unobserved heterogeneity," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 20(1), pages 39-54, January.
    3. Altman, Edward I., 1980. "Commercial Bank Lending: Process, Credit Scoring, and Costs of Errors in Lending," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 15(4), pages 813-832, November.
    4. Paul Contoyannis & Andrew M. Jones & Nigel Rice, 2004. "The dynamics of health in the British Household Panel Survey," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 19(4), pages 473-503.
    5. Berger, Allen N. & Miller, Nathan H. & Petersen, Mitchell A. & Rajan, Raghuram G. & Stein, Jeremy C., 2005. "Does function follow organizational form? Evidence from the lending practices of large and small banks," Journal of Financial Economics, Elsevier, vol. 76(2), pages 237-269, May.
    6. Ferri, Giovanni & Pesic, Valerio, 2017. "Bank regulatory arbitrage via risk weighted assets dispersion," Journal of Financial Stability, Elsevier, vol. 33(C), pages 331-345.
    7. Mirko Moscatelli & Simone Narizzano & Fabio Parlapiano & Gianluca Viggiano, 2019. "Corporate default forecasting with machine learning," Temi di discussione (Economic working papers) 1256, Bank of Italy, Economic Research and International Relations Area.
    8. Berger, Allen N & Udell, Gregory F, 1995. "Relationship Lending and Lines of Credit in Small Firm Finance," The Journal of Business, University of Chicago Press, vol. 68(3), pages 351-381, July.
    9. Gonzalez, F. & Haas, F. & Johannes, R. & Persson, M. & Toledo, L. & Violi, R. & Zins, C. & Wieland, M., 2004. "Market dynamics associated with credit ratings: a literature review," Financial Stability Review, Banque de France, issue 4, pages 53-76, June.
    10. Gonzalez, F. & Haas, F. & Johannes, R. & Persson, M. & Toledo, L. & Violi, R. & Zins, C. & Wieland, M., 2004. "Market dynamics associated with credit ratings: a literature review," Financial Stability Review, Banque de France, issue 4, pages 53-76, June.
    11. Elizabeth R. Odders-White & Mark J. Ready, 2006. "Credit Ratings and Stock Liquidity," The Review of Financial Studies, Society for Financial Studies, vol. 19(1), pages 119-157.
    12. Filomeni, Stefano & Udell, Gregory F. & Zazzaro, Alberto, 2020. "Communication frictions in banking organizations: Evidence from credit score lending," Economics Letters, Elsevier, vol. 195(C).
    13. Cucinelli, Doriana & Battista, Maria Luisa Di & Marchese, Malvina & Nieri, Laura, 2018. "Credit risk in European banks: The bright side of the internal ratings based approach," Journal of Banking & Finance, Elsevier, vol. 93(C), pages 213-229.
    14. José María Liberti & Mitchell A. Petersen, 2018. "Information: Hard and Soft," NBER Working Papers 25075, National Bureau of Economic Research, Inc.
    15. Stijn Claessens & Jan Krahnen & William Lang, 2005. "The Basel II Reform and Retail Credit Markets," Journal of Financial Services Research, Springer;Western Finance Association, vol. 28(1), pages 5-13, October.
    16. William H. Greene & David A. Hensher, 2008. "Modeling Ordered Choices: A Primer and Recent Developments," Working Papers 08-26, New York University, Leonard N. Stern School of Business, Department of Economics.
    17. Blöchlinger, Andreas & Leippold, Markus, 2018. "Are Ratings the Worst Form of Credit Assessment Except for All the Others?," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 53(1), pages 299-334, February.
    18. Greta Falavigna, 2006. "Models for Default Risk Analysis: Focus on Artificial Neural Networks, Model Comparisons, Hybrid Frameworks," CERIS Working Paper 200610, CNR-IRCrES Research Institute on Sustainable Economic Growth - Torino (TO) ITALY - former Institute for Economic Research on Firms and Growth - Moncalieri (TO) ITALY.
    19. Jeremy C. Stein, 2002. "Information Production and Capital Allocation: Decentralized versus Hierarchical Firms," Journal of Finance, American Finance Association, vol. 57(5), pages 1891-1921, October.
    20. Majid Bazarbash, 2019. "FinTech in Financial Inclusion: Machine Learning Applications in Assessing Credit Risk," IMF Working Papers 2019/109, International Monetary Fund.
    21. de Andres, Javier & Landajo, Manuel & Lorca, Pedro, 2005. "Forecasting business profitability by using classification techniques: A comparative analysis based on a Spanish case," European Journal of Operational Research, Elsevier, vol. 167(2), pages 518-542, December.
    22. Dean Fantazzini & Silvia Figini, 2009. "Random Survival Forests Models for SME Credit Risk Measurement," Methodology and Computing in Applied Probability, Springer, vol. 11(1), pages 29-45, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mizen, Paul & Tsoukas, Serafeim, 2012. "Forecasting US bond default ratings allowing for previous and initial state dependence in an ordered probit model," International Journal of Forecasting, Elsevier, vol. 28(1), pages 273-287.
    2. Song Zhang & Liang Han & Konstantinos Kallias & Antonios Kallias, 2021. "The value of in-person banking: evidence from U.S. small businesses," Review of Quantitative Finance and Accounting, Springer, vol. 57(4), pages 1393-1435, November.
    3. Modina, Michele & Pietrovito, Filomena & Gallucci, Carmen & Formisano, Vincenzo, 2023. "Predicting SMEs’ default risk: Evidence from bank-firm relationship data," The Quarterly Review of Economics and Finance, Elsevier, vol. 89(C), pages 254-268.
    4. Fabio Panetta & Fabiano Schivardi & Matthew Shum, 2009. "Do Mergers Improve Information? Evidence from the Loan Market," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 41(4), pages 673-709, June.
    5. Bertrand, Jérémie & Murro, Pierluigi, 2022. "Firm–bank “odd couples” and trade credit: Evidence from Italian small- and medium-sized enterprises," Economic Modelling, Elsevier, vol. 111(C).
    6. Uluc Aysun, 2019. "Centralized versus Decentralized Banking: Bank-level evidence from U.S. Call Reports," Working Papers 2019-03, University of Central Florida, Department of Economics.
    7. André Geis & Arnaud Mehl & Stefan Wredenborg, 2004. "The international role of the euro - evidence from bonds issued by non-euro area residents," Occasional Paper Series 18, European Central Bank.
    8. Hibbeln, Martin & Norden, Lars & Usselmann, Piet & Gürtler, Marc, 2020. "Informational synergies in consumer credit," Journal of Financial Intermediation, Elsevier, vol. 44(C).
    9. Uchida, Hirofumi & Udell, Gregory F. & Yamori, Nobuyoshi, 2012. "Loan officers and relationship lending to SMEs," Journal of Financial Intermediation, Elsevier, vol. 21(1), pages 97-122.
    10. Shusen Qi & Ralph De Haas & Steven Ongena & Stefan Straetmans & Tamas Vadasz, 2017. "Move a Little Closer? Information Sharing and the Spatial Clustering of Bank Branches," Swiss Finance Institute Research Paper Series 17-74, Swiss Finance Institute, revised Jun 2023.
    11. Hans Degryse & Steven Ongena, 2005. "Distance, Lending Relationships, and Competition," Journal of Finance, American Finance Association, vol. 60(1), pages 231-266, February.
    12. Johannes Hörner & Nicolas S Lambert, 2021. "Motivational Ratings [Toward the Next Generation of Recommender Systems: A Survey of the State-of-the-Art and Possible Extensions]," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 88(4), pages 1892-1935.
    13. Guido Wolswijk & Jakob de Haan, 2005. "Government debt management in the euro area - recent theoretical developments and changes in practices," Occasional Paper Series 25, European Central Bank.
    14. Stefano Filomeni & Michele Modina & Elena Tabacco, 2023. "Trade credit and firm investments: empirical evidence from Italian cooperative banks," Review of Quantitative Finance and Accounting, Springer, vol. 60(3), pages 1099-1141, April.
    15. Hans Degryse & Luc Laeven & Steven Ongena, 2009. "The Impact of Organizational Structure and Lending Technology on Banking Competition," Review of Finance, European Finance Association, vol. 13(2), pages 225-259.
    16. de Haas, Ralph & Lu, Liping & Ongena, S.R.G., 2018. "Clear and Close Competitors? : On the Causes and Consequences of Bilateral Competition between Banks," Other publications TiSEM e9f86045-13c5-49d9-85df-1, Tilburg University, School of Economics and Management.
    17. Antonio, DI Cesare, 2006. "Do Market‐based Indicators Anticipate Rating Agencies? Evidence for International Banks," Economic Notes, Banca Monte dei Paschi di Siena SpA, vol. 35(1), pages 121-150, February.
    18. Iftekhar Hasan & Krzysztof Jackowicz & Oskar Kowalewski & Łukasz Kozłowski, 2019. "The economic impact of changes in local bank presence," Regional Studies, Taylor & Francis Journals, vol. 53(5), pages 644-656, May.
    19. O. Emre Ergungor, 2002. "Community banks as small business lenders: the tough road ahead," Working Papers (Old Series) 0203, Federal Reserve Bank of Cleveland.
    20. Berger, Allen N. & Udell, Gregory F., 2006. "A more complete conceptual framework for SME finance," Journal of Banking & Finance, Elsevier, vol. 30(11), pages 2945-2966, November.

    More about this item

    Keywords

    Credit Rating; SME; Historical Random Forest; Machine Learning; Relationship Banking; Soft Information;
    All these keywords.

    JEL classification:

    • C52 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Evaluation, Validation, and Selection
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
    • D82 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Asymmetric and Private Information; Mechanism Design
    • D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
    • G21 - Financial Economics - - Financial Institutions and Services - - - Banks; Other Depository Institutions; Micro Finance Institutions; Mortgages
    • G22 - Financial Economics - - Financial Institutions and Services - - - Insurance; Insurance Companies; Actuarial Studies

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pav:demwpp:demwp0201. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Alice Albonico (email available below). General contact details of provider: https://edirc.repec.org/data/dppavit.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.