IDEAS home Printed from https://ideas.repec.org/p/hal/cesptp/halshs-01835164.html
   My bibliography  Save this paper

Credit Risk Analysis Using Machine and Deep Learning Models

Author

Listed:
  • Dominique Guegan

    (UP1 - Université Paris 1 Panthéon-Sorbonne, CES - Centre d'économie de la Sorbonne - UP1 - Université Paris 1 Panthéon-Sorbonne - CNRS - Centre National de la Recherche Scientifique, Labex ReFi - UP1 - Université Paris 1 Panthéon-Sorbonne, IPAG Business School, University of Ca’ Foscari [Venice, Italy])

  • Peter Martey Addo

    (AFD - Agence française de développement, Labex ReFi - UP1 - Université Paris 1 Panthéon-Sorbonne)

  • Bertrand Hassani

    (Labex ReFi - UP1 - Université Paris 1 Panthéon-Sorbonne, CES - Centre d'économie de la Sorbonne - UP1 - Université Paris 1 Panthéon-Sorbonne - CNRS - Centre National de la Recherche Scientifique, Capgemini Consulting [Paris], UCL-CS - Department of Computer science [University College of London] - UCL - University College of London [London])

Abstract

Due to the advanced technology associated with Big Data, data availability and computing power, most banks or lending institutions are renewing their business models. Credit risk predictions, monitoring, model reliability and effective loan processing are key to decision-making and transparency. In this work, we build binary classifiers based on machine and deep learning models on real data in predicting loan default probability. The top 10 important features from these models are selected and then used in the modeling process to test the stability of binary classifiers by comparing their performance on separate data. We observe that the tree-based models are more stable than the models based on multilayer artificial neural networks. This opens several questions relative to the intensive use of deep learning systems in enterprises.

Suggested Citation

  • Dominique Guegan & Peter Martey Addo & Bertrand Hassani, 2018. "Credit Risk Analysis Using Machine and Deep Learning Models," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-01835164, HAL.
  • Handle: RePEc:hal:cesptp:halshs-01835164
    DOI: 10.3390/risks6020038
    Note: View the original document on HAL open archive server: https://shs.hal.science/halshs-01835164
    as

    Download full text from publisher

    File URL: https://shs.hal.science/halshs-01835164/document
    Download Restriction: no

    File URL: https://libkey.io/10.3390/risks6020038?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Robert Tibshirani, 2011. "Regression shrinkage and selection via the lasso: a retrospective," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(3), pages 273-282, June.
    2. Justin Sirignano & Apaar Sadhwani & Kay Giesecke, 2016. "Deep Learning for Mortgage Risk," Papers 1607.02470, arXiv.org, revised Mar 2018.
    3. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    4. ,, 1998. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 14(1), pages 151-159, February.
    5. Yitzhaki, Shlomo, 1983. "On an Extension of the Gini Inequality Index," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 24(3), pages 617-628, October.
    6. Lerman, Robert I. & Yitzhaki, Shlomo, 1984. "A note on the calculation and interpretation of the Gini index," Economics Letters, Elsevier, vol. 15(3-4), pages 363-368.
    7. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    8. Butaru, Florentin & Chen, Qingqing & Clark, Brian & Das, Sanmay & Lo, Andrew W. & Siddique, Akhtar, 2016. "Risk and risk management in the credit card industry," Journal of Banking & Finance, Elsevier, vol. 72(C), pages 218-239.
    9. ,, 1998. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 14(5), pages 687-698, October.
    10. A. Seetharaman & Vikas Kumar Sahu & A. S. Saravanan & John Rudolph Raj & Indu Niranjan, 2017. "The Impact of Risk Management in Credit Rating Agencies," Risks, MDPI, vol. 5(4), pages 1-16, September.
    11. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    12. ,, 1998. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 14(3), pages 381-386, June.
    13. ,, 1998. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 14(4), pages 525-537, August.
    14. ,, 1998. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 14(2), pages 285-292, April.
    15. Angelini, Eliana & di Tollo, Giacomo & Roli, Andrea, 2008. "A neural network approach for credit risk evaluation," The Quarterly Review of Economics and Finance, Elsevier, vol. 48(4), pages 733-755, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Peter Martey Addo & Dominique Guegan & Bertrand Hassani, 2018. "Credit Risk Analysis Using Machine and Deep Learning Models," Risks, MDPI, vol. 6(2), pages 1-20, April.
    2. Dominique Guegan & Peter Martey Addo & Bertrand Hassani, 2018. "Credit Risk Analysis Using Machine and Deep Learning Models," Post-Print halshs-01835164, HAL.
    3. Gallego, Jorge & Rivero, Gonzalo & Martínez, Juan, 2021. "Preventing rather than punishing: An early warning model of malfeasance in public procurement," International Journal of Forecasting, Elsevier, vol. 37(1), pages 360-377.
    4. Dolf Talman & Zaifu Yang, 2012. "On a Parameterized System of Nonlinear Equations with Economic Applications," Journal of Optimization Theory and Applications, Springer, vol. 154(2), pages 644-671, August.
    5. Zhiqiang Zheng & Balaji Padmanabhan & Steven O. Kimbrough, 2003. "On the Existence and Significance of Data Preprocessing Biases in Web-Usage Mining," INFORMS Journal on Computing, INFORMS, vol. 15(2), pages 148-170, May.
    6. Herings, P.J.J. & Talman, A.J.J. & Yang, Z.F., 1999. "Variational Inequality Problems With a Continuum of Solutions : Existence and Computation," Other publications TiSEM 73e2f01b-ad4d-4447-95ba-a, Tilburg University, School of Economics and Management.
    7. Carlos R. Handy & Daniel Vrinceanu & Carl B. Marth & Harold A. Brooks, 2015. "Pointwise Reconstruction of Wave Functions from Their Moments through Weighted Polynomial Expansions: An Alternative Global-Local Quantization Procedure," Mathematics, MDPI, vol. 3(4), pages 1-24, November.
    8. Allen C. Goodman & Miron Stano, 2000. "Hmos and Health Externalities: A Local Public Good Perspective," Public Finance Review, , vol. 28(3), pages 247-269, May.
    9. Bode, Sven & Michaelowa, Axel, 2003. "Avoiding perverse effects of baseline and investment additionality determination in the case of renewable energy projects," Energy Policy, Elsevier, vol. 31(6), pages 505-517, May.
    10. Ala, Guido & Fasshauer, Gregory E. & Francomano, Elisa & Ganci, Salvatore & McCourt, Michael J., 2017. "An augmented MFS approach for brain activity reconstruction," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 141(C), pages 3-15.
    11. Bettina Campedelli & Andrea Guerrina & Giulia Romano & Chiara Leardini, 2014. "La performance della rete ospedaliera pubblica della regione Veneto. L?impatto delle variabili ambientali e operative sull?efficienza," MECOSAN, FrancoAngeli Editore, vol. 2014(92), pages 119-142.
    12. Haider A. Khan, 2004. "General Conclusions: From Crisis to a Global Political Economy of Freedom," Palgrave Macmillan Books, in: Global Markets and Financial Crises in Asia, chapter 9, pages 193-211, Palgrave Macmillan.
    13. Penn Loh & Zoë Ackerman & Joceline Fidalgo & Rebecca Tumposky, 2022. "Co-Education/Co-Research Partnership: A Critical Approach to Co-Learning between Dudley Street Neighborhood Initiative and Tufts University," Social Sciences, MDPI, vol. 11(2), pages 1-17, February.
    14. Broekhuis, Manda & Vos, Janita F.J., 2003. "Improving organizational sustainability using a quality perspective," Research Report 03A43, University of Groningen, Research Institute SOM (Systems, Organisations and Management).
    15. O'Brien, Raymond & Patacchini, Eleonora, 2003. "Testing the exogeneity assumption in panel data models with "non classical" disturbances," Discussion Paper Series In Economics And Econometrics 0302, Economics Division, School of Social Sciences, University of Southampton.
    16. van der Laan, G. & Talman, A.J.J. & Yang, Z.F., 2002. "Perfection and Stability of Stationary Points with Applications in Noncooperative Games," Discussion Paper 2002-108, Tilburg University, Center for Economic Research.
    17. Edcarlos D. Silva & J. C. Albuquerque & T. R. Cavalcante, 2021. "Fourth-order nonlocal type elliptic problems with indefinite nonlinearities," Partial Differential Equations and Applications, Springer, vol. 2(2), pages 1-22, April.
    18. YongSeog Kim & W. Nick Street & Gary J. Russell & Filippo Menczer, 2005. "Customer Targeting: A Neural Network Approach Guided by Genetic Algorithms," Management Science, INFORMS, vol. 51(2), pages 264-276, February.
    19. Montijano, J.I. & Rández, L. & Van Daele, M. & Calvo, M., 2020. "On the numerical stability of the exponentially fitted methods for first order IVPs," Applied Mathematics and Computation, Elsevier, vol. 379(C).
    20. Yanling Li & Zita Oravecz & Shuai Zhou & Yosef Bodovski & Ian J. Barnett & Guangqing Chi & Yuan Zhou & Naomi P. Friedman & Scott I. Vrieze & Sy-Miin Chow, 2022. "Bayesian Forecasting with a Regime-Switching Zero-Inflated Multilevel Poisson Regression Model: An Application to Adolescent Alcohol Use with Spatial Covariates," Psychometrika, Springer;The Psychometric Society, vol. 87(2), pages 376-402, June.

    More about this item

    Keywords

    financial regulation; deep learning; Big data; data science; credit risk;
    All these keywords.

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:cesptp:halshs-01835164. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.