IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v247y2015i1p124-136.html
   My bibliography  Save this article

Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research

Author

Listed:
  • Lessmann, Stefan
  • Baesens, Bart
  • Seow, Hsin-Vonn
  • Thomas, Lyn C.

Abstract

Many years have passed since Baesens et al. published their benchmarking study of classification algorithms in credit scoring [Baesens, B., Van Gestel, T., Viaene, S., Stepanova, M., Suykens, J., & Vanthienen, J. (2003). Benchmarking state-of-the-art classification algorithms for credit scoring. Journal of the Operational Research Society, 54(6), 627–635.]. The interest in prediction methods for scorecard development is unbroken. However, there have been several advancements including novel learning methods, performance measures and techniques to reliably compare different classifiers, which the credit scoring literature does not reflect. To close these research gaps, we update the study of Baesens et al. and compare several novel classification algorithms to the state-of-the-art in credit scoring. In addition, we examine the extent to which the assessment of alternative scorecards differs across established and novel indicators of predictive accuracy. Finally, we explore whether more accurate classifiers are managerial meaningful. Our study provides valuable insight for professionals and academics in credit scoring. It helps practitioners to stay abreast of technical advancements in predictive modeling. From an academic point of view, the study provides an independent assessment of recent scoring methods and offers a new baseline to which future approaches can be compared.

Suggested Citation

  • Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.
  • Handle: RePEc:eee:ejores:v:247:y:2015:i:1:p:124-136
    DOI: 10.1016/j.ejor.2015.05.030
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221715004208
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2015.05.030?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Verbraken, Thomas & Bravo, Cristián & Weber, Richard & Baesens, Bart, 2014. "Development and application of consumer credit scoring models using profit-based classification measures," European Journal of Operational Research, Elsevier, vol. 238(2), pages 505-513.
    2. G Andreeva, 2006. "European generic scoring models using survival analysis," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 57(10), pages 1180-1187, October.
    3. So, Meko M.C. & Thomas, Lyn C., 2011. "Modelling the profitability of credit cards by Markov decision processes," European Journal of Operational Research, Elsevier, vol. 212(1), pages 123-130, July.
    4. Viaene, Stijn & Dedene, Guido, 2005. "Cost-sensitive learning and decision making revisited," European Journal of Operational Research, Elsevier, vol. 166(1), pages 212-220, October.
    5. B Baesens & T Van Gestel & S Viaene & M Stepanova & J Suykens & J Vanthienen, 2003. "Benchmarking state-of-the-art classification algorithms for credit scoring," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 54(6), pages 627-635, June.
    6. L C Thomas, 2010. "Consumer finance: challenges for operational research," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(1), pages 41-52, January.
    7. Ju, Yong Han & Sohn, So Young, 2014. "Updating a credit-scoring model based on new attributes without realization of actual data," European Journal of Operational Research, Elsevier, vol. 234(1), pages 119-126.
    8. Hofer, Vera, 2015. "Adapting a classification rule to local and global shift when only unlabelled data are available," European Journal of Operational Research, Elsevier, vol. 243(1), pages 177-189.
    9. Yao, Xiao & Crook, Jonathan & Andreeva, Galina, 2015. "Support vector regression for loss given default modelling," European Journal of Operational Research, Elsevier, vol. 240(2), pages 528-538.
    10. Tong, Edward N.C. & Mues, Christophe & Thomas, Lyn C., 2012. "Mixture cure models in credit scoring: If and when borrowers default," European Journal of Operational Research, Elsevier, vol. 218(1), pages 132-139.
    11. Yu, Lean & Wang, Shouyang & Lai, Kin Keung, 2009. "An intelligent-agent-based fuzzy group decision making model for financial multicriteria decision support: The case of credit scoring," European Journal of Operational Research, Elsevier, vol. 195(3), pages 942-959, June.
    12. Yang, Yingxu, 2007. "Adaptive credit scoring with kernel learning methods," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1521-1536, December.
    13. Akkoç, Soner, 2012. "An empirical comparison of conventional techniques, neural networks and the three stage hybrid Adaptive Neuro Fuzzy Inference System (ANFIS) model for credit scoring analysis: The case of Turkish cred," European Journal of Operational Research, Elsevier, vol. 222(1), pages 168-178.
    14. Dirick, Lore & Claeskens, Gerda & Baesens, Bart, 2015. "An Akaike information criterion for multiple event mixture cure models," European Journal of Operational Research, Elsevier, vol. 241(2), pages 449-457.
    15. Lee, Tian-Shyug & Chiu, Chih-Chou & Chou, Yu-Chao & Lu, Chi-Jie, 2006. "Mining the customer credit using classification and regression tree and multivariate adaptive regression splines," Computational Statistics & Data Analysis, Elsevier, vol. 50(4), pages 1113-1130, February.
    16. Ravi Kumar, P. & Ravi, V., 2007. "Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review," European Journal of Operational Research, Elsevier, vol. 180(1), pages 1-28, July.
    17. Calabrese, Raffaella, 2014. "Downturn Loss Given Default: Mixture distribution estimation," European Journal of Operational Research, Elsevier, vol. 237(1), pages 271-277.
    18. D J Hand, 2005. "Good practice in retail credit scorecard assessment," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 56(9), pages 1109-1117, September.
    19. Paleologo, Giuseppe & Elisseeff, André & Antonini, Gianluca, 2010. "Subagging for credit scoring models," European Journal of Operational Research, Elsevier, vol. 201(2), pages 490-499, March.
    20. Friedman, Jerome H., 2002. "Stochastic gradient boosting," Computational Statistics & Data Analysis, Elsevier, vol. 38(4), pages 367-378, February.
    21. Crone, Sven F. & Lessmann, Stefan & Stahlbock, Robert, 2006. "The impact of preprocessing on data mining: An evaluation of classifier sensitivity in direct marketing," European Journal of Operational Research, Elsevier, vol. 173(3), pages 781-800, September.
    22. Finlay, Steven, 2011. "Multiple classifier architectures and their application to credit risk assessment," European Journal of Operational Research, Elsevier, vol. 210(2), pages 368-378, April.
    23. D Martens & T Van Gestel & M De Backer & R Haesen & J Vanthienen & B Baesens, 2010. "Credit rating prediction using Ant Colony Optimization," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(4), pages 561-573, April.
    24. Liu, Fan & Hua, Zhongsheng & Lim, Andrew, 2015. "Identifying future defaulters: A hierarchical Bayesian method," European Journal of Operational Research, Elsevier, vol. 241(1), pages 202-211.
    25. Malhotra, Rashmi & Malhotra, D. K., 2003. "Evaluating consumer loans using neural networks," Omega, Elsevier, vol. 31(2), pages 83-96, April.
    26. Crook, Jonathan N. & Edelman, David B. & Thomas, Lyn C., 2007. "Recent developments in consumer credit risk assessment," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1447-1465, December.
    27. Verbeke, Wouter & Dejaeger, Karel & Martens, David & Hur, Joon & Baesens, Bart, 2012. "New insights into churn prediction in the telecommunication sector: A profit driven data mining approach," European Journal of Operational Research, Elsevier, vol. 218(1), pages 211-229.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Elena Ivona DUMITRESCU & Sullivan HUE & Christophe HURLIN & Sessi TOKPAVI, 2020. "Machine Learning or Econometrics for Credit Scoring: Let’s Get the Best of Both Worlds," LEO Working Papers / DR LEO 2839, Orleans Economics Laboratory / Laboratoire d'Economie d'Orleans (LEO), University of Orleans.
    2. Huei-Wen Teng & Michael Lee, 2019. "Estimation Procedures of Using Five Alternative Machine Learning Methods for Predicting Credit Card Default," Review of Pacific Basin Financial Markets and Policies (RPBFMP), World Scientific Publishing Co. Pte. Ltd., vol. 22(03), pages 1-27, September.
    3. Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
    4. Dimitris Andriosopoulos & Michalis Doumpos & Panos M. Pardalos & Constantin Zopounidis, 2019. "Computational approaches and data analytics in financial services: A literature review," Journal of the Operational Research Society, Taylor & Francis Journals, vol. 70(10), pages 1581-1599, October.
    5. José Willer Prado & Valderí Castro Alcântara & Francisval Melo Carvalho & Kelly Carvalho Vieira & Luiz Kennedy Cruz Machado & Dany Flávio Tonelli, 2016. "Multivariate analysis of credit risk and bankruptcy research data: a bibliometric study involving different knowledge fields (1968–2014)," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(3), pages 1007-1029, March.
    6. Akkoç, Soner, 2012. "An empirical comparison of conventional techniques, neural networks and the three stage hybrid Adaptive Neuro Fuzzy Inference System (ANFIS) model for credit scoring analysis: The case of Turkish cred," European Journal of Operational Research, Elsevier, vol. 222(1), pages 168-178.
    7. Guotai Chi & Zhipeng Zhang, 2017. "Multi Criteria Credit Rating Model for Small Enterprise Using a Nonparametric Method," Sustainability, MDPI, vol. 9(10), pages 1-23, October.
    8. Hussein A. Abdou & John Pointon, 2011. "Credit Scoring, Statistical Techniques And Evaluation Criteria: A Review Of The Literature," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 18(2-3), pages 59-88, April.
    9. Jiang, Cuiqing & Wang, Zhao & Zhao, Huimin, 2019. "A prediction-driven mixture cure model and its application in credit scoring," European Journal of Operational Research, Elsevier, vol. 277(1), pages 20-31.
    10. Crone, Sven F. & Finlay, Steven, 2012. "Instance sampling in credit scoring: An empirical study of sample size and balancing," International Journal of Forecasting, Elsevier, vol. 28(1), pages 224-238.
    11. Stefan Lessmann & Stefan Voß, 2010. "Customer-Centric Decision Support," Business & Information Systems Engineering: The International Journal of WIRTSCHAFTSINFORMATIK, Springer;Gesellschaft für Informatik e.V. (GI), vol. 2(2), pages 79-93, April.
    12. Parisa Golbayani & Ionuc{t} Florescu & Rupak Chatterjee, 2020. "A comparative study of forecasting Corporate Credit Ratings using Neural Networks, Support Vector Machines, and Decision Trees," Papers 2007.06617, arXiv.org.
    13. Kolesnikova, A. & Yang, Y. & Lessmann, S. & Ma, T. & Sung, M.-C. & Johnson, J.E.V., 2019. "Can Deep Learning Predict Risky Retail Investors? A Case Study in Financial Risk Behavior Forecasting," IRTG 1792 Discussion Papers 2019-023, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    14. Anna Stelzer, 2019. "Predicting credit default probabilities using machine learning techniques in the face of unequal class distributions," Papers 1907.12996, arXiv.org.
    15. Finlay, Steven, 2011. "Multiple classifier architectures and their application to credit risk assessment," European Journal of Operational Research, Elsevier, vol. 210(2), pages 368-378, April.
    16. Golbayani, Parisa & Florescu, Ionuţ & Chatterjee, Rupak, 2020. "A comparative study of forecasting corporate credit ratings using neural networks, support vector machines, and decision trees," The North American Journal of Economics and Finance, Elsevier, vol. 54(C).
    17. Adnan Dželihodžić & Dženana Đonko & Jasmin Kevrić, 2018. "Improved Credit Scoring Model Based on Bagging Neural Network," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 17(06), pages 1725-1741, November.
    18. Agustin Pérez-Martín & Agustin Pérez-Torregrosa & Alejandro Rabasa & Marta Vaca, 2020. "Feature Selection to Optimize Credit Banking Risk Evaluation Decisions for the Example of Home Equity Loans," Mathematics, MDPI, vol. 8(11), pages 1-16, November.
    19. Do, Hung Xuan & Rösch, Daniel & Scheule, Harald, 2018. "Predicting loss severities for residential mortgage loans: A three-step selection approach," European Journal of Operational Research, Elsevier, vol. 270(1), pages 246-259.
    20. Cuiqing Jiang & Zhao Wang & Ruiya Wang & Yong Ding, 2018. "Loan default prediction by combining soft information extracted from descriptive text in online peer-to-peer lending," Annals of Operations Research, Springer, vol. 266(1), pages 511-529, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:247:y:2015:i:1:p:124-136. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.