IDEAS home Printed from https://ideas.repec.org/a/pal/jorsoc/v54y2003i6d10.1057_palgrave.jors.2601545.html
   My bibliography  Save this article

Benchmarking state-of-the-art classification algorithms for credit scoring

Author

Listed:
  • B Baesens

    (K.U.Leuven)

  • T Van Gestel

    (K.U.Leuven)

  • S Viaene

    (K.U.Leuven)

  • M Stepanova

    (UBS AG, Financial Services Group)

  • J Suykens

    (K.U.Leuven)

  • J Vanthienen

    (K.U.Leuven)

Abstract

In this paper, we study the performance of various state-of-the-art classification algorithms applied to eight real-life credit scoring data sets. Some of the data sets originate from major Benelux and UK financial institutions. Different types of classifiers are evaluated and compared. Besides the well-known classification algorithms (eg logistic regression, discriminant analysis, k-nearest neighbour, neural networks and decision trees), this study also investigates the suitability and performance of some recently proposed, advanced kernel-based classification algorithms such as support vector machines and least-squares support vector machines (LS-SVMs). The performance is assessed using the classification accuracy and the area under the receiver operating characteristic curve. Statistically significant performance differences are identified using the appropriate test statistics. It is found that both the LS-SVM and neural network classifiers yield a very good performance, but also simple classifiers such as logistic regression and linear discriminant analysis perform very well for credit scoring.

Suggested Citation

  • B Baesens & T Van Gestel & S Viaene & M Stepanova & J Suykens & J Vanthienen, 2003. "Benchmarking state-of-the-art classification algorithms for credit scoring," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 54(6), pages 627-635, June.
  • Handle: RePEc:pal:jorsoc:v:54:y:2003:i:6:d:10.1057_palgrave.jors.2601545
    DOI: 10.1057/palgrave.jors.2601545
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1057/palgrave.jors.2601545
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1057/palgrave.jors.2601545?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. J J Glen, 2001. "Classification accuracy in discriminant analysis: a mixed integer programming approach," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 52(3), pages 328-339, March.
    2. Desai, Vijay S. & Crook, Jonathan N. & Overstreet, George A., 1996. "A comparison of neural networks and linear scoring models in the credit union environment," European Journal of Operational Research, Elsevier, vol. 95(1), pages 24-37, November.
    3. M Stepanova & L C Thomas, 2001. "PHAB scores: proportional hazards analysis behavioural scores," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 52(9), pages 1007-1016, September.
    4. Steenackers, A. & Goovaerts, M. J., 1989. "A credit scoring model for personal loans," Insurance: Mathematics and Economics, Elsevier, vol. 8(1), pages 31-34, March.
    5. Thomas, Lyn C., 2000. "A survey of credit and behavioural scoring: forecasting financial risk of lending to consumers," International Journal of Forecasting, Elsevier, vol. 16(2), pages 149-172.
    6. Baesens, Bart & Viaene, Stijn & Van den Poel, Dirk & Vanthienen, Jan & Dedene, Guido, 2002. "Bayesian neural network learning for repeat purchase modelling in direct marketing," European Journal of Operational Research, Elsevier, vol. 138(1), pages 191-211, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hussein A. Abdou & John Pointon, 2011. "Credit Scoring, Statistical Techniques And Evaluation Criteria: A Review Of The Literature," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 18(2-3), pages 59-88, April.
    2. Elena Ivona DUMITRESCU & Sullivan HUE & Christophe HURLIN & Sessi TOKPAVI, 2020. "Machine Learning or Econometrics for Credit Scoring: Let’s Get the Best of Both Worlds," LEO Working Papers / DR LEO 2839, Orleans Economics Laboratory / Laboratoire d'Economie d'Orleans (LEO), University of Orleans.
    3. Robert Till & David Hand, 2003. "Behavioural models of credit card usage," Journal of Applied Statistics, Taylor & Francis Journals, vol. 30(10), pages 1201-1220.
    4. G Verstraeten & D Van den Poel, 2005. "The impact of sample bias on consumer credit scoring performance and profitability," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 56(8), pages 981-992, August.
    5. Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
    6. K. Coussement & D. Van Den Poel, 2008. "Improving Customer Attrition Prediction by Integrating Emotions from Client/Company Interaction Emails and Evaluating Multiple Classifiers," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 08/527, Ghent University, Faculty of Economics and Business Administration.
    7. Pulina, Manuela & Paba, Antonello, 2010. "A discrete choice approach to model credit card fraud," MPRA Paper 20019, University Library of Munich, Germany.
    8. Ahmed Almustfa Hussin Adam Khatir & Marco Bee, 2022. "Machine Learning Models and Data-Balancing Techniques for Credit Scoring: What Is the Best Combination?," Risks, MDPI, vol. 10(9), pages 1-22, August.
    9. Jianhua Jiang & Xianqiu Meng & Yang Liu & Huan Wang, 2022. "An Enhanced TSA-MLP Model for Identifying Credit Default Problems," SAGE Open, , vol. 12(2), pages 21582440221, April.
    10. Brad S. Trinkle & Amelia A. Baldwin, 2007. "Interpretable credit model development via artificial neural networks," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 15(3‐4), pages 123-147, July.
    11. B Baesens & T Van Gestel & M Stepanova & D Van den Poel & J Vanthienen, 2005. "Neural network survival analysis for personal loan data," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 56(9), pages 1089-1098, September.
    12. D Martens & T Van Gestel & M De Backer & R Haesen & J Vanthienen & B Baesens, 2010. "Credit rating prediction using Ant Colony Optimization," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(4), pages 561-573, April.
    13. Dongwoo Kim, 2023. "Can investors’ collective decision-making evolve? Evidence from peer-to-peer lending markets," Electronic Commerce Research, Springer, vol. 23(2), pages 1323-1358, June.
    14. Finlay, Steven, 2011. "Multiple classifier architectures and their application to credit risk assessment," European Journal of Operational Research, Elsevier, vol. 210(2), pages 368-378, April.
    15. Jing Quan & Xuelian Sun, 2024. "Credit risk assessment using the factorization machine model with feature interactions," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-10, December.
    16. Andrey Filchenkov & Natalia Khanzhina & Arina Tsai & Ivan Smetannikov, 2021. "Regularization of Autoencoders for Bank Client Profiling Based on Financial Transactions," Risks, MDPI, vol. 9(3), pages 1-16, March.
    17. Mestiri, Sami & Farhat, Abdejelil, 2018. "Credit Risk Prediction based on Bayesian estimation of logistic regression model with random effects," MPRA Paper 119960, University Library of Munich, Germany.
    18. repec:syb:wpbsba:03/2013 is not listed on IDEAS
    19. Brad S. Trinkle & Amelia A. Baldwin, 2016. "Research Opportunities for Neural Networks: The Case for Credit," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 23(3), pages 240-254, July.
    20. G Andreeva, 2006. "European generic scoring models using survival analysis," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 57(10), pages 1180-1187, October.
    21. Bart Baesens & Rudy Setiono & Christophe Mues & Jan Vanthienen, 2003. "Using Neural Network Rule Extraction and Decision Tables for Credit-Risk Evaluation," Management Science, INFORMS, vol. 49(3), pages 312-329, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pal:jorsoc:v:54:y:2003:i:6:d:10.1057_palgrave.jors.2601545. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.palgrave-journals.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.