IDEAS home Printed from https://ideas.repec.org/a/spr/empeco/v61y2021i3d10.1007_s00181-020-01901-8.html
   My bibliography  Save this article

Classification with segmentation for credit scoring and bankruptcy prediction

Author

Listed:
  • Dalila Boughaci

    (University of Sciences and Technology Houari Boumediene)

  • Abdullah A. K. Alkhawaldeh

    (The Hashemite University)

  • Jamil J. Jaber

    (The University of Jordan)

  • Nawaf Hamadneh

    (Saudi Electronic University)

Abstract

Segmentation or clustering is a key concept in the economy. It can be defined as a grouping of interconnected firms or individuals in a particular domain. It is a powerful tool that may be used to form clusters in order to increase business productivity and efficiency. The aim of this paper is to investigate the concept of clusters in the development of credit scoring models. We propose a hybrid method based on clustering and random forest techniques. The purpose is to design an effective credit scoring model and enhance financial bankruptcy prediction. We use clustering to partition data into relatively homogeneous groups of credit applicants where the aim is to cluster creditworthy applicants against non-creditworthy ones. We consider k-means algorithm in the clustering step to segment similar applicants into groups. Then, we apply random forest learning technique on the clustered data. Empirical studies are conducted on six well-known financial datasets of different sizes. The numerical results are promising and show the effectiveness of the proposed technique for applicants segmentation. When clustering is used with classification, the resulting method succeeds in improving highly the classification performance.

Suggested Citation

  • Dalila Boughaci & Abdullah A. K. Alkhawaldeh & Jamil J. Jaber & Nawaf Hamadneh, 2021. "Classification with segmentation for credit scoring and bankruptcy prediction," Empirical Economics, Springer, vol. 61(3), pages 1281-1309, September.
  • Handle: RePEc:spr:empeco:v:61:y:2021:i:3:d:10.1007_s00181-020-01901-8
    DOI: 10.1007/s00181-020-01901-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00181-020-01901-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00181-020-01901-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. ., 2004. "Market dynamics associated with credit ratings: a literature review," Financial Stability Review, Banque de France, issue 4, pages 77-93, June.
    2. Wiginton, John C., 1980. "A Note on the Comparison of Logit and Discriminant Models of Consumer Credit Behavior," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 15(3), pages 757-770, September.
    3. Rosa Forte & Nancy Santos, 2015. "A cluster analysis of FDI in Latin America," Latin American Journal of Economics-formerly Cuadernos de Economía, Instituto de Economía. Pontificia Universidad Católica de Chile., vol. 52(1), pages 25-56, May.
    4. Desai, Vijay S. & Crook, Jonathan N. & Overstreet, George A., 1996. "A comparison of neural networks and linear scoring models in the credit union environment," European Journal of Operational Research, Elsevier, vol. 95(1), pages 24-37, November.
    5. Bertrand Candelon & Elena-Ivona Dumitrescu & Christophe Hurlin, 2012. "How to Evaluate an Early-Warning System: Toward a Unified Statistical Framework for Assessing Financial Crises Forecasting Methods," IMF Economic Review, Palgrave Macmillan;International Monetary Fund, vol. 60(1), pages 75-113, April.
    6. D. J. Hand & W. E. Henley, 1997. "Statistical Classification Methods in Consumer Credit Scoring: a Review," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 160(3), pages 523-541, September.
    7. Bertrand Candelon & Elena-Ivona Dumitrescu & Christophe Hurlin, 2012. "How to evaluate an Early Warning System ?," Working Papers halshs-00450050, HAL.
    8. Harold (Hal) Wolman & Diana Hincapie, 2015. "Clusters and Cluster-Based Development Policy," Economic Development Quarterly, , vol. 29(2), pages 135-149, May.
    9. Gonzalez, F. & Haas, F. & Johannes, R. & Persson, M. & Toledo, L. & Violi, R. & Zins, C. & Wieland, M., 2004. "Market dynamics associated with credit ratings: a literature review," Financial Stability Review, Banque de France, issue 4, pages 53-76, June.
    10. Gonzalez, F. & Haas, F. & Johannes, R. & Persson, M. & Toledo, L. & Violi, R. & Zins, C. & Wieland, M., 2004. "Market dynamics associated with credit ratings: a literature review," Financial Stability Review, Banque de France, issue 4, pages 53-76, June.
    11. Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
    12. Edward I. Altman, 1968. "The Prediction Of Corporate Bankruptcy: A Discriminant Analysis," Journal of Finance, American Finance Association, vol. 23(1), pages 193-194, March.
    13. Loretta J. Mester, 1997. "What's the point of credit scoring?," Business Review, Federal Reserve Bank of Philadelphia, issue Sep, pages 3-16.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
    2. Mizen, Paul & Tsoukas, Serafeim, 2012. "Forecasting US bond default ratings allowing for previous and initial state dependence in an ordered probit model," International Journal of Forecasting, Elsevier, vol. 28(1), pages 273-287.
    3. Eleimon Gonis & Salima Paul & Jon Tucker, 2012. "Rating or no rating? That is the question: an empirical examination of UK companies," The European Journal of Finance, Taylor & Francis Journals, vol. 18(8), pages 709-735, September.
    4. Yao-Zhi Xu & Jian-Lin Zhang & Ying Hua & Lin-Yue Wang, 2019. "Dynamic Credit Risk Evaluation Method for E-Commerce Sellers Based on a Hybrid Artificial Intelligence Model," Sustainability, MDPI, vol. 11(19), pages 1-17, October.
    5. Dinh, Thi Huyen Thanh & Kleimeier, Stefanie, 2007. "A credit scoring model for Vietnam's retail banking market," International Review of Financial Analysis, Elsevier, vol. 16(5), pages 471-495.
    6. Ha-Thu Nguyen, 2015. "How is credit scoring used to predict default in China?," EconomiX Working Papers 2015-1, University of Paris Nanterre, EconomiX.
    7. Hussein A. Abdou & John Pointon, 2011. "Credit Scoring, Statistical Techniques And Evaluation Criteria: A Review Of The Literature," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 18(2-3), pages 59-88, April.
    8. Maria Rocha Sousa & João Gama & Elísio Brandão, 2013. "Introducing time-changing economics into credit scoring," FEP Working Papers 513, Universidade do Porto, Faculdade de Economia do Porto.
    9. Scognamiglio, Elisabetta & Di Lorenzo, Emilia & Sibillo, Marilena & Trotta, Annarita, 2019. "Social uncertainty evaluation in Social Impact Bonds: Review and framework," Research in International Business and Finance, Elsevier, vol. 47(C), pages 40-56.
    10. Alexander B. Matthies, 2013. "Statistical properties and stability of ratings in a subset of US firms," SFB 649 Discussion Papers SFB649DP2013-002, Sonderforschungsbereich 649, Humboldt University, Berlin, Germany.
    11. Yiheng Li & Weidong Chen, 2020. "A Comparative Performance Assessment of Ensemble Learning for Credit Scoring," Mathematics, MDPI, vol. 8(10), pages 1-19, October.
    12. Goriunov Dmytro & Venzhyk Katerina, 2013. "Loan Default Prediction in Ukrainian Retail Banking," EERC Working Paper Series 13/07e, EERC Research Network, Russia and CIS.
    13. Lee, Tian-Shyug & Chiu, Chih-Chou & Chou, Yu-Chao & Lu, Chi-Jie, 2006. "Mining the customer credit using classification and regression tree and multivariate adaptive regression splines," Computational Statistics & Data Analysis, Elsevier, vol. 50(4), pages 1113-1130, February.
    14. Alexander B. Matthies, 2013. "Empirical Research on Corporate Credit-Ratings: A Literature Review," SFB 649 Discussion Papers SFB649DP2013-003, Sonderforschungsbereich 649, Humboldt University, Berlin, Germany.
    15. José Willer Prado & Valderí Castro Alcântara & Francisval Melo Carvalho & Kelly Carvalho Vieira & Luiz Kennedy Cruz Machado & Dany Flávio Tonelli, 2016. "Multivariate analysis of credit risk and bankruptcy research data: a bibliometric study involving different knowledge fields (1968–2014)," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(3), pages 1007-1029, March.
    16. Dinh, K. & Kleimeier, S., 2006. "Credit scoring for Vietnam's retail banking market : implementation and implications for transactional versus relationship lending," Research Memorandum 012, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
    17. Reiner Martin & Moreno Roma & Isabel Vansteenkiste, 2005. "Regulatory reforms in selected EU network industries," Occasional Paper Series 28, European Central Bank.
    18. Enrico Supino & Nicola Piras, 2022. "Le performance dei modelli di credit scoring in contesti di forte instabilit? macroeconomica: il ruolo delle Reti Neurali Artificiali," MANAGEMENT CONTROL, FrancoAngeli Editore, vol. 2022(2), pages 41-61.
    19. Trueck, Stefan & Rachev, Svetlozar T., 2008. "Rating Based Modeling of Credit Risk," Elsevier Monographs, Elsevier, edition 1, number 9780123736833.
    20. Zeineb Affes & Rania Hentati-Kaffel, 2016. "Forecast bankruptcy using a blend of clustering and MARS model - Case of US banks," Post-Print halshs-01314553, HAL.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:empeco:v:61:y:2021:i:3:d:10.1007_s00181-020-01901-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.