IDEAS home Printed from https://ideas.repec.org/a/kap/compec/v65y2025i5d10.1007_s10614-025-10893-5.html
   My bibliography  Save this article

Integration of CNN Models and Machine Learning Methods in Credit Score Classification: 2D Image Transformation and Feature Extraction

Author

Listed:
  • Yunus Emre Gür

    (Fırat University)

  • Mesut Toğaçar

    (Fırat University)

  • Bilal Solak

    (Fırat University)

Abstract

The problem of accurately classifying credit scores is critical for financial institutions to assess individual creditworthiness and effectively manage credit risk. Traditional methods often face limitations when processing large datasets, resulting in lower accuracy and longer processing time. To address this issue, this paper proposes a novel approach to credit score classification by integrating convolutional neural networks (CNN) with machine learning methods. First, a 1D dataset of sequential text data is transformed into 2D greyscale images to use 2D CNN models for feature extraction and classification. Six CNN architectures—DenseNet201, GoogLeNet, MobileNetV2, ResNet18, ShuffleNet, and SqueezeNet—are implemented, and the features in the last layer (1000 features) of each CNN are classified using the softmax method. To further improve the performance, the two best CNN models were selected, and a new fully connected layer (NewFC) was added. A class-based feature set [3 × 31,695] representing three credit score types (good, poor, and standard) was extracted from each model and merged into a feature set [6 × 31,695]. This combined feature set was then reclassified using KNN, LDA, Naive Bayes, and SVM algorithms. The performance of both CNN and machine learning methods was evaluated using accuracy, precision, sensitivity, specificity, and F-score metrics. To optimize classification performance and reduce computational cost, the RelieF algorithm was used to select the best 5 out of 6 features. Compared to using all 6 features, significant improvements in accuracy and efficiency were observed, demonstrating the effectiveness of the proposed method in credit score classification.

Suggested Citation

  • Yunus Emre Gür & Mesut Toğaçar & Bilal Solak, 2025. "Integration of CNN Models and Machine Learning Methods in Credit Score Classification: 2D Image Transformation and Feature Extraction," Computational Economics, Springer;Society for Computational Economics, vol. 65(5), pages 2991-3035, May.
  • Handle: RePEc:kap:compec:v:65:y:2025:i:5:d:10.1007_s10614-025-10893-5
    DOI: 10.1007/s10614-025-10893-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10614-025-10893-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10614-025-10893-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. repec:hal:spmain:info:hdl:2441/6cbt691h0h8o9q5rf0apko0pda is not listed on IDEAS
    2. Cristina Demma, 2017. "Credit Scoring and the Quality of Business Credit During the Crisis," Economic Notes, Banca Monte dei Paschi di Siena SpA, vol. 46(2), pages 269-306, July.
    3. B Baesens & T Van Gestel & S Viaene & M Stepanova & J Suykens & J Vanthienen, 2003. "Benchmarking state-of-the-art classification algorithms for credit scoring," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 54(6), pages 627-635, June.
    4. D. J. Hand & W. E. Henley, 1997. "Statistical Classification Methods in Consumer Credit Scoring: a Review," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 160(3), pages 523-541, September.
    5. Thomas, Lyn C., 2000. "A survey of credit and behavioural scoring: forecasting financial risk of lending to consumers," International Journal of Forecasting, Elsevier, vol. 16(2), pages 149-172.
    6. repec:ers:journl:v:xxiv:y:2021:i:2:p:1134-1148 is not listed on IDEAS
    7. Jing Lei, 2020. "Cross-Validation With Confidence," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(532), pages 1978-1997, December.
    8. repec:spo:wpmain:info:hdl:2441/6cbt691h0h8o9q5rf0apko0pda is not listed on IDEAS
    9. Christoph Bergmeir, 2023. "Common Pitfalls and Better Practices in Forecast Evaluation for Data Scientists," Foresight: The International Journal of Applied Forecasting, International Institute of Forecasters, issue 70, pages 5-12, Q3.
    10. Longbing Cao & Qiang Yang & Philip S. Yu, 2020. "Data science and AI in FinTech: An overview," Papers 2007.12681, arXiv.org, revised Jul 2021.
    11. Dayu Xu & Xuyao Zhang & Hailin Feng, 2019. "Generalized fuzzy soft sets theory‐based novel hybrid ensemble credit scoring model," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 24(2), pages 903-921, April.
    12. Marcin Hernes & Adrianna Kozierkiewicz & Marcin Maleszka & Artur Rot & Agata Kozina & Karolina Matenczuk & Jakub Janus & Ewelina Wrobel, 2021. "Deep Learning for Repayment Prediction in Leasing Companies," European Research Studies Journal, European Research Studies Journal, vol. 0(2 - Part ), pages 1134-1148.
    13. Fourcade, Marion & Healy, Kieran, 2013. "Classification situations: Life-chances in the neoliberal era," Accounting, Organizations and Society, Elsevier, vol. 38(8), pages 559-572.
    14. Yufei Xia & Xinyi Guo & Yinguo Li & Lingyun He & Xueyuan Chen, 2022. "Deep learning meets decision trees: An application of a heterogeneous deep forest approach in credit scoring for online consumer lending," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(8), pages 1669-1690, December.
    15. Marion Fourcade & Kieran Healy, 2013. "Classification situations: Life-chances in the neoliberal era," Post-Print hal-03470535, HAL.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Linhui Wang & Jianping Zhu & Chenlu Zheng & Zhiyuan Zhang, 2024. "Incorporating Digital Footprints into Credit-Scoring Models through Model Averaging," Mathematics, MDPI, vol. 12(18), pages 1-15, September.
    2. Crone, Sven F. & Finlay, Steven, 2012. "Instance sampling in credit scoring: An empirical study of sample size and balancing," International Journal of Forecasting, Elsevier, vol. 28(1), pages 224-238.
    3. Nadia Ayed & Khemaies Bougatef, 2024. "Performance Assessment of Logistic Regression (LR), Artificial Neural Network (ANN), Fuzzy Inference System (FIS) and Adaptive Neuro-Fuzzy System (ANFIS) in Predicting Default Probability: The Case of," Computational Economics, Springer;Society for Computational Economics, vol. 64(3), pages 1803-1835, September.
    4. Rais Ahmad Itoo & A. Selvarasu & José António Filipe, 2015. "Loan Products and Credit Scoring by Commercial Banks (India)," International Journal of Finance, Insurance and Risk Management, International Journal of Finance, Insurance and Risk Management, vol. 5(1), pages 851-851.
    5. Ahmed Almustfa Hussin Adam Khatir & Marco Bee, 2022. "Machine Learning Models and Data-Balancing Techniques for Credit Scoring: What Is the Best Combination?," Risks, MDPI, vol. 10(9), pages 1-22, August.
    6. Andreea Costea, 2017. "A Quantitative Approach to Credit Risk Management in the Underwriting Process for the Retail Portfolio," Romanian Economic Journal, Department of International Business and Economics from the Academy of Economic Studies Bucharest, vol. 20(63), pages 157-186, March.
    7. G Verstraeten & D Van den Poel, 2005. "The impact of sample bias on consumer credit scoring performance and profitability," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 56(8), pages 981-992, August.
    8. Dinh, Thi Huyen Thanh & Kleimeier, Stefanie, 2007. "A credit scoring model for Vietnam's retail banking market," International Review of Financial Analysis, Elsevier, vol. 16(5), pages 471-495.
    9. Carlos Serrano-Cinca & Begoña Gutiérrez-Nieto & Nydia M. Reyes, 2013. "A Social Approach to Microfinance Credit Scoring," Working Papers CEB 13-013, ULB -- Universite Libre de Bruxelles.
    10. Huei-Wen Teng & Michael Lee, 2019. "Estimation Procedures of Using Five Alternative Machine Learning Methods for Predicting Credit Card Default," Review of Pacific Basin Financial Markets and Policies (RPBFMP), World Scientific Publishing Co. Pte. Ltd., vol. 22(03), pages 1-27, September.
    11. Pérez-Martín, A. & Pérez-Torregrosa, A. & Vaca, M., 2018. "Big Data techniques to measure credit banking risk in home equity loans," Journal of Business Research, Elsevier, vol. 89(C), pages 448-454.
    12. L C Thomas, 2010. "Consumer finance: challenges for operational research," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(1), pages 41-52, January.
    13. Tigges, Maximilian & Mestwerdt, Sönke & Tschirner, Sebastian & Mauer, René, 2024. "Who gets the money? A qualitative analysis of fintech lending and credit scoring through the adoption of AI and alternative data," Technological Forecasting and Social Change, Elsevier, vol. 205(C).
    14. Hong Wang & Qingsong Xu & Lifeng Zhou, 2015. "Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble," PLOS ONE, Public Library of Science, vol. 10(2), pages 1-20, February.
    15. Dangxing Chen & Weicheng Ye & Jiahui Ye, 2022. "Interpretable Selective Learning in Credit Risk," Papers 2209.10127, arXiv.org.
    16. Dinh, K. & Kleimeier, S., 2006. "Credit scoring for Vietnam's retail banking market : implementation and implications for transactional versus relationship lending," Research Memorandum 012, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
    17. Desiree Fields, 2022. "Automated landlord: Digital technologies and post-crisis financial accumulation," Environment and Planning A, , vol. 54(1), pages 160-181, February.
    18. Thomas Wainwright, 2011. "Elite Knowledges: Framing Risk and the Geographies of Credit," Environment and Planning A, , vol. 43(3), pages 650-665, March.
    19. Richard Chamboko & Jorge M. Bravo, 2016. "On the modelling of prognosis from delinquency to normal performance on retail consumer loans," Risk Management, Palgrave Macmillan, vol. 18(4), pages 264-287, December.
    20. Ejiogu, Amanze & Ambituuni, Ambisisi & Ejiogu, Chibuzo, 2021. "Accounting for accounting’s role in the neoliberalization processes of social housing in England: A Bourdieusian perspective," CRITICAL PERSPECTIVES ON ACCOUNTING, Elsevier, vol. 80(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:compec:v:65:y:2025:i:5:d:10.1007_s10614-025-10893-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.