IDEAS home Printed from https://ideas.repec.org/a/ist/iujepr/v9y2022i2p397-424.html
   My bibliography  Save this article

Credit Scoring on Cash Flow Table with Machine Learning: XGBoost Approach

Author

Listed:
  • Guner Altan

    (Marmara Universitesi , Bankacilik ve Sigortacilik Enstitusu, Bankacilik Anabilim Dali, Istanbul,Türkiye)

  • Server Demirci

    (Marmara Üniversitesi , Bankacılık ve Sigortacılık Yüksekokulu, Bankacılık Bölümü, İstanbul, Türkiye)

Abstract

Machine learning methods have started being used with greater momentum in the banking and finance sectors alongside modernization and globalization. The ability to distinguish between good and bad customers has become extremely important, especially with the increase in credit products offered in the banking sector. This ability to distinguish not only increases banks’ profitability but also increases their competitiveness in the market. In this context, banks put companies through a credit evaluation process before loaning to them, and the most important leg of this process is undoubtedly the credit score analysis. Considering that one of the most important risks banks carry is credit risk, the importance of correctly, reliably, and quickly completing the balanced scorecard study during the credit evaluation process cannot be denied. Whether the company undergoing a scorecard study is an independent company or part of a group of companies may change how the company or firms are evaluated. In a group of companies, no matter how good a rating one company has in regard to its status within the parent company, if the other companies have low ratings, this may affect and reduce the consolidated rating. In this context, the current study focuses on groups of companies. The aim of the study is to try to develop a scorecard model using the cash flow statements of consolidated companies. In this study, eXtreme Gradient Boosting (XGBoost), Gradient Boosting and Artificial Neural Network algorithms which are machine learning techniques and Python program were used. These three methods were compared, and the extreme gradient boosting method was shown to be the preferred model with an accuracy rating of 80%.

Suggested Citation

  • Guner Altan & Server Demirci, 2022. "Credit Scoring on Cash Flow Table with Machine Learning: XGBoost Approach," Journal of Economic Policy Researches, Istanbul University, Faculty of Economics, vol. 9(2), pages 397-424, July.
  • Handle: RePEc:ist:iujepr:v:9:y:2022:i:2:p:397-424
    DOI: 10.26650/JEPR1114842
    as

    Download full text from publisher

    File URL: https://cdn.istanbul.edu.tr/file/JTA6CLJ8T5/88F68CE5200D4D14B219ED90207CAD45
    Download Restriction: no

    File URL: https://iupress.istanbul.edu.tr/en/journal/jepr/article/makine-ogrenmesi-ile-nakit-akis-tablosu-uzerinden-kredi-skorlamasi-xgboost-yaklasimi
    Download Restriction: no

    File URL: https://libkey.io/10.26650/JEPR1114842?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Lara Marie Demajo & Vince Vella & Alexiei Dingli, 2020. "Explainable AI for Interpretable Credit Scoring," Papers 2012.03749, arXiv.org.
    2. Dominique Guegan & Bertrand Hassani, 2018. "Regulatory learning: How to supervise machine learning models? An application to credit scoring," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-01835213, HAL.
    3. Nehrebecka Natalia, 2018. "Predicting the Default Risk of Companies. Comparison of Credit Scoring Models: Logit Vs Support Vector Machines," Econometrics. Advances in Applied Data Analysis, Sciendo, vol. 22(2), pages 54-73, June.
    4. Yiheng Li & Weidong Chen, 2020. "A Comparative Performance Assessment of Ensemble Learning for Credit Scoring," Mathematics, MDPI, vol. 8(10), pages 1-19, October.
    5. Dominique Guegan & Bertrand Hassani, 2018. "Regulatory learning: How to supervise machine learning models? An application to credit scoring," Post-Print halshs-01835213, HAL.
    6. Apostolos Ampountolas & Titus Nyarko Nde & Paresh Date & Corina Constantinescu, 2021. "A Machine Learning Approach for Micro-Credit Scoring," Risks, MDPI, vol. 9(3), pages 1-20, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Charlene H. Chu & Simon Donato-Woodger & Shehroz S. Khan & Rune Nyrup & Kathleen Leslie & Alexandra Lyn & Tianyu Shi & Andria Bianchi & Samira Abbasgholizadeh Rahimi & Amanda Grenier, 2023. "Age-related bias and artificial intelligence: a scoping review," Palgrave Communications, Palgrave Macmillan, vol. 10(1), pages 1-17, December.
    2. Andrés Alonso Robisco & José Manuel Carbó Martínez, 2022. "Measuring the model risk-adjusted performance of machine learning algorithms in credit default prediction," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-35, December.
    3. Andrés Alonso & José Manuel Carbó, 2021. "Understanding the performance of machine learning models to predict credit default: a novel approach for supervisory evaluation," Working Papers 2105, Banco de España.
    4. Alonso-Robisco, Andrés & Carbó, José Manuel, 2022. "Can machine learning models save capital for banks? Evidence from a Spanish credit portfolio," International Review of Financial Analysis, Elsevier, vol. 84(C).
    5. Dimitrios Nikolaidis & Michalis Doumpos, 2022. "Credit Scoring with Drift Adaptation Using Local Regions of Competence," SN Operations Research Forum, Springer, vol. 3(4), pages 1-28, December.
    6. Jorge Tejero, 2022. "Unwrapping black box models A case study in credit risk," Revista de Estabilidad Financiera, Banco de España, issue NOV.
    7. Damiano Brigo & Xiaoshan Huang & Andrea Pallavicini & Haitz Saez de Ocariz Borde, 2021. "Interpretability in deep learning for finance: a case study for the Heston model," Papers 2104.09476, arXiv.org.
    8. Yang Liu & Fei Huang & Lili Ma & Qingguo Zeng & Jiale Shi, 2024. "Credit scoring prediction leveraging interpretable ensemble learning," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(2), pages 286-308, March.
    9. Anil Kumar & Suneel Sharma & Mehregan Mahdavi, 2021. "Machine Learning (ML) Technologies for Digital Credit Scoring in Rural Finance: A Literature Review," Risks, MDPI, vol. 9(11), pages 1-15, October.
    10. Indy Man Kit Ho & Anthony Weldon & Jason Tze Ho Yong & Candy Tze Tim Lam & Jaime Sampaio, 2023. "Using Machine Learning Algorithms to Pool Data from Meta-Analysis for the Prediction of Countermovement Jump Improvement," IJERPH, MDPI, vol. 20(10), pages 1-15, May.
    11. Mohamed Hamitouche & Jose-Luis Molina, 2022. "A Review of AI Methods for the Prediction of High-Flow Extremal Hydrology," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(10), pages 3859-3876, August.
    12. Roman P. Bulyga & Alexey A. Sitnov & Liudmila V. Kashirskaya & Irina V. Safonova, 2020. "Transparency of credit institutions," Entrepreneurship and Sustainability Issues, VsI Entrepreneurship and Sustainability Center, vol. 7(4), pages 3158-3172, June.
    13. Lukasz Prorokowski, 2022. "New definition of default," Bank i Kredyt, Narodowy Bank Polski, vol. 53(5), pages 523-564.
    14. Jorge Tejero, 2022. "Unwrapping black box models A case study in credit risk," Financial Stability Review, Banco de España, issue NOV.
    15. Babek Erdebilli & Burcu Devrim-İçtenbaş, 2022. "Ensemble Voting Regression Based on Machine Learning for Predicting Medical Waste: A Case from Turkey," Mathematics, MDPI, vol. 10(14), pages 1-16, July.
    16. Babaei, Golnoosh & Giudici, Paolo & Raffinetti, Emanuela, 2023. "Explainable FinTech lending," Journal of Economics and Business, Elsevier, vol. 125.
    17. Corina Constantinescu & Julia Eisenberg, 2021. "Special Issue “Interplay between Financial and Actuarial Mathematics”," Risks, MDPI, vol. 9(8), pages 1-3, July.
    18. Lisa Crosato & Caterina Liberati & Marco Repetto, 2021. "Look Who's Talking: Interpretable Machine Learning for Assessing Italian SMEs Credit Default," Papers 2108.13914, arXiv.org, revised Sep 2021.
    19. Paritosh Navinchandra Jha & Marco Cucculelli, 2021. "A New Model Averaging Approach in Predicting Credit Risk Default," Risks, MDPI, vol. 9(6), pages 1-15, June.
    20. Christian Kurniawan & Xiyu Deng & Adhiraj Chakraborty & Assane Gueye & Niangjun Chen & Yorie Nakahira, 2022. "A Learning and Control Perspective for Microfinance," Papers 2207.12631, arXiv.org, revised Dec 2022.

    More about this item

    Keywords

    Machine Learning; XGBoost; Credit Scoring; Python; Artificial Neural Network JEL Classification : C13; C62; C69;
    All these keywords.

    JEL classification:

    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • C62 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Existence and Stability Conditions of Equilibrium
    • C69 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Other

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ist:iujepr:v:9:y:2022:i:2:p:397-424. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Ertugrul YASAR (email available below). General contact details of provider: https://edirc.repec.org/data/ifisttr.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.