IDEAS home Printed from https://ideas.repec.org/a/eme/rafpps/raf-07-2017-0143.html
   My bibliography  Save this article

Predicting credit risk on the basis of financial and non-financial variables and data mining

Author

Listed:
  • Sihem Khemakhem
  • Younes Boujelbene

Abstract

Purpose - Data mining for predicting credit risk is a beneficial tool for financial institutions to evaluate the financial health of companies. However, the ubiquity of selecting parameters and the presence of unbalanced data sets is a very typical problem of this technique. This study aims to provide a new method for evaluating credit risk, taking into account not only financial and non-financial variables, but also the class imbalance. Design/methodology/approach - The most significant financial and non-financial variables were determined to build a credit scoring model and identify the creditworthiness of companies. Moreover, the Synthetic Minority Oversampling Technique was used to solve the problem of class imbalance and improve the performance of the classifier. The artificial neural networks and decision trees were designed to predict default risk. Findings - Results showed that profitability ratios, repayment capacity, solvency, duration of a credit report, guarantees, size of the company, loan number, ownership structure and the corporate banking relationship duration turned out to be the key factors in predicting default. Also, both algorithms were found to be highly sensitive to class imbalance. However, with balanced data, the decision trees displayed higher predictive accuracy for the assessment of credit risk than artificial neural networks. Originality/value - Classification results depend on the appropriateness of data characteristics and the appropriate analysis algorithm for data sets. The selection of financial and non-financial variables, as well as the resolution of class imbalance allows companies to assess their credit risk successfully.

Suggested Citation

  • Sihem Khemakhem & Younes Boujelbene, 2018. "Predicting credit risk on the basis of financial and non-financial variables and data mining," Review of Accounting and Finance, Emerald Group Publishing Limited, vol. 17(3), pages 316-340, August.
  • Handle: RePEc:eme:rafpps:raf-07-2017-0143
    DOI: 10.1108/RAF-07-2017-0143
    as

    Download full text from publisher

    File URL: https://www.emerald.com/insight/content/doi/10.1108/RAF-07-2017-0143/full/html?utm_source=repec&utm_medium=feed&utm_campaign=repec
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://www.emerald.com/insight/content/doi/10.1108/RAF-07-2017-0143/full/pdf?utm_source=repec&utm_medium=feed&utm_campaign=repec
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://libkey.io/10.1108/RAF-07-2017-0143?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Bo Gao, 2022. "The Use of Machine Learning Combined with Data Mining Technology in Financial Risk Prevention," Computational Economics, Springer;Society for Computational Economics, vol. 59(4), pages 1385-1405, April.
    2. Lang, Qiaoqi & Ma, Feng & Mirza, Nawazish & Umar, Muhammad, 2023. "The interaction of climate risk and bank liquidity: An emerging market perspective for transitions to low carbon energy," Technological Forecasting and Social Change, Elsevier, vol. 191(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eme:rafpps:raf-07-2017-0143. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Emerald Support (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.