IDEAS home Printed from https://ideas.repec.org/a/eee/reveco/v70y2020icp440-455.html
   My bibliography  Save this article

Can machine learning paradigm improve attribute noise problem in credit risk classification?

Author

Listed:
  • Yu, Lean
  • Huang, Xiaowen
  • Yin, Hang

Abstract

In this paper, a dual-voting-based learning paradigm is proposed to solve attribute noise problem in credit risk classification. In the proposed learning paradigm, three stages are involved. In the first stage, four indexes are introduced to evaluate the noise level of attributes. In the second stage, attributes with different noise levels are divided into different attribute sets in accordance with the dual-voting results of noise level. In the third stage, credit datasets with different attributes sets are dealt with different learning strategies and different de-noising methods for comparison purpose. In the proposed learning paradigm, a classification and regression tree (CART) model is adopted as the generic classifier to evaluate the performance on training datasets generated by different learning strategies and noise reduction methods. In addition, the performance of all learning strategies on sparse data with attribute noise is also discussed. Experimental results show that the proposed learning paradigm performs better than the benchmarks to solve the attribute noise problem not only in accuracy and its stability, but also in speediness. Further analysis indicates that the sparse data with attribute noise can further improve the stability of accuracy for a specific de-noising method. This implies that the proposed dual voting-based learning paradigm is a promising solution to attribute noise reduction in credit risk classification.

Suggested Citation

  • Yu, Lean & Huang, Xiaowen & Yin, Hang, 2020. "Can machine learning paradigm improve attribute noise problem in credit risk classification?," International Review of Economics & Finance, Elsevier, vol. 70(C), pages 440-455.
  • Handle: RePEc:eee:reveco:v:70:y:2020:i:c:p:440-455
    DOI: 10.1016/j.iref.2020.08.016
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1059056020301969
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.iref.2020.08.016?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zhang, Guoqiang & Y. Hu, Michael & Eddy Patuwo, B. & C. Indro, Daniel, 1999. "Artificial neural networks in bankruptcy prediction: General framework and cross-validation analysis," European Journal of Operational Research, Elsevier, vol. 116(1), pages 16-32, July.
    2. Wilson X. B. Li & Tina T. He & Stella P. L. Cho, 2019. "Government involvement in banking systems and economic growth: a comparison across countries," Economic and Political Studies, Taylor & Francis Journals, vol. 7(1), pages 35-65, January.
    3. Edward I. Altman, 1968. "The Prediction Of Corporate Bankruptcy: A Discriminant Analysis," Journal of Finance, American Finance Association, vol. 23(1), pages 193-194, March.
    4. R Florez-Lopez, 2010. "Effects of missing data in credit risk scoring. A comparative analysis of methods to achieve robustness in the absence of sufficient data," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(3), pages 486-501, March.
    5. Werner, Richard A., 2016. "A lost century in economics: Three theories of banking and the conclusive evidence," International Review of Financial Analysis, Elsevier, vol. 46(C), pages 361-379.
    6. Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
    7. Lean Yu & Zebin Yang & Ling Tang, 2016. "A novel multistage deep belief network based extreme learning machine ensemble learning paradigm for credit risk assessment," Flexible Services and Manufacturing Journal, Springer, vol. 28(4), pages 576-592, December.
    8. Feder, Gershon & Just, Richard E., 1977. "A study of debt servicing capacity applying logit analysis," Journal of Development Economics, Elsevier, vol. 4(1), pages 25-38, February.
    9. Gang Kou & Yi Peng & Yong Shi & Morgan Wise & Weixuan Xu, 2005. "Discovering Credit Cardholders’ Behavior by Multiple Criteria Linear Programming," Annals of Operations Research, Springer, vol. 135(1), pages 261-274, March.
    10. Qinhua Xu & William Chung, 2018. "Risk assessment of China’s Belt and Road Initiative’s sustainable investing: a data envelopment analysis approach," Economic and Political Studies, Taylor & Francis Journals, vol. 6(3), pages 319-337, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Liu, Yi & Yang, Menglong & Wang, Yudong & Li, Yongshan & Xiong, Tiancheng & Li, Anzhe, 2022. "Applying machine learning algorithms to predict default probability in the online credit market: Evidence from China," International Review of Financial Analysis, Elsevier, vol. 79(C).
    2. Aslam, Faheem & Hunjra, Ahmed Imran & Ftiti, Zied & Louhichi, Wael & Shams, Tahira, 2022. "Insurance fraud detection: Evidence from artificial intelligence and machine learning," Research in International Business and Finance, Elsevier, vol. 62(C).
    3. González, Marta Ramos & Ureña, Antonio Partal & Fernández-Aguado, Pilar Gómez, 2023. "Forecasting for regulatory credit loss derived from the COVID-19 pandemic: A machine learning approach," Research in International Business and Finance, Elsevier, vol. 64(C).
    4. Zhou, Yang & Xie, Chi & Wang, Gang-Jin & Zhu, You & Uddin, Gazi Salah, 2023. "Analysing and forecasting co-movement between innovative and traditional financial assets based on complex network and machine learning," Research in International Business and Finance, Elsevier, vol. 64(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lean Yu & Xinxie Li & Ling Tang & Zongyi Zhang & Gang Kou, 2015. "Social credit: a comprehensive literature review," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 1(1), pages 1-18, December.
    2. Le, Hong Hanh & Viviani, Jean-Laurent, 2018. "Predicting bank failure: An improvement by implementing a machine-learning approach to classical financial ratios," Research in International Business and Finance, Elsevier, vol. 44(C), pages 16-25.
    3. Lin, Fengyi & Yeh, Ching Chiang & Lee, Meng Yuan, 2013. "A Hybrid Business Failure Prediction Model Using Locally Linear Embedding And Support Vector Machines," Journal for Economic Forecasting, Institute for Economic Forecasting, vol. 0(1), pages 82-97, March.
    4. Zeineb Affes & Rania Hentati-Kaffel, 2016. "Forecast bankruptcy using a blend of clustering and MARS model - Case of US banks," Post-Print halshs-01314553, HAL.
    5. Mitroussi, K. & Abouarghoub, W. & Haider, J.J. & Pettit, S.J. & Tigka, N., 2016. "Performance drivers of shipping loans: An empirical investigation," International Journal of Production Economics, Elsevier, vol. 171(P3), pages 438-452.
    6. Juraini Zainol Abidin & Nur Adiana Hiau Abdullah & Karren Lee-Hwei Khaw, 2020. "Predicting SMEs Failure: Logistic Regression vs Artificial Neural Network Models," Capital Markets Review, Malaysian Finance Association, vol. 28(2), pages 29-41.
    7. Alina Mihaela Dima & Simona Vasilache, 2016. "Credit Risk modeling for Companies Default Prediction using Neural Networks," Journal for Economic Forecasting, Institute for Economic Forecasting, vol. 0(3), pages 127-143, September.
    8. Nasim Nasirpour & Alireza Mazdaki & Esmail Enayati, 2016. "The Investigation and Comparison of the Performance of Heuristic Methods in the Prediction of the Type of Auditor’s Opinion in Firms Accepted in Tehran Stock Exchange," Asian Social Science, Canadian Center of Science and Education, vol. 12(6), pages 148-148, June.
    9. Yajiao Tang & Junkai Ji & Yulin Zhu & Shangce Gao & Zheng Tang & Yuki Todo, 2019. "A Differential Evolution-Oriented Pruning Neural Network Model for Bankruptcy Prediction," Complexity, Hindawi, vol. 2019, pages 1-21, August.
    10. Rogelio A. Mancisidor & Kjersti Aas, 2022. "Multimodal Generative Models for Bankruptcy Prediction Using Textual Data," Papers 2211.08405, arXiv.org, revised Feb 2024.
    11. Hu, Yu-Chiang & Ansell, Jake, 2007. "Measuring retail company performance using credit scoring techniques," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1595-1606, December.
    12. En-Der Su & Shih-Ming Huang, 2010. "Comparing Firm Failure Predictions Between Logit, KMV, and ZPP Models: Evidence from Taiwan’s Electronics Industry," Asia-Pacific Financial Markets, Springer;Japanese Association of Financial Economics and Engineering, vol. 17(3), pages 209-239, September.
    13. Wikil Kwak & Yong Shi & Gang Kou, 2012. "Bankruptcy prediction for Korean firms after the 1997 financial crisis: using a multiple criteria linear programming data mining approach," Review of Quantitative Finance and Accounting, Springer, vol. 38(4), pages 441-453, May.
    14. Vladislav V. Afanasev & Yulia A. Tarasova, 2022. "Default Prediction for Housing and Utilities Management Firms Using Non-Financial Data," Finansovyj žhurnal — Financial Journal, Financial Research Institute, Moscow 125375, Russia, issue 6, pages 91-110, December.
    15. repec:ctc:sdimse:dime21_01 is not listed on IDEAS
    16. Ravi Kumar, P. & Ravi, V., 2007. "Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review," European Journal of Operational Research, Elsevier, vol. 180(1), pages 1-28, July.
    17. Adler Haymans Manurung & Derwin Suhartono & Benny Hutahayan & Noptovius Halimawan, 2023. "Probability Bankruptcy Using Support Vector Regression Machines," Journal of Applied Finance & Banking, SCIENPRESS Ltd, vol. 13(1), pages 1-3.
    18. Hussein A. Abdou & John Pointon, 2011. "Credit Scoring, Statistical Techniques And Evaluation Criteria: A Review Of The Literature," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 18(2-3), pages 59-88, April.
    19. Virág, Miklós & Kristóf, Tamás, 2005. "Az első hazai csődmodell újraszámítása neurális hálók segítségével [Recalculation of the first Hungarian bankruptcy-prediction model using neural networks]," Közgazdasági Szemle (Economic Review - monthly of the Hungarian Academy of Sciences), Közgazdasági Szemle Alapítvány (Economic Review Foundation), vol. 0(2), pages 144-162.
    20. Wu, Desheng(Dash) & Liang, Liang & Yang, Zijiang, 2008. "Analyzing the financial distress of Chinese public companies using probabilistic neural networks and multivariate discriminate analysis," Socio-Economic Planning Sciences, Elsevier, vol. 42(3), pages 206-220, September.
    21. Kamesh Korangi & Christophe Mues & Cristi'an Bravo, 2021. "A transformer-based model for default prediction in mid-cap corporate markets," Papers 2111.09902, arXiv.org, revised Apr 2023.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:reveco:v:70:y:2020:i:c:p:440-455. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/620165 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.