IDEAS home Printed from https://ideas.repec.org/a/eee/reveco/v70y2020icp440-455.html
   My bibliography  Save this article

Can machine learning paradigm improve attribute noise problem in credit risk classification?

Author

Listed:
  • Yu, Lean
  • Huang, Xiaowen
  • Yin, Hang

Abstract

In this paper, a dual-voting-based learning paradigm is proposed to solve attribute noise problem in credit risk classification. In the proposed learning paradigm, three stages are involved. In the first stage, four indexes are introduced to evaluate the noise level of attributes. In the second stage, attributes with different noise levels are divided into different attribute sets in accordance with the dual-voting results of noise level. In the third stage, credit datasets with different attributes sets are dealt with different learning strategies and different de-noising methods for comparison purpose. In the proposed learning paradigm, a classification and regression tree (CART) model is adopted as the generic classifier to evaluate the performance on training datasets generated by different learning strategies and noise reduction methods. In addition, the performance of all learning strategies on sparse data with attribute noise is also discussed. Experimental results show that the proposed learning paradigm performs better than the benchmarks to solve the attribute noise problem not only in accuracy and its stability, but also in speediness. Further analysis indicates that the sparse data with attribute noise can further improve the stability of accuracy for a specific de-noising method. This implies that the proposed dual voting-based learning paradigm is a promising solution to attribute noise reduction in credit risk classification.

Suggested Citation

  • Yu, Lean & Huang, Xiaowen & Yin, Hang, 2020. "Can machine learning paradigm improve attribute noise problem in credit risk classification?," International Review of Economics & Finance, Elsevier, vol. 70(C), pages 440-455.
  • Handle: RePEc:eee:reveco:v:70:y:2020:i:c:p:440-455
    DOI: 10.1016/j.iref.2020.08.016
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1059056020301969
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.iref.2020.08.016?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zhang, Guoqiang & Y. Hu, Michael & Eddy Patuwo, B. & C. Indro, Daniel, 1999. "Artificial neural networks in bankruptcy prediction: General framework and cross-validation analysis," European Journal of Operational Research, Elsevier, vol. 116(1), pages 16-32, July.
    2. Wilson X. B. Li & Tina T. He & Stella P. L. Cho, 2019. "Government involvement in banking systems and economic growth: a comparison across countries," Economic and Political Studies, Taylor & Francis Journals, vol. 7(1), pages 35-65, January.
    3. Edward I. Altman, 1968. "The Prediction Of Corporate Bankruptcy: A Discriminant Analysis," Journal of Finance, American Finance Association, vol. 23(1), pages 193-194, March.
    4. R Florez-Lopez, 2010. "Effects of missing data in credit risk scoring. A comparative analysis of methods to achieve robustness in the absence of sufficient data," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(3), pages 486-501, March.
    5. Werner, Richard A., 2016. "A lost century in economics: Three theories of banking and the conclusive evidence," International Review of Financial Analysis, Elsevier, vol. 46(C), pages 361-379.
    6. Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
    7. Lean Yu & Zebin Yang & Ling Tang, 2016. "A novel multistage deep belief network based extreme learning machine ensemble learning paradigm for credit risk assessment," Flexible Services and Manufacturing Journal, Springer, vol. 28(4), pages 576-592, December.
    8. Feder, Gershon & Just, Richard E., 1977. "A study of debt servicing capacity applying logit analysis," Journal of Development Economics, Elsevier, vol. 4(1), pages 25-38, February.
    9. Gang Kou & Yi Peng & Yong Shi & Morgan Wise & Weixuan Xu, 2005. "Discovering Credit Cardholders’ Behavior by Multiple Criteria Linear Programming," Annals of Operations Research, Springer, vol. 135(1), pages 261-274, March.
    10. Qinhua Xu & William Chung, 2018. "Risk assessment of China’s Belt and Road Initiative’s sustainable investing: a data envelopment analysis approach," Economic and Political Studies, Taylor & Francis Journals, vol. 6(3), pages 319-337, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Liu, Yi & Yang, Menglong & Wang, Yudong & Li, Yongshan & Xiong, Tiancheng & Li, Anzhe, 2022. "Applying machine learning algorithms to predict default probability in the online credit market: Evidence from China," International Review of Financial Analysis, Elsevier, vol. 79(C).
    2. Aslam, Faheem & Hunjra, Ahmed Imran & Ftiti, Zied & Louhichi, Wael & Shams, Tahira, 2022. "Insurance fraud detection: Evidence from artificial intelligence and machine learning," Research in International Business and Finance, Elsevier, vol. 62(C).
    3. González, Marta Ramos & Ureña, Antonio Partal & Fernández-Aguado, Pilar Gómez, 2023. "Forecasting for regulatory credit loss derived from the COVID-19 pandemic: A machine learning approach," Research in International Business and Finance, Elsevier, vol. 64(C).
    4. Zhou, Yang & Xie, Chi & Wang, Gang-Jin & Zhu, You & Uddin, Gazi Salah, 2023. "Analysing and forecasting co-movement between innovative and traditional financial assets based on complex network and machine learning," Research in International Business and Finance, Elsevier, vol. 64(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lean Yu & Xinxie Li & Ling Tang & Zongyi Zhang & Gang Kou, 2015. "Social credit: a comprehensive literature review," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 1(1), pages 1-18, December.
    2. Le, Hong Hanh & Viviani, Jean-Laurent, 2018. "Predicting bank failure: An improvement by implementing a machine-learning approach to classical financial ratios," Research in International Business and Finance, Elsevier, vol. 44(C), pages 16-25.
    3. Mitroussi, K. & Abouarghoub, W. & Haider, J.J. & Pettit, S.J. & Tigka, N., 2016. "Performance drivers of shipping loans: An empirical investigation," International Journal of Production Economics, Elsevier, vol. 171(P3), pages 438-452.
    4. Juraini Zainol Abidin & Nur Adiana Hiau Abdullah & Karren Lee-Hwei Khaw, 2020. "Predicting SMEs Failure: Logistic Regression vs Artificial Neural Network Models," Capital Markets Review, Malaysian Finance Association, vol. 28(2), pages 29-41.
    5. Nasim Nasirpour & Alireza Mazdaki & Esmail Enayati, 2016. "The Investigation and Comparison of the Performance of Heuristic Methods in the Prediction of the Type of Auditor’s Opinion in Firms Accepted in Tehran Stock Exchange," Asian Social Science, Canadian Center of Science and Education, vol. 12(6), pages 148-148, June.
    6. En-Der Su & Shih-Ming Huang, 2010. "Comparing Firm Failure Predictions Between Logit, KMV, and ZPP Models: Evidence from Taiwan’s Electronics Industry," Asia-Pacific Financial Markets, Springer;Japanese Association of Financial Economics and Engineering, vol. 17(3), pages 209-239, September.
    7. Vladislav V. Afanasev & Yulia A. Tarasova, 2022. "Default Prediction for Housing and Utilities Management Firms Using Non-Financial Data," Finansovyj žhurnal — Financial Journal, Financial Research Institute, Moscow 125375, Russia, issue 6, pages 91-110, December.
    8. Adler Haymans Manurung & Derwin Suhartono & Benny Hutahayan & Noptovius Halimawan, 2023. "Probability Bankruptcy Using Support Vector Regression Machines," Journal of Applied Finance & Banking, SCIENPRESS Ltd, vol. 13(1), pages 1-3.
    9. Hussein A. Abdou & John Pointon, 2011. "Credit Scoring, Statistical Techniques And Evaluation Criteria: A Review Of The Literature," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 18(2-3), pages 59-88, April.
    10. Wu, Desheng(Dash) & Liang, Liang & Yang, Zijiang, 2008. "Analyzing the financial distress of Chinese public companies using probabilistic neural networks and multivariate discriminate analysis," Socio-Economic Planning Sciences, Elsevier, vol. 42(3), pages 206-220, September.
    11. Kamesh Korangi & Christophe Mues & Cristi'an Bravo, 2021. "A transformer-based model for default prediction in mid-cap corporate markets," Papers 2111.09902, arXiv.org, revised Apr 2023.
    12. Andrea Bedin & Monica Billio & Michele Costola & Loriana Pelizzon, 2019. "Credit Scoring in SME Asset-Backed Securities: An Italian Case Study," JRFM, MDPI, vol. 12(2), pages 1-28, May.
    13. Su-Han Woo & Min-Su Kwon & Kum Fai Yuen, 2021. "Financial determinants of credit risk in the logistics and shipping industries," Maritime Economics & Logistics, Palgrave Macmillan;International Association of Maritime Economists (IAME), vol. 23(2), pages 268-290, June.
    14. Qunfeng LIAO & Seyed MEHDIAN, 2016. "Measuring Financial Distress And Predicting Corporate Bankruptcy: An Index Approach," Review of Economic and Business Studies, Alexandru Ioan Cuza University, Faculty of Economics and Business Administration, issue 17, pages 33-51, June.
    15. Youssef Zizi & Amine Jamali-Alaoui & Badreddine El Goumi & Mohamed Oudgou & Abdeslam El Moudden, 2021. "An Optimal Model of Financial Distress Prediction: A Comparative Study between Neural Networks and Logistic Regression," Risks, MDPI, vol. 9(11), pages 1-24, November.
    16. Ilyes Abid & Rim Ayadi & Khaled Guesmi & Farid Mkaouar, 2022. "A new approach to deal with variable selection in neural networks: an application to bankruptcy prediction," Annals of Operations Research, Springer, vol. 313(2), pages 605-623, June.
    17. Javed Iqbal & Furrukh Bashir & Rashid Ahmad & Hina Arshad, 2022. "Predicting Bankruptcy through Neural Network:Case of PSX Listed Companies," iRASD Journal of Management, International Research Alliance for Sustainable Development (iRASD), vol. 4(2), pages 299-315, june.
    18. Korangi, Kamesh & Mues, Christophe & Bravo, Cristián, 2023. "A transformer-based model for default prediction in mid-cap corporate markets," European Journal of Operational Research, Elsevier, vol. 308(1), pages 306-320.
    19. Dejan JOVANOVIĆ & Mirjana TODOROVIĆ & Milka GRBIĆ, 2017. "Financial Indicators As Predictors Of Illiquidity," Journal for Economic Forecasting, Institute for Economic Forecasting, vol. 0(1), pages 128-149, March.
    20. Şaban Çelik, 2013. "Micro Credit Risk Metrics: A Comprehensive Review," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 20(4), pages 233-272, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:reveco:v:70:y:2020:i:c:p:440-455. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/620165 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.