IDEAS home Printed from https://ideas.repec.org/a/wly/ijfiec/v26y2021i3p4372-4385.html

A heterogeneous ensemble credit scoring model based on adaptive classifier selection: An application on imbalanced data

Author

Listed:
  • Tong Zhang
  • Guotai Chi

Abstract

In the domain of credit scoring, the number of bad clients is far less than that of good ones. So imbalanced data classification is a realisitc and critical issue in the credit scoring process. In this study, a novel heterogeneous ensemble credit scoring model is proposed for the problem of imbalanced data classification. This proposed model is on basis of five standard classifiers, namely LSVM, KNN, MDA, DT, LR, and adaptively selects the base classifiers with highest AUC according to the data distribution, then integrates all base classifiers to obtain a prediction. Finally, by using five comprehensive performance measures and four classical credit datasets, we find that the proposed model is better than other baseline models. This novel model can be applied to actual credit scoring and assist financial institutions in credit risk management.

Suggested Citation

  • Tong Zhang & Guotai Chi, 2021. "A heterogeneous ensemble credit scoring model based on adaptive classifier selection: An application on imbalanced data," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 26(3), pages 4372-4385, July.
  • Handle: RePEc:wly:ijfiec:v:26:y:2021:i:3:p:4372-4385
    DOI: 10.1002/ijfe.2019
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/ijfe.2019
    Download Restriction: no

    File URL: https://libkey.io/10.1002/ijfe.2019?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Wiginton, John C., 1980. "A Note on the Comparison of Logit and Discriminant Models of Consumer Credit Behavior," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 15(3), pages 757-770, September.
    2. Miguel Almunia & Agustín Bénétrix & Barry Eichengreen & Kevin H. O’Rourke & Gisela Rua, 2010. "From Great Depression to Great Credit Crisis: similarities, differences and lessons [Germany: Guns, butter, and economic miracles]," Economic Policy, CEPR, CESifo, Sciences Po;CES;MSH, vol. 25(62), pages 219-265.
    3. Edward I. Altman, 1968. "Financial Ratios, Discriminant Analysis And The Prediction Of Corporate Bankruptcy," Journal of Finance, American Finance Association, vol. 23(4), pages 589-609, September.
    4. Edward I. Altman, 1968. "The Prediction Of Corporate Bankruptcy: A Discriminant Analysis," Journal of Finance, American Finance Association, vol. 23(1), pages 193-194, March.
    5. Yang, Yingxu, 2007. "Adaptive credit scoring with kernel learning methods," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1521-1536, December.
    6. Longstaff, Francis A., 2010. "The subprime credit crisis and contagion in financial markets," Journal of Financial Economics, Elsevier, vol. 97(3), pages 436-450, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Li, Zhe & Liang, Shuguang & Pan, Xianyou & Pang, Meng, 2024. "Credit risk prediction based on loan profit: Evidence from Chinese SMEs," Research in International Business and Finance, Elsevier, vol. 67(PA).
    2. Linhui Wang & Jianping Zhu & Chenlu Zheng & Zhiyuan Zhang, 2024. "Incorporating Digital Footprints into Credit-Scoring Models through Model Averaging," Mathematics, MDPI, vol. 12(18), pages 1-15, September.
    3. Song, Yingying & Chen, Xinxin, 2025. "Which opinion is more trustworthy: An analysts’ earnings forecast quality assessment framework based on machine learning," The North American Journal of Economics and Finance, Elsevier, vol. 75(PB).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zeineb Affes & Rania Hentati-Kaffel, 2016. "Forecast bankruptcy using a blend of clustering and MARS model - Case of US banks," Post-Print halshs-01314553, HAL.
    2. Natalia Nehrebecka, 2021. "Internal Credit Risk Models and Digital Transformation: What to Prepare for? An Application to Poland," European Research Studies Journal, European Research Studies Journal, vol. 0(Special 2), pages 719-736.
    3. Doshi, Hitesh & Patel, Saurin & Ramani, Srikanth & Sooy, Matthew, 2023. "Uncertain tone, asset volatility and credit default swap spreads," Journal of Contemporary Accounting and Economics, Elsevier, vol. 19(3).
    4. Wolfgang K. Härdle & Rouslan A. Moro & Dorothea Schäfer, 2004. "Rating Companies with Support Vector Machines," Discussion Papers of DIW Berlin 416, DIW Berlin, German Institute for Economic Research.
    5. Yao-Zhi Xu & Jian-Lin Zhang & Ying Hua & Lin-Yue Wang, 2019. "Dynamic Credit Risk Evaluation Method for E-Commerce Sellers Based on a Hybrid Artificial Intelligence Model," Sustainability, MDPI, vol. 11(19), pages 1-17, October.
    6. Keith Anderson & Anup Chowdhury & Moshfique Uddin, 2025. "Piotroski's Fscore under varying economic conditions," Review of Quantitative Finance and Accounting, Springer, vol. 64(3), pages 1261-1307, April.
    7. Tang, Lingxiao & Cai, Fei & Ouyang, Yao, 2019. "Applying a nonparametric random forest algorithm to assess the credit risk of the energy industry in China," Technological Forecasting and Social Change, Elsevier, vol. 144(C), pages 563-572.
    8. Anatoly Peresetsky & Alexandr Karminsky & Sergei Golovan, 2011. "Probability of default models of Russian banks," Economic Change and Restructuring, Springer, vol. 44(4), pages 297-334, November.
    9. Aytekin Ertan & Yun Lee & Regina Wittenberg-Moerman, 2025. "Unexpected defaults: the role of information opacity," Review of Accounting Studies, Springer, vol. 30(1), pages 899-949, March.
    10. Xia, Yufei & Han, Zhiyin & Li, Yawen & He, Lingyun, 2025. "Credit scoring model for fintech lending: An integration of large language models and FocalPoly loss," International Journal of Forecasting, Elsevier, vol. 41(3), pages 894-919.
    11. Peresetsky, A. A., 2011. "What factors drive the Russian banks license withdrawal," MPRA Paper 41507, University Library of Munich, Germany.
    12. Ha-Thu Nguyen, 2014. "Default Predictors in Credit Scoring - Evidence from France’s Retail Banking Institution," EconomiX Working Papers 2014-26, University of Paris Nanterre, EconomiX.
    13. Jeong-Bon Kim & Li Li & Mary L. Z. Ma & Frank M. Song, 2013. "CEO Option Compensation, Risk-Taking Incentives, and Systemic Risk in the Banking Industry," Working Papers 182013, Hong Kong Institute for Monetary Research.
    14. Hussein A. Abdou & John Pointon, 2011. "Credit Scoring, Statistical Techniques And Evaluation Criteria: A Review Of The Literature," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 18(2-3), pages 59-88, April.
    15. Anatoly Peresetsky, 2013. "Modeling reasons for Russian bank license withdrawal: Unaccounted factors," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 30(2), pages 49-64.
    16. Moro, Russ & Härdle, Wolfgang Karl & Aliakbari, Saeideh & Hoffmann, Linda, 2011. "Forecasting corporate distress in the Asian and Pacific region," SFB 649 Discussion Papers 2011-023, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.
    17. Xuhui Wang & Lifang Zhang & Jianzhou Wang & Zhenkun Liu & Xinsong Niu, 2026. "Profit-oriented loan default prediction for the financial industry: a fusion framework with interpretability," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 12(1), pages 1-33, December.
    18. Anatoly Peresetsky & Alexandr Karminsky & Sergei Golovan, 2011. "Probability of default models of Russian banks," Economic Change and Restructuring, Springer, vol. 44(4), pages 297-334, November.
    19. Ying Zhou & Xia Lin & Guotai Chi & Peng Jin & Mengtong Li, 2024. "EWT‐SMOTE to improve default prediction performance in imbalanced data: Analysis of Chinese data," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(3), pages 615-643, April.
    20. Пересецкий А.А., 2007. "Методы Оценки Вероятности Дефолта Банков," Журнал Экономика и математические методы (ЭММ), Центральный Экономико-Математический Институт (ЦЭМИ), vol. 43(3), июль.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:ijfiec:v:26:y:2021:i:3:p:4372-4385. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.interscience.wiley.com/jpages/1076-9307/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.