IDEAS home Printed from https://ideas.repec.org/p/cdl/ucscec/qt3v33k65c.html
   My bibliography  Save this paper

A novel method for credit scoring based on feature transformation and ensemble model

Author

Listed:
  • Li, Hongxiang
  • Feng, Ao
  • Lin, Bin
  • Su, Houcheng
  • Liu, Zixi
  • Duan, Xuliang
  • Pu, Haibo
  • Wang, Yifei

Abstract

Credit scoring is a very critical task for banks and other financial institutions, and it has become an important evaluation metric to distinguish potential defaulting users. In this paper, we propose a credit score prediction method based on feature transformation and ensemble model, which is essentially a cascade approach. The feature transformation process consisting of boosting trees (BT) and auto-encoders (AE) is employed to replace manual feature engineering and to solve the data imbalance problem. For the classification process, this paper designs a heterogeneous ensemble model by weighting the factorization machine (FM) and deep neural networks (DNN), which can efficiently extract low-order intersections and high-order intersections. Comprehensive experiments were conducted on two standard datasets and the results demonstrate that the proposed approach outperforms existing credit scoring models in accuracy.

Suggested Citation

  • Li, Hongxiang & Feng, Ao & Lin, Bin & Su, Houcheng & Liu, Zixi & Duan, Xuliang & Pu, Haibo & Wang, Yifei, 2021. "A novel method for credit scoring based on feature transformation and ensemble model," Santa Cruz Department of Economics, Working Paper Series qt3v33k65c, Department of Economics, UC Santa Cruz.
  • Handle: RePEc:cdl:ucscec:qt3v33k65c
    as

    Download full text from publisher

    File URL: https://www.escholarship.org/uc/item/3v33k65c.pdf;origin=repeccitec
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Fahmida E. Moula & Chi Guotai & Mohammad Zoynul Abedin, 2017. "Credit default prediction modeling: an application of support vector machine," Risk Management, Palgrave Macmillan, vol. 19(2), pages 158-187, May.
    2. Lang, Jan Hannes & Peltonen, Tuomas A. & Sarlin, Peter, 2018. "A framework for early-warning modeling with an application to banks," Working Paper Series 2182, European Central Bank.
    3. Caruso, G. & Gattone, S.A. & Fortuna, F. & Di Battista, T., 2021. "Cluster Analysis for mixed data: An application to credit risk evaluation," Socio-Economic Planning Sciences, Elsevier, vol. 73(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Buckmann, Marcus & Gallego Marquez, Paula & Gimpelewicz, Mariana & Kapadia, Sujit & Rismanchi, Katie, 2023. "The more the merrier? Evidence on the value of multiple requirements in bank regulation," Journal of Banking & Finance, Elsevier, vol. 149(C).
    2. Zhiwu Zhou & Julián Alcalá & Víctor Yepes, 2021. "Optimized Application of Sustainable Development Strategy in International Engineering Project Management," Mathematics, MDPI, vol. 9(14), pages 1-29, July.
    3. Abedin, Mohammad Zoynul & Hajek, Petr & Sharif, Taimur & Satu, Md. Shahriare & Khan, Md. Imran, 2023. "Modelling bank customer behaviour using feature engineering and classification techniques," Research in International Business and Finance, Elsevier, vol. 65(C).
    4. Quentin Bro de Comères, 2022. "Predicting European Banks Distress Events: Do Financial Information Producers Matter?," Working Papers hal-03752678, HAL.
    5. Piotr Bańbuła & Marcin Pietrzak, 2021. "Early Warning Models of Banking Crises: VIX and High Profits," Central European Journal of Economic Modelling and Econometrics, Central European Journal of Economic Modelling and Econometrics, vol. 13(4), pages 381-403, December.
    6. Sun, Weixin & Zhang, Xuantao & Li, Minghao & Wang, Yong, 2023. "Interpretable high-stakes decision support system for credit default forecasting," Technological Forecasting and Social Change, Elsevier, vol. 196(C).
    7. Mr. Jorge A Chan-Lau, 2020. "UnFEAR: Unsupervised Feature Extraction Clustering with an Application to Crisis Regimes Classification," IMF Working Papers 2020/262, International Monetary Fund.
    8. Ángel Acevedo-Duque & Romel Gonzalez-Diaz & Elena Cachicatari Vargas & Anherys Paz-Marcano & Sheyla Muller-Pérez & Guido Salazar-Sepúlveda & Giulia Caruso & Idiano D’Adamo, 2021. "Resilience, Leadership and Female Entrepreneurship within the Context of SMEs: Evidence from Latin America," Sustainability, MDPI, vol. 13(15), pages 1-17, July.
    9. Acosta-Smith, Jonathan & Grill, Michael & Lang, Jan Hannes, 2024. "The leverage ratio, risk-taking and bank stability," Journal of Financial Stability, Elsevier, vol. 74(C).
    10. Lesly Lisset Ortiz-Cerezo & Alin Andrei Carsteanu & Julio Bernardo Clempner, 2022. "Sharpe-Ratio Portfolio in Controllable Markov Chains: Analytic and Algorithmic Approach for Second Order Cone Programming," Mathematics, MDPI, vol. 10(18), pages 1-13, September.
    11. Sonya Georgieva, 2023. "Application of Artificial Intelligence and Machine Learning in the Conduct of Monetary Policy by Central Banks," Economic Studies journal, Bulgarian Academy of Sciences - Economic Research Institute, issue 8, pages 177-199.
    12. Tarkocin, Coskun & Donduran, Murat, 2024. "Constructing early warning indicators for banks using machine learning models," The North American Journal of Economics and Finance, Elsevier, vol. 69(PB).
    13. Bai, Chunguang & Shi, Baofeng & Liu, Feng & Sarkis, Joseph, 2019. "Banking credit worthiness: Evaluating the complex relationships," Omega, Elsevier, vol. 83(C), pages 26-38.
    14. Michał Bernardelli & Zbigniew Korzeb & Paweł Niedziółka, 2022. "Does Fossil Fuel Financing Affect Banks’ ESG Ratings?," Energies, MDPI, vol. 15(4), pages 1-19, February.
    15. Mohammad S. Uddin & Guotai Chi & Mazin A. M. Al Janabi & Tabassum Habib, 2022. "Leveraging random forest in micro‐enterprises credit risk modelling for accuracy and interpretability," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 27(3), pages 3713-3729, July.
    16. Codruț-Florin Ivașcu, 2024. "Understanding Dividend Puzzle Using Machine Learning," Computational Economics, Springer;Society for Computational Economics, vol. 64(1), pages 161-179, July.
    17. Fiza Qureshi & Ali M. Kutan & Habib Hussain Khan & Saba Qureshi, 2019. "Equity fund flows, market returns, and market risk: evidence from China," Risk Management, Palgrave Macmillan, vol. 21(1), pages 48-71, March.
    18. Bouteska, Ahmed & Abedin, Mohammad Zoynul & Hajek, Petr & Yuan, Kunpeng, 2024. "Cryptocurrency price forecasting – A comparative analysis of ensemble learning and deep learning methods," International Review of Financial Analysis, Elsevier, vol. 92(C).
    19. Nicolás Caselli & Ricardo Soto & Broderick Crawford & Sergio Valdivia & Rodrigo Olivares, 2021. "A Self-Adaptive Cuckoo Search Algorithm Using a Machine Learning Technique," Mathematics, MDPI, vol. 9(16), pages 1-28, August.
    20. Belanes, Amel & Saâdaoui, Foued & Abedin, Mohammad Zoynul, 2024. "Potential diversification benefits: A comparative study of Islamic and conventional stock market indexes," Research in International Business and Finance, Elsevier, vol. 67(PA).

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cdl:ucscec:qt3v33k65c. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Lisa Schiff (email available below). General contact details of provider: https://edirc.repec.org/data/ecucsus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.