IDEAS home Printed from https://ideas.repec.org/p/cdl/ucscec/qt3v33k65c.html
   My bibliography  Save this paper

A novel method for credit scoring based on feature transformation and ensemble model

Author

Listed:
  • Li, Hongxiang
  • Feng, Ao
  • Lin, Bin
  • Su, Houcheng
  • Liu, Zixi
  • Duan, Xuliang
  • Pu, Haibo
  • Wang, Yifei

Abstract

Credit scoring is a very critical task for banks and other financial institutions, and it has become an important evaluation metric to distinguish potential defaulting users. In this paper, we propose a credit score prediction method based on feature transformation and ensemble model, which is essentially a cascade approach. The feature transformation process consisting of boosting trees (BT) and auto-encoders (AE) is employed to replace manual feature engineering and to solve the data imbalance problem. For the classification process, this paper designs a heterogeneous ensemble model by weighting the factorization machine (FM) and deep neural networks (DNN), which can efficiently extract low-order intersections and high-order intersections. Comprehensive experiments were conducted on two standard datasets and the results demonstrate that the proposed approach outperforms existing credit scoring models in accuracy.

Suggested Citation

  • Li, Hongxiang & Feng, Ao & Lin, Bin & Su, Houcheng & Liu, Zixi & Duan, Xuliang & Pu, Haibo & Wang, Yifei, 2021. "A novel method for credit scoring based on feature transformation and ensemble model," Santa Cruz Department of Economics, Working Paper Series qt3v33k65c, Department of Economics, UC Santa Cruz.
  • Handle: RePEc:cdl:ucscec:qt3v33k65c
    as

    Download full text from publisher

    File URL: https://www.escholarship.org/uc/item/3v33k65c.pdf;origin=repeccitec
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Fahmida E. Moula & Chi Guotai & Mohammad Zoynul Abedin, 2017. "Credit default prediction modeling: an application of support vector machine," Risk Management, Palgrave Macmillan, vol. 19(2), pages 158-187, May.
    2. Lang, Jan Hannes & Peltonen, Tuomas A. & Sarlin, Peter, 2018. "A framework for early-warning modeling with an application to banks," Working Paper Series 2182, European Central Bank.
    3. Caruso, G. & Gattone, S.A. & Fortuna, F. & Di Battista, T., 2021. "Cluster Analysis for mixed data: An application to credit risk evaluation," Socio-Economic Planning Sciences, Elsevier, vol. 73(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Abedin, Mohammad Zoynul & Hajek, Petr & Sharif, Taimur & Satu, Md. Shahriare & Khan, Md. Imran, 2023. "Modelling bank customer behaviour using feature engineering and classification techniques," Research in International Business and Finance, Elsevier, vol. 65(C).
    2. Mr. Jorge A Chan-Lau, 2020. "UnFEAR: Unsupervised Feature Extraction Clustering with an Application to Crisis Regimes Classification," IMF Working Papers 2020/262, International Monetary Fund.
    3. Mohammad Motasem ALrfai & Danilah Binti Salleh & Waeibrorheem Waemustafa, 2022. "Empirical Examination of Credit Risk Determinant of Commercial Banks in Jordan," Risks, MDPI, vol. 10(4), pages 1-11, April.
    4. Jacopo Carmassi & Sonja Dobkowitz & Johanne Evrard & Laura Parisi & André F Silva & Michael Wedow, 2020. "Completing the Banking Union with a European deposit insurance scheme: who is afraid of cross-subsidization?," Economic Policy, CEPR, CESifo, Sciences Po;CES;MSH, vol. 35(101), pages 41-95.
    5. Jiang Hua & Tonglin Hao & Liangcai Zeng & Gui Yu, 2021. "YOLOMask, an Instance Segmentation Algorithm Based on Complementary Fusion Network," Mathematics, MDPI, vol. 9(15), pages 1-12, July.
    6. Maisa Cardoso Aniceto & Flavio Barboza & Herbert Kimura, 2020. "Machine learning predictivity applied to consumer creditworthiness," Future Business Journal, Springer, vol. 6(1), pages 1-14, December.
    7. Quentin Bro de Comères, 2022. "Predicting European Banks Distress Events: Do Financial Information Producers Matter?," Working Papers hal-03752678, HAL.
    8. Piotr Bańbuła & Marcin Pietrzak, 2021. "Early Warning Models of Banking Crises: VIX and High Profits," Central European Journal of Economic Modelling and Econometrics, Central European Journal of Economic Modelling and Econometrics, vol. 13(4), pages 381-403, December.
    9. Lesly Lisset Ortiz-Cerezo & Alin Andrei Carsteanu & Julio Bernardo Clempner, 2022. "Sharpe-Ratio Portfolio in Controllable Markov Chains: Analytic and Algorithmic Approach for Second Order Cone Programming," Mathematics, MDPI, vol. 10(18), pages 1-13, September.
    10. Mohammad S. Uddin & Guotai Chi & Mazin A. M. Al Janabi & Tabassum Habib, 2022. "Leveraging random forest in micro‐enterprises credit risk modelling for accuracy and interpretability," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 27(3), pages 3713-3729, July.
    11. Bouteska, Ahmed & Abedin, Mohammad Zoynul & Hajek, Petr & Yuan, Kunpeng, 2024. "Cryptocurrency price forecasting – A comparative analysis of ensemble learning and deep learning methods," International Review of Financial Analysis, Elsevier, vol. 92(C).
    12. Jorge E. Galán, 2021. "CREWS: a CAMELS-based early warning system of systemic risk in the banking sector," Occasional Papers 2132, Banco de España.
    13. Fabrizio Ferriani & Wanda Cornacchia & Paolo Farroni & Eliana Ferrara & Francesco Guarino & Francesco Pisanti, 2019. "An early warning system for less significant Italian banks," Questioni di Economia e Finanza (Occasional Papers) 480, Bank of Italy, Economic Research and International Relations Area.
    14. Sun, Weixin & Zhang, Xuantao & Li, Minghao & Wang, Yong, 2023. "Interpretable high-stakes decision support system for credit default forecasting," Technological Forecasting and Social Change, Elsevier, vol. 196(C).
    15. Smith, Jonathan Acosta & Grill, Michael & Lang, Jan Hannes, 2017. "The leverage ratio, risk-taking and bank stability," Working Paper Series 2079, European Central Bank.
    16. Tarkocin, Coskun & Donduran, Murat, 2024. "Constructing early warning indicators for banks using machine learning models," The North American Journal of Economics and Finance, Elsevier, vol. 69(PB).
    17. Michał Bernardelli & Zbigniew Korzeb & Paweł Niedziółka, 2022. "Does Fossil Fuel Financing Affect Banks’ ESG Ratings?," Energies, MDPI, vol. 15(4), pages 1-19, February.
    18. Fiza Qureshi & Ali M. Kutan & Habib Hussain Khan & Saba Qureshi, 2019. "Equity fund flows, market returns, and market risk: evidence from China," Risk Management, Palgrave Macmillan, vol. 21(1), pages 48-71, March.
    19. Belanes, Amel & Saâdaoui, Foued & Abedin, Mohammad Zoynul, 2024. "Potential diversification benefits: A comparative study of Islamic and conventional stock market indexes," Research in International Business and Finance, Elsevier, vol. 67(PA).
    20. Moritz von Zahn & Jan Zacharias & Maximilian Lowin & Johannes Chen & Oliver Hinz, 2025. "Navigating AI conformity: A design framework to assess fairness, explainability, and performance," Electronic Markets, Springer;IIM University of St. Gallen, vol. 35(1), pages 1-24, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cdl:ucscec:qt3v33k65c. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Lisa Schiff (email available below). General contact details of provider: https://edirc.repec.org/data/ecucsus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.