IDEAS home Printed from https://ideas.repec.org/a/wly/jforec/v41y2022i8p1669-1690.html
   My bibliography  Save this article

Deep learning meets decision trees: An application of a heterogeneous deep forest approach in credit scoring for online consumer lending

Author

Listed:
  • Yufei Xia
  • Xinyi Guo
  • Yinguo Li
  • Lingyun He
  • Xueyuan Chen

Abstract

Online consumer lending has recently been growing rapidly, but it faces high credit risk. For this problem, developing powerful credit scoring models has become an effective solution and can be achieved from three aspects: modeling approach, data source, and evaluation measure. This paper proposes a novel model that departs from those in previous studies in threefold. First, a heterogeneous deep forest model that combines deep learning architecture and tree‐based ensemble classifiers is proposed as the modeling approach. Second, a Bayesian‐based macroeconomic variable optimization method is developed to determine the macroeconomic variables and the corresponding lag term, and the selected macroeconomic variables are used as supplementary data source for modeling. Lastly, a series of capital charge error measures is proposed to evaluate credit scoring models from a regulatory perspective. The proposal is evaluated on multiple large datasets under performance measures on predictive accuracy, profitability, and capital charge errors. Frequentist and Bayesian nonparametric significance tests are used to examine the statistical significance of heterogeneous deep forest and benchmarks. Three main conclusions can be reached from the comparison. First, heterogeneous deep forest significantly outperforms the industry benchmarks over all the evaluation measures. Second, the predictive performance is enhanced after incorporating the selected macroeconomic variables and the corresponding lag, and the result remains robust under cross‐validation and forward‐chaining validation. Third, the capital charge errors reflect the model performance from a regulatory perspective and thus lead to different rankings from those when evaluating predictive accuracy and profitability.

Suggested Citation

  • Yufei Xia & Xinyi Guo & Yinguo Li & Lingyun He & Xueyuan Chen, 2022. "Deep learning meets decision trees: An application of a heterogeneous deep forest approach in credit scoring for online consumer lending," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(8), pages 1669-1690, December.
  • Handle: RePEc:wly:jforec:v:41:y:2022:i:8:p:1669-1690
    DOI: 10.1002/for.2891
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/for.2891
    Download Restriction: no

    File URL: https://libkey.io/10.1002/for.2891?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Gunnarsson, Björn Rafn & vanden Broucke, Seppe & Baesens, Bart & Óskarsdóttir, María & Lemahieu, Wilfried, 2021. "Deep learning for credit scoring: Do or don’t?," European Journal of Operational Research, Elsevier, vol. 295(1), pages 292-305.
    2. de Andrade, Fabio Wendling Muniz & Thomas, Lyn, 2007. "Structural models in consumer credit," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1569-1581, December.
    3. Peterson K. Ozili, 2018. "Impact of digital finance on financial inclusion and stability," Borsa Istanbul Review, Research and Business Development Department, Borsa Istanbul, vol. 18(4), pages 329-340, December.
    4. Buchak, Greg & Matvos, Gregor & Piskorski, Tomasz & Seru, Amit, 2018. "Fintech, regulatory arbitrage, and the rise of shadow banks," Journal of Financial Economics, Elsevier, vol. 130(3), pages 453-483.
    5. Thomas, Lyn C., 2000. "A survey of credit and behavioural scoring: forecasting financial risk of lending to consumers," International Journal of Forecasting, Elsevier, vol. 16(2), pages 149-172.
    6. Eisenbeis, Robert A, 1977. "Pitfalls in the Application of Discriminant Analysis in Business, Finance, and Economics," Journal of Finance, American Finance Association, vol. 32(3), pages 875-900, June.
    7. T Bellotti & J Crook, 2009. "Credit scoring with macroeconomic variables using survival analysis," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 60(12), pages 1699-1707, December.
    8. Hurlin, Christophe & Leymarie, Jérémy & Patin, Antoine, 2018. "Loss functions for Loss Given Default model comparison," European Journal of Operational Research, Elsevier, vol. 268(1), pages 348-360.
    9. Lore Dirick & Tony Bellotti & Gerda Claeskens & Bart Baesens, 2019. "Macro-Economic Factors in Credit Risk Calculations: Including Time-Varying Covariates in Mixture Cure Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 37(1), pages 40-53, January.
    10. Ozili, Peterson Kitakogelu, 2018. "Impact of Digital Finance on Financial Inclusion and Stability," MPRA Paper 84771, University Library of Munich, Germany.
    11. Dayu Xu & Xuyao Zhang & Hailin Feng, 2019. "Generalized fuzzy soft sets theory‐based novel hybrid ensemble credit scoring model," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 24(2), pages 903-921, April.
    12. Xia, Yufei & Zhao, Junhao & He, Lingyun & Li, Yinguo & Yang, Xiaoli, 2021. "Forecasting loss given default for peer-to-peer loans via heterogeneous stacking ensemble approach," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1590-1613.
    13. Tobias Berg & Valentin Burg & Ana Gombović & Manju Puri, 2020. "On the Rise of FinTechs: Credit Scoring Using Digital Footprints," The Review of Financial Studies, Society for Financial Studies, vol. 33(7), pages 2845-2897.
    14. Liran Einav & Mark Jenkins & Jonathan Levin, 2013. "The impact of credit scoring on consumer lending," RAND Journal of Economics, RAND Corporation, vol. 44(2), pages 249-274, June.
    15. Christian Lohmann & Thorsten Ohliger, 2019. "The total cost of misclassification in credit scoring: A comparison of generalized linear models and generalized additive models," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 38(5), pages 375-389, August.
    16. Paulo Cesar Schotten & Danielle Costa Morais, 2019. "A group decision model for credit granting in the financial market," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 5(1), pages 1-19, December.
    17. Djeundje, Viani Biatat & Crook, Jonathan, 2018. "Incorporating heterogeneity and macroeconomic variables into multi-state delinquency models for credit cards," European Journal of Operational Research, Elsevier, vol. 271(2), pages 697-709.
    18. Lean Yu & Xinxie Li & Ling Tang & Zongyi Zhang & Gang Kou, 2015. "Social credit: a comprehensive literature review," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 1(1), pages 1-18, December.
    19. Bertsch, Christoph & Hull, Isaiah & Qi, Yingjie & Zhang, Xin, 2020. "Bank misconduct and online lending," Journal of Banking & Finance, Elsevier, vol. 116(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sunghun Chung & Keongtae Kim & Chul Ho Lee & Wonseok Oh, 2023. "Interdependence between online peer‐to‐peer lending and cryptocurrency markets and its effects on financial inclusion," Production and Operations Management, Production and Operations Management Society, vol. 32(6), pages 1939-1957, June.
    2. Doumpos, Michalis & Zopounidis, Constantin & Gounopoulos, Dimitrios & Platanakis, Emmanouil & Zhang, Wenke, 2023. "Operational research and artificial intelligence methods in banking," European Journal of Operational Research, Elsevier, vol. 306(1), pages 1-16.
    3. Dirick, Lore & Claeskens, Gerda & Vasnev, Andrey & Baesens, Bart, 2022. "A hierarchical mixture cure model with unobserved heterogeneity for credit risk," Econometrics and Statistics, Elsevier, vol. 22(C), pages 39-55.
    4. José María Liberti & Mitchell A. Petersen, 2018. "Information: Hard and Soft," NBER Working Papers 25075, National Bureau of Economic Research, Inc.
    5. Medina-Olivares, Victor & Calabrese, Raffaella & Dong, Yizhe & Shi, Baofeng, 2022. "Spatial dependence in microfinance credit default," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1071-1085.
    6. Li, Xin & Shao, Xuefeng & Chang, Tsangyao & Albu, Lucian Liviu, 2022. "Does digital finance promote the green innovation of China's listed companies?," Energy Economics, Elsevier, vol. 114(C).
    7. Victor Medina-Olivares & Finn Lindgren & Raffaella Calabrese & Jonathan Crook, 2023. "Joint model for longitudinal and spatio-temporal survival data," Papers 2311.04008, arXiv.org.
    8. Lei Lu & Jianxing Wei & Weixing Wu & Yi Zhou, 2023. "Pricing strategies in BigTech lending: Evidence from China," Financial Management, Financial Management Association International, vol. 52(2), pages 333-374, June.
    9. Zhang, Dongyang, 2023. "Can digital finance empowerment reduce extreme ESG hypocrisy resistance to improve green innovation?," Energy Economics, Elsevier, vol. 125(C).
    10. Mao, Fengfu & Wang, Yuanfan & Zhu, Mengsi, 2023. "Digital financial inclusion, traditional finance system and household entrepreneurship," Pacific-Basin Finance Journal, Elsevier, vol. 80(C).
    11. Medina-Olivares, Victor & Calabrese, Raffaella & Crook, Jonathan & Lindgren, Finn, 2023. "Joint models for longitudinal and discrete survival data in credit scoring," European Journal of Operational Research, Elsevier, vol. 307(3), pages 1457-1473.
    12. Luo, Sumei & Sun, Yongkun & Zhou, Rui, 2022. "Can fintech innovation promote household consumption? Evidence from China family panel studies," International Review of Financial Analysis, Elsevier, vol. 82(C).
    13. Medina-Olivares, Victor & Lindgren, Finn & Calabrese, Raffaella & Crook, Jonathan, 2023. "Joint models of multivariate longitudinal outcomes and discrete survival data with INLA: An application to credit repayment behaviour," European Journal of Operational Research, Elsevier, vol. 310(2), pages 860-873.
    14. PU, Zhengning & FEI, Jinhua, 2022. "The impact of digital finance on residential carbon emissions: Evidence from China," Structural Change and Economic Dynamics, Elsevier, vol. 63(C), pages 515-527.
    15. Haibo Lei & Qin Su, 2023. "Does the Use of Digital Finance Affect Household Farmland Transfer-Out?," Sustainability, MDPI, vol. 15(16), pages 1-18, August.
    16. Ozili, Peterson K, 2021. "Bank non-performing loans in the Fintech era," MPRA Paper 113467, University Library of Munich, Germany.
    17. Krzysztof Waliszewski & Ewa Cichowicz & £ukasz Gêbski & Filip Kliber & Jakub Kubiczek & Pawe³ Niedzió³ka & Ma³gorzata Solarz & Anna Warchlewska, 2023. "The role of the Lendtech sector in the consumer credit market in the context of household financial exclusion," Oeconomia Copernicana, Institute of Economic Research, vol. 14(2), pages 609-643, June.
    18. Mateusz Folwarski, 2021. "The FinTech Sector and Aspects on the Financial Inclusion of the Society in EU Countries," European Research Studies Journal, European Research Studies Journal, vol. 0(Special 1), pages 459-467.
    19. Yi Yang & Shuhe Shi & Jingjing Wu, 2022. "Digital Financial Inclusion to Corporation Value: The Mediating Effect of Ambidextrous Innovation," Sustainability, MDPI, vol. 14(24), pages 1-23, December.
    20. Dongjing Chen & Xiaotong Guo, 2023. "Impact of the Digital Economy and Financial Development on Residents’ Consumption Upgrading: Evidence from Mainland China," Sustainability, MDPI, vol. 15(10), pages 1-25, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:jforec:v:41:y:2022:i:8:p:1669-1690. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www3.interscience.wiley.com/cgi-bin/jhome/2966 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.