IDEAS home Printed from https://ideas.repec.org/a/eee/finana/v79y2022ics1057521921002878.html
   My bibliography  Save this article

Applying machine learning algorithms to predict default probability in the online credit market: Evidence from China

Author

Listed:
  • Liu, Yi
  • Yang, Menglong
  • Wang, Yudong
  • Li, Yongshan
  • Xiong, Tiancheng
  • Li, Anzhe

Abstract

Using data from Renrendai and three machine learning algorithms, namely, k-nearest neighbor, support vector machine, and random forest, we predicted the default probability of online loan borrowers and compared their prediction performance with that of a logistic model. The results show that, first, based on the AUC (area under the ROC curve) value, accuracy rate and Brier score, the machine learning models can accurately predict the default risk of online borrowers. Second, the integrated discrimination improvement (IDI) test results show that the prediction performance of the machine learning algorithms is significantly better than that of the logistic model. Third, after constructing the investor profit function with misclassification cost, we find that the machine learning algorithms can provide more benefits to investors.

Suggested Citation

  • Liu, Yi & Yang, Menglong & Wang, Yudong & Li, Yongshan & Xiong, Tiancheng & Li, Anzhe, 2022. "Applying machine learning algorithms to predict default probability in the online credit market: Evidence from China," International Review of Financial Analysis, Elsevier, vol. 79(C).
  • Handle: RePEc:eee:finana:v:79:y:2022:i:c:s1057521921002878
    DOI: 10.1016/j.irfa.2021.101971
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1057521921002878
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.irfa.2021.101971?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Guo, Yanhong & Zhou, Wenjun & Luo, Chunyu & Liu, Chuanren & Xiong, Hui, 2016. "Instance-based credit risk assessment for investment decisions in P2P lending," European Journal of Operational Research, Elsevier, vol. 249(2), pages 417-426.
    2. Wiginton, John C., 1980. "A Note on the Comparison of Logit and Discriminant Models of Consumer Credit Behavior," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 15(3), pages 757-770, September.
    3. Viaene, Stijn & Dedene, Guido, 2005. "Cost-sensitive learning and decision making revisited," European Journal of Operational Research, Elsevier, vol. 166(1), pages 212-220, October.
    4. Teply, Petr & Polena, Michal, 2020. "Best classification algorithms in peer-to-peer lending," The North American Journal of Economics and Finance, Elsevier, vol. 51(C).
    5. Chen, Xiao & Huang, Bihong & Ye, Dezhu, 2020. "Gender gap in peer-to-peer lending: Evidence from China," Journal of Banking & Finance, Elsevier, vol. 112(C).
    6. He, Feng & Qin, Shuqi & Zhang, Xiaotao, 2021. "Investor attention and platform interest rate in Chinese peer-to-peer lending market," Finance Research Letters, Elsevier, vol. 39(C).
    7. Yi Liu & Quanli Zhou & Xuan Zhao & Yudong Wang, 2018. "Can Listing Information Indicate Borrower Credit Risk in Online Peer-to-Peer Lending?," Emerging Markets Finance and Trade, Taylor & Francis Journals, vol. 54(13), pages 2982-2994, October.
    8. Akkoç, Soner, 2012. "An empirical comparison of conventional techniques, neural networks and the three stage hybrid Adaptive Neuro Fuzzy Inference System (ANFIS) model for credit scoring analysis: The case of Turkish cred," European Journal of Operational Research, Elsevier, vol. 222(1), pages 168-178.
    9. Chen, Shiyi & Gu, Yan & Liu, Qingfu & Tse, Yiuman, 2020. "How do lenders evaluate borrowers in peer-to-peer lending in China?," International Review of Economics & Finance, Elsevier, vol. 69(C), pages 651-662.
    10. Chen, Xiao & Huang, Bihong & Ye, Dezhu, 2018. "The role of punctuation in P2P lending: Evidence from China," Economic Modelling, Elsevier, vol. 68(C), pages 634-643.
    11. Xuchen Lin & Xiaolong Li & Zhong Zheng, 2017. "Evaluating borrower’s default risk in peer-to-peer lending: evidence from a lending platform in China," Applied Economics, Taylor & Francis Journals, vol. 49(35), pages 3538-3545, July.
    12. Yu, Lean & Huang, Xiaowen & Yin, Hang, 2020. "Can machine learning paradigm improve attribute noise problem in credit risk classification?," International Review of Economics & Finance, Elsevier, vol. 70(C), pages 440-455.
    13. Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.
    14. Qizhi Tao & Yizhe Dong & Ziming Lin, 2017. "Who can get money? Evidence from the Chinese peer-to-peer lending platform," Information Systems Frontiers, Springer, vol. 19(3), pages 425-441, June.
    15. Crook, Jonathan N. & Edelman, David B. & Thomas, Lyn C., 2007. "Recent developments in consumer credit risk assessment," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1447-1465, December.
    16. Riza Emekter & Yanbin Tu & Benjamas Jirasakuldech & Min Lu, 2015. "Evaluating credit risk and loan performance in online Peer-to-Peer (P2P) lending," Applied Economics, Taylor & Francis Journals, vol. 47(1), pages 54-70, January.
    17. Jiang, Cuiqing & Wang, Zhao & Zhao, Huimin, 2019. "A prediction-driven mixture cure model and its application in credit scoring," European Journal of Operational Research, Elsevier, vol. 277(1), pages 20-31.
    18. Gunnarsson, Björn Rafn & vanden Broucke, Seppe & Baesens, Bart & Óskarsdóttir, María & Lemahieu, Wilfried, 2021. "Deep learning for credit scoring: Do or don’t?," European Journal of Operational Research, Elsevier, vol. 295(1), pages 292-305.
    19. Zhou, Jing & Li, Wei & Wang, Jiaxin & Ding, Shuai & Xia, Chengyi, 2019. "Default prediction in P2P lending from high-dimensional data based on machine learning," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 534(C).
    20. Yu, Lean & Yao, Xiao & Zhang, Xiaoming & Yin, Hang & Liu, Jia, 2020. "A novel dual-weighted fuzzy proximal support vector machine with application to credit risk analysis," International Review of Financial Analysis, Elsevier, vol. 71(C).
    21. Khandani, Amir E. & Kim, Adlar J. & Lo, Andrew W., 2010. "Consumer credit-risk models via machine-learning algorithms," Journal of Banking & Finance, Elsevier, vol. 34(11), pages 2767-2787, November.
    22. Qizhi Tao & Yizhe Dong & Ziming Lin, 0. "Who can get money? Evidence from the Chinese peer-to-peer lending platform," Information Systems Frontiers, Springer, vol. 0, pages 1-17.
    23. Chen, Rongda & Chen, Xinhao & Jin, Chenglu & Chen, Yiyang & Chen, Jiayi, 2020. "Credit rating of online lending borrowers using recovery rates," International Review of Economics & Finance, Elsevier, vol. 68(C), pages 204-216.
    24. Chen, Jia & Jiang, Jiajun & Liu, Yu-jane, 2018. "Financial literacy and gender difference in loan performance," Journal of Empirical Finance, Elsevier, vol. 48(C), pages 307-320.
    25. Jefferson Duarte & Stephan Siegel & Lance Young, 2012. "Trust and Credit: The Role of Appearance in Peer-to-peer Lending," The Review of Financial Studies, Society for Financial Studies, vol. 25(8), pages 2455-2484.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Abedin, Mohammad Zoynul & Hajek, Petr & Sharif, Taimur & Satu, Md. Shahriare & Khan, Md. Imran, 2023. "Modelling bank customer behaviour using feature engineering and classification techniques," Research in International Business and Finance, Elsevier, vol. 65(C).
    2. Aslam, Faheem & Hunjra, Ahmed Imran & Ftiti, Zied & Louhichi, Wael & Shams, Tahira, 2022. "Insurance fraud detection: Evidence from artificial intelligence and machine learning," Research in International Business and Finance, Elsevier, vol. 62(C).
    3. Wang, Dan & Chen, Zhi & Florescu, Ionuţ & Wen, Bingyang, 2023. "A sparsity algorithm for finding optimal counterfactual explanations: Application to corporate credit rating," Research in International Business and Finance, Elsevier, vol. 64(C).
    4. Bolívar, Fernando & Duran, Miguel A. & Lozano-Vivas, Ana, 2023. "Business model contributions to bank profit performance: A machine learning approach," Research in International Business and Finance, Elsevier, vol. 64(C).
    5. Zhou, Ying & Shen, Long & Ballester, Laura, 2023. "A two-stage credit scoring model based on random forest: Evidence from Chinese small firms," International Review of Financial Analysis, Elsevier, vol. 89(C).
    6. Chen, Dangxing & Ye, Jiahui & Ye, Weicheng, 2023. "Interpretable selective learning in credit risk," Research in International Business and Finance, Elsevier, vol. 65(C).
    7. Bitetto, Alessandro & Cerchiello, Paola & Mertzanis, Charilaos, 2023. "Measuring financial soundness around the world: A machine learning approach," International Review of Financial Analysis, Elsevier, vol. 85(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sha, Yezhou, 2022. "Rating manipulation and creditworthiness for platform economy: Evidence from peer-to-peer lending," International Review of Financial Analysis, Elsevier, vol. 84(C).
    2. Mengyin Li & Phillip H. Phan & Xian Sun, 2021. "Business Friendliness: A Double-Edged Sword," Sustainability, MDPI, vol. 13(4), pages 1-22, February.
    3. Štefan Lyócsa & Petra Vašaničová & Branka Hadji Misheva & Marko Dávid Vateha, 2022. "Default or profit scoring credit systems? Evidence from European and US peer-to-peer lending markets," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-21, December.
    4. Chen, Pei-Fen & Lo, Shihmin & Tang, Hai-Yuan, 2022. "What if borrowers stop paying their loans? Investors’ rates of return on a peer-to-peer lending platform," International Review of Economics & Finance, Elsevier, vol. 77(C), pages 359-377.
    5. Li, Jianwen & Zhang, Bo & Jiang, Mingming & Hu, Jinyan, 2023. "Homophilous intensity in the online lending market: Bidding behavior and economic effects," Journal of Banking & Finance, Elsevier, vol. 152(C).
    6. Dongwoo Kim, 2023. "Can investors’ collective decision-making evolve? Evidence from peer-to-peer lending markets," Electronic Commerce Research, Springer, vol. 23(2), pages 1323-1358, June.
    7. Pankaj Kumar Maskara & Emre Kuvvet & Gengxuan Chen, 2021. "The role of P2P platforms in enhancing financial inclusion in the United States: An analysis of peer‐to‐peer lending across the rural–urban divide," Financial Management, Financial Management Association International, vol. 50(3), pages 747-774, September.
    8. Kriebel, Johannes & Stitz, Lennart, 2022. "Credit default prediction from user-generated text in peer-to-peer lending using deep learning," European Journal of Operational Research, Elsevier, vol. 302(1), pages 309-323.
    9. Ting Sun & Miklos A. Vasarhelyi, 2018. "Predicting credit card delinquencies: An application of deep neural networks," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 25(4), pages 174-189, October.
    10. Wu, Bao & Liu, Zijia & Gu, Qiuyang & Tsai, Fu-Sheng, 2023. "Underdog mentality, identity discrimination and access to peer-to-peer lending market: Exploring effects of digital authentication," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 83(C).
    11. Gaigalienė Asta & Česnys Dovydas, 2018. "Determinants of Default in Lithuanian Peer-To-Peer Platforms," Management of Organizations: Systematic Research, Sciendo, vol. 80(1), pages 19-36, December.
    12. Huang, Jin & Sena, Vania & Li, Jun & Ozdemir, Sena, 2021. "Message framing in P2P lending relationships," Journal of Business Research, Elsevier, vol. 122(C), pages 761-773.
    13. Serena Gallo, 2021. "Fintech platforms: Lax or careful borrowers’ screening?," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 7(1), pages 1-33, December.
    14. Qun Chen & Ji-Wen Li & Jian-Guo Liu & Jing-Ti Han & Yun Shi & Xun-Hua Guo, 2021. "Borrower Learning Effects: Do Prior Experiences Promote Continuous Successes in Peer-to-Peer Lending?," Information Systems Frontiers, Springer, vol. 23(4), pages 963-986, August.
    15. Qun Chen & Ji-Wen Li & Jian-Guo Liu & Jing-Ti Han & Yun Shi & Xun-Hua Guo, 0. "Borrower Learning Effects: Do Prior Experiences Promote Continuous Successes in Peer-to-Peer Lending?," Information Systems Frontiers, Springer, vol. 0, pages 1-24.
    16. Jiang, Cuixia & Xu, Qifa & Zhang, Weiming & Li, Mengting & Yang, Shanlin, 2018. "Does automatic bidding mechanism affect herding behavior? Evidence from online P2P lending in China," Journal of Behavioral and Experimental Finance, Elsevier, vol. 20(C), pages 39-44.
    17. Samuel Ribeiro-Navarrete & Juan Piñeiro-Chousa & M. Ángeles López-Cabarcos & Daniel Palacios-Marqués, 2022. "Crowdlending: mapping the core literature and research frontiers," Review of Managerial Science, Springer, vol. 16(8), pages 2381-2411, November.
    18. Li, Yibei & Wang, Ximei & Djehiche, Boualem & Hu, Xiaoming, 2020. "Credit scoring by incorporating dynamic networked information," European Journal of Operational Research, Elsevier, vol. 286(3), pages 1103-1112.
    19. Gunnarsson, Björn Rafn & vanden Broucke, Seppe & Baesens, Bart & Óskarsdóttir, María & Lemahieu, Wilfried, 2021. "Deep learning for credit scoring: Do or don’t?," European Journal of Operational Research, Elsevier, vol. 295(1), pages 292-305.
    20. Yanhong Guo & Shuai Jiang & Wenjun Zhou & Chunyu Luo & Hui Xiong, 2021. "A predictive indicator using lender composition for loan evaluation in P2P lending," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 7(1), pages 1-24, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:finana:v:79:y:2022:i:c:s1057521921002878. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/620166 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.