Printed from https://ideas.repec.org/a/gam/jfinte/v3y2024i1p12-215d1351432.html

Reimagining Peer-to-Peer Lending Sustainability: Unveiling Predictive Insights with Innovative Machine Learning Approaches for Loan Default Anticipation

Author

Listed:
  • Ly Nguyen

    (Emissis Ltd., 2 Ellerbeck Court, Stockley Business Park, Middlesbrough TS9 5PT, UK)

  • Mominul Ahsan

    (Department of Computer Science, University of York, Deramore Lane, York YO10 5GH, UK)

  • Julfikar Haider

    (Department of Engineering, Manchester Metropolitan University, John Dalton Building, Chester Street, Manchester M1 5GD, UK)

Abstract

Peer-to-peer (P2P) lending, a novel element of Internet finance that links lenders and borrowers via online platforms, has generated large profits for investors. However, borrowers’ missed payments have undermined the industry’s sustainable growth, so a system that can accurately predict loan defaults is needed to limit the damage caused by defaulters. This study addresses a gap in the literature by exploring the feasibility of developing prediction models for P2P loan defaults without relying heavily on personal data, while identifying the key variables that influence borrowers’ repayment capacity through systematic feature selection and exploratory data analysis. It aims to create a computational model that helps lenders decide whether to approve or reject a loan application based on the financial data provided by applicants. The selected dataset, sourced from an open database, contains 8578 transaction records and 14 attributes related to financial information, with no personal data included. The loan dataset was first subjected to an in-depth exploratory data analysis to find behaviors connected to loan defaults. Subsequently, several machine learning classification algorithms (Random Forest, Support Vector Machine, Decision Tree, Logistic Regression, Naïve Bayes, and XGBoost) were used to build models that distinguish borrowers who repay their loans from those who do not. The findings indicate that borrowers who fail to comply with their lenders’ credit policies, pay elevated interest rates, and possess low FICO ratings are more likely to default. Elevated risk is also observed among clients who obtain loans for small businesses. All classification models, including XGBoost and Random Forest, were successfully developed, performed satisfactorily, and achieved an accuracy of over 80%. When the decision threshold is set to 0.4, logistic regression gives the best performance for predicting loan defaulters: it identifies 83% of the defaulted loans (a recall of 83%), with a precision of 21% and an F1 score of 33%.
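
The threshold and metrics quoted in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the toy probabilities and labels, and the choice to treat a probability equal to the threshold as a default are all assumptions made for illustration. It shows how a 0.4 cut-off turns predicted default probabilities into labels, and how precision, recall, and F1 follow from those labels.

```python
from typing import List, Sequence, Tuple


def classify(probabilities: Sequence[float], threshold: float = 0.4) -> List[int]:
    """Label a loan as a default (1) when its predicted probability reaches the threshold.

    Using >= rather than > at the boundary is an assumption of this sketch.
    """
    return [1 if p >= threshold else 0 for p in probabilities]


def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def precision_recall_f1(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float, float]:
    """Compute precision, recall, and F1 for the positive (default) class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall, f1_score(precision, recall)


# Hypothetical toy data, not taken from the study's dataset.
toy_probabilities = [0.10, 0.45, 0.42, 0.80, 0.41, 0.20]
toy_labels = [0, 1, 0, 1, 0, 0]  # 1 = loan defaulted

for cutoff in (0.5, 0.4):
    predictions = classify(toy_probabilities, cutoff)
    p, r, f1 = precision_recall_f1(toy_labels, predictions)
    print(f"threshold={cutoff}: precision={p:.2f} recall={r:.2f} f1={f1:.2f}")

# F1 implied by the rounded precision and recall quoted in the abstract.
print(f"F1 from precision=0.21, recall=0.83: {f1_score(0.21, 0.83):.3f}")
```

On the toy data, lowering the cut-off from 0.5 to 0.4 flags more loans as defaults, which raises recall at the cost of precision. This is the same trade-off the abstract reports for logistic regression. Plugging the rounded figures of 21% precision and 83% recall into the F1 formula gives roughly 0.335, in line with the reported F1 score of 33% once rounding of the inputs is allowed for.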

Suggested Citation

  • Ly Nguyen & Mominul Ahsan & Julfikar Haider, 2024. "Reimagining Peer-to-Peer Lending Sustainability: Unveiling Predictive Insights with Innovative Machine Learning Approaches for Loan Default Anticipation," FinTech, MDPI, vol. 3(1), pages 1-32, March.
  • Handle: RePEc:gam:jfinte:v:3:y:2024:i:1:p:12-215:d:1351432

    Download full text from publisher

    File URL: https://www.mdpi.com/2674-1032/3/1/12/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2674-1032/3/1/12/
    Download Restriction: no

    References listed on IDEAS

    1. Adam Nowak & Amanda Ross & Christopher Yencha, 2018. "Small Business Borrowing And Peer‐To‐Peer Lending: Evidence From Lending Club," Contemporary Economic Policy, Western Economic Association International, vol. 36(2), pages 318-336, April.
    2. Cuiqing Jiang & Zhao Wang & Ruiya Wang & Yong Ding, 2018. "Loan default prediction by combining soft information extracted from descriptive text in online peer-to-peer lending," Annals of Operations Research, Springer, vol. 266(1), pages 511-529, July.
    3. Seth Freedman & Ginger Zhe Jin, 2008. "Do Social Networks Solve Information Problems for Peer-to-Peer Lending? Evidence from Prosper.com," Working Papers 08-43, NET Institute.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gregor Dorfleitner & Eva-Maria Oswald & Rongxin Zhang, 2021. "From Credit Risk to Social Impact: On the Funding Determinants in Interest-Free Peer-to-Peer Lending," Journal of Business Ethics, Springer, vol. 170(2), pages 375-400, May.
    2. Christopher Gerling & Stefan Lessmann, 2023. "Multimodal Document Analytics for Banking Process Automation," Papers 2307.11845, arXiv.org, revised Nov 2023.
    3. Ejaz Ghani & William R. Kerr & Christopher Stanton, 2014. "Diasporas and Outsourcing: Evidence from oDesk and India," Management Science, INFORMS, vol. 60(7), pages 1677-1697, July.
    4. Xueru Chen & Xiaoji Hu & Shenglin Ben, 2021. "How do reputation, structure design and FinTech ecosystem affect the net cash inflow of P2P lending platforms? Evidence from China," Electronic Commerce Research, Springer, vol. 21(4), pages 1055-1082, December.
    5. Wangcheng Yan & Wenjun Zhou, 2023. "Is blockchain a cure for peer-to-peer lending?," Annals of Operations Research, Springer, vol. 321(1), pages 693-716, February.
    6. Kovacs, Attila, 2018. "Gender Differences in Equity Crowdfunding," OSF Preprints 5pcmb, Center for Open Science.
    7. Yanhong Guo & Shuai Jiang & Wenjun Zhou & Chunyu Luo & Hui Xiong, 2021. "A predictive indicator using lender composition for loan evaluation in P2P lending," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 7(1), pages 1-24, December.
    8. Kriebel, Johannes & Stitz, Lennart, 2022. "Credit default prediction from user-generated text in peer-to-peer lending using deep learning," European Journal of Operational Research, Elsevier, vol. 302(1), pages 309-323.
    9. repec:zbw:bofrdp:urn:nbn:fi:bof-201511261452 is not listed on IDEAS
    10. Jiang, Cuiqing & Lyu, Ximei & Yuan, Yufei & Wang, Zhao & Ding, Yong, 2022. "Mining semantic features in current reports for financial distress prediction: Empirical evidence from unlisted public firms in China," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1086-1099.
    11. Soumajyoti Sarkar & Hamidreza Alvari, 2020. "Mitigating Bias in Online Microfinance Platforms: A Case Study on Kiva.org," Papers 2006.12995, arXiv.org.
    12. Yufei Xia & Lingyun He & Yinguo Li & Nana Liu & Yanlin Ding, 2020. "Predicting loan default in peer‐to‐peer lending using narrative data," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(2), pages 260-280, March.
    13. Xiaoyu Li & Jiahong Yuan & Yan Shi & Zilai Sun & Junhu Ruan, 2020. "Emerging Trends and Innovation Modes of Internet Finance—Results from Co-Word and Co-Citation Networks," Future Internet, MDPI, vol. 12(3), pages 1-14, March.
    14. Qizhi Tao & Yizhe Dong & Ziming Lin, 2017. "Who can get money? Evidence from the Chinese peer-to-peer lending platform," Information Systems Frontiers, Springer, vol. 19(3), pages 425-441, June.
    15. Rajkamal Iyer & Asim Ijaz Khwaja & Erzo F. P. Luttmer & Kelly Shue, 2016. "Screening Peers Softly: Inferring the Quality of Small Borrowers," Management Science, INFORMS, vol. 62(6), pages 1554-1577, June.
    16. Huosong Xia & Jing Liu & Zuopeng Justin Zhang, 2024. "Identifying Fintech risk through machine learning: analyzing the Q&A text of an online loan investment platform," Annals of Operations Research, Springer, vol. 333(2), pages 579-599, February.
    17. Medina-Olivares, Victor & Calabrese, Raffaella & Dong, Yizhe & Shi, Baofeng, 2022. "Spatial dependence in microfinance credit default," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1071-1085.
    18. Mousumi Munmun & Dongli Zhang & Charles C. Luo, 2024. "Peer-to-Peer Lending Performance Improvement: Learn from Lean Principles," International Journal of Business and Management, Canadian Center of Science and Education, vol. 19(1), pages 101-101, February.
    19. Ajay Byanjankar & József Mezei & Markku Heikkilä, 2021. "Data‐driven optimization of peer‐to‐peer lending portfolios based on the expected value framework," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 28(2), pages 119-129, April.
    20. Miller, Sarah, 2015. "Information and default in consumer credit markets: Evidence from a natural experiment," Journal of Financial Intermediation, Elsevier, vol. 24(1), pages 45-70.
    21. Li, Zhiyong & Li, Aimin & Bellotti, Anthony & Yao, Xiao, 2023. "The profitability of online loans: A competing risks analysis on default and prepayment," European Journal of Operational Research, Elsevier, vol. 306(2), pages 968-985.
