Loan default prediction by combining soft information extracted from descriptive text in online peer-to-peer lending

Authors

  • Cuiqing Jiang (Hefei University of Technology)
  • Zhao Wang (Hefei University of Technology)
  • Ruiya Wang (Hefei University of Technology)
  • Yong Ding (Hefei University of Technology)

Abstract

Predicting whether a borrower will default on a loan is a significant concern for platforms and investors in online peer-to-peer (P2P) lending. Because the data that online platforms use are complex and include unstructured information such as text, which is difficult to quantify and analyze, loan default prediction in P2P lending faces new challenges. To address this, we propose a default prediction method for P2P lending that incorporates soft information extracted from textual descriptions. We introduce a topic model to extract valuable features from the descriptive text of loans and construct four default prediction models to demonstrate the performance of these features in default prediction. Moreover, we design a two-stage method to select an effective feature set containing both soft and hard information. An empirical analysis using real-world data from a major P2P lending platform in China shows that the proposed method can improve loan default prediction performance compared with existing methods based only on hard information.
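The idea of turning loan descriptions into topic features and joining them with hard information can be sketched as follows. This is a minimal illustration only, not the authors' implementation: the abstract does not name the topic model, so latent Dirichlet allocation (LDA) with collapsed Gibbs sampling is assumed here, and the loan descriptions, the hard features (amount, interest rate), and the function names are all hypothetical.

```python
import random

def lda_topic_features(docs, n_topics, n_iter=200, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA (an assumed topic model).
    Returns one smoothed topic-proportion vector per tokenized document."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    n_words = len(vocab)
    doc_topic = [[0] * n_topics for _ in docs]             # topic counts per document
    topic_word = [[0] * n_words for _ in range(n_topics)]  # word counts per topic
    topic_total = [0] * n_topics                           # token counts per topic
    assignments = []
    # Random initial topic assignment for every token.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(n_topics)
            z_d.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_d)
    # Resample each token's topic from its conditional distribution.
    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                k, v = assignments[d][i], word_id[w]
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][v] + beta)
                    / (topic_total[t] + n_words * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1
    # Smoothed document-topic proportions serve as the soft features.
    features = []
    for d, doc in enumerate(docs):
        denom = len(doc) + n_topics * alpha
        features.append([(doc_topic[d][t] + alpha) / denom for t in range(n_topics)])
    return features

def combine_features(hard_rows, soft_rows):
    """Concatenate hard (structured) and soft (topic) features for each loan."""
    return [list(h) + list(s) for h, s in zip(hard_rows, soft_rows)]

# Hypothetical tokenized loan descriptions and hard features (amount, interest rate).
descriptions = [
    ["shop", "inventory", "business", "expand"],
    ["tuition", "school", "study", "fees"],
    ["business", "equipment", "shop", "cash"],
    ["study", "tuition", "exam", "school"],
]
hard = [[5000, 0.12], [2000, 0.15], [8000, 0.11], [1500, 0.16]]

soft = lda_topic_features(descriptions, n_topics=2)
combined = combine_features(hard, soft)
for row in combined:
    print(row)
```

Each combined row could then be passed to any classifier. The abstract mentions four default prediction models and a two-stage feature selection step but does not specify them, so neither is sketched here.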

Suggested Citation

  • Cuiqing Jiang & Zhao Wang & Ruiya Wang & Yong Ding, 2018. "Loan default prediction by combining soft information extracted from descriptive text in online peer-to-peer lending," Annals of Operations Research, Springer, vol. 266(1), pages 511-529, July.
  • Handle: RePEc:spr:annopr:v:266:y:2018:i:1:d:10.1007_s10479-017-2668-z
    DOI: 10.1007/s10479-017-2668-z

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10479-017-2668-z
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Eid, Nourhan & Maltby, Josephine & Talavera, Oleksandr, 2016. "Income Rounding and Loan Performance in the Peer-to-Peer Market," MPRA Paper 72852, University Library of Munich, Germany.
    2. Xia, Yufei & Zhao, Junhao & He, Lingyun & Li, Yinguo & Yang, Xiaoli, 2021. "Forecasting loss given default for peer-to-peer loans via heterogeneous stacking ensemble approach," International Journal of Forecasting, Elsevier, vol. 37(4), pages 1590-1613.
    3. Kriebel, Johannes & Stitz, Lennart, 2022. "Credit default prediction from user-generated text in peer-to-peer lending using deep learning," European Journal of Operational Research, Elsevier, vol. 302(1), pages 309-323.
    4. Jianrong Yao & Jiarui Chen & June Wei & Yuangao Chen & Shuiqing Yang, 2019. "The relationship between soft information in loan titles and online peer-to-peer lending: evidence from RenRenDai platform," Electronic Commerce Research, Springer, vol. 19(1), pages 111-129, March.
    5. Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.
    6. Dimitris Andriosopoulos & Michalis Doumpos & Panos M. Pardalos & Constantin Zopounidis, 2019. "Computational approaches and data analytics in financial services: A literature review," Journal of the Operational Research Society, Taylor & Francis Journals, vol. 70(10), pages 1581-1599, October.
    7. Carlos Serrano-Cinca & Begoña Gutiérrez-Nieto & Luz López-Palacios, 2015. "Determinants of Default in P2P Lending," PLOS ONE, Public Library of Science, vol. 10(10), pages 1-22, October.
    8. Jong Wook Lee & So Young Sohn, 2021. "Evaluating borrowers’ default risk with a spatial probit model reflecting the distance in their relational network," PLOS ONE, Public Library of Science, vol. 16(12), pages 1-11, December.
    9. Wolfgang Pointner & Burkhard Raunig, 2018. "A primer on peer-to-peer lending: immediate financial intermediation in practice," Monetary Policy & the Economy, Oesterreichische Nationalbank (Austrian Central Bank), issue Q3/18, pages 36-51.
    10. Xueru Chen & Xiaoji Hu & Shenglin Ben, 2021. "How do reputation, structure design and FinTech ecosystem affect the net cash inflow of P2P lending platforms? Evidence from China," Electronic Commerce Research, Springer, vol. 21(4), pages 1055-1082, December.
    11. Dorfleitner, Gregor & Rad, Jacqueline & Weber, Martina, 2017. "Pricing in the online invoice trading market: First empirical evidence," Economics Letters, Elsevier, vol. 161(C), pages 56-61.
    12. Zhao Wang & Cuiqing Jiang & Huimin Zhao, 2022. "Know Where to Invest: Platform Risk Evaluation in Online Lending," Information Systems Research, INFORMS, vol. 33(3), pages 765-783, September.
    13. Teply, Petr & Polena, Michal, 2020. "Best classification algorithms in peer-to-peer lending," The North American Journal of Economics and Finance, Elsevier, vol. 51(C).
    14. Yufei Xia & Lingyun He & Yinguo Li & Nana Liu & Yanlin Ding, 2020. "Predicting loan default in peer‐to‐peer lending using narrative data," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(2), pages 260-280, March.
    15. Wang, Qi & Xiong, Xiong & Zheng, Zunxin, 2021. "Platform Characteristics and Online Peer-to-Peer Lending: Evidence from China," Finance Research Letters, Elsevier, vol. 38(C).
    16. Qizhi Tao & Yizhe Dong & Ziming Lin, 2017. "Who can get money? Evidence from the Chinese peer-to-peer lending platform," Information Systems Frontiers, Springer, vol. 19(3), pages 425-441, June.
    17. Wu, Yu & Zhang, Tong, 2021. "Can credit ratings predict defaults in peer-to-peer online lending? Evidence from a Chinese platform," Finance Research Letters, Elsevier, vol. 40(C).
    18. Wolfgang Breuer & Can K. Soypak & Bertram I. Steininger, 2020. "Magnitude effects in lending and borrowing: empirical evidence from a P2P platform," The European Journal of Finance, Taylor & Francis Journals, vol. 26(9), pages 854-873, June.
    19. Liu, Aiping & Urquía-Grande, Elena & López-Sánchez, Pilar & Rodríguez-López, Ángel, 2023. "Research into microfinance and ICTs: A bibliometric analysis," Evaluation and Program Planning, Elsevier, vol. 97(C).
    20. Li, Zhiyong & Li, Aimin & Bellotti, Anthony & Yao, Xiao, 2023. "The profitability of online loans: A competing risks analysis on default and prepayment," European Journal of Operational Research, Elsevier, vol. 306(2), pages 968-985.
