IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0303566.html
   My bibliography  Save this article

Enhancing credit scoring accuracy with a comprehensive evaluation of alternative data

Author

Listed:
  • Rivalani Hlongwane
  • Kutlwano K K M Ramaboa
  • Wilson Mongwe

Abstract

This study explores the potential of utilizing alternative data sources to enhance the accuracy of credit scoring models, compared to relying solely on traditional data sources, such as credit bureau data. A comprehensive dataset from the Home Credit Group’s home loan portfolio is analysed. The research examines the impact of incorporating alternative predictors that are typically overlooked, such as an applicant’s social network default status, regional economic ratings, and local population characteristics. The modelling approach applies the model-X knockoffs framework for systematic variable selection. By including these alternative data sources, the credit scoring models demonstrate improved predictive performance, achieving an area under the curve metric of 0.79360 on the Kaggle Home Credit default risk competition dataset, outperforming models that relied solely on traditional data sources, such as credit bureau data. The findings highlight the significance of leveraging diverse, non-traditional data sources to augment credit risk assessment capabilities and overall model accuracy.

Suggested Citation

  • Rivalani Hlongwane & Kutlwano K K M Ramaboa & Wilson Mongwe, 2024. "Enhancing credit scoring accuracy with a comprehensive evaluation of alternative data," PLOS ONE, Public Library of Science, vol. 19(5), pages 1-18, May.
  • Handle: RePEc:plo:pone00:0303566
    DOI: 10.1371/journal.pone.0303566
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0303566
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0303566&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0303566?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Yanhao Wei & Pinar Yildirim & Christophe Van den Bulte & Chrysanthos Dellarocas, 2016. "Credit Scoring with Social Network Data," Marketing Science, INFORMS, vol. 35(2), pages 234-258, March.
    2. Coussement, Kristof & Benoit, Dries Frederik & Van den Poel, Dirk, 2009. "Improved Marketing Decision Making in a Customer Churn Prediction Context Using Generalized Additive Models," Working Papers 2009/18, Hogeschool-Universiteit Brussel, Faculteit Economie en Management.
    3. Timotej Jagric & Vita Jagric & Davorin Kracun, 2011. "Does Non-linearity Matter in Retail Credit Risk Modeling?," Czech Journal of Economics and Finance (Finance a uver), Charles University Prague, Faculty of Social Sciences, vol. 61(4), pages 384-402, August.
    4. Lean Yu & Lihang Yu & Kaitao Yu, 2021. "A high-dimensionality-trait-driven learning paradigm for high dimensional credit classification," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 7(1), pages 1-20, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Koen W. de Bock & Arno de Caigny, 2021. "Spline-rule ensemble classifiers with structured sparsity regularization for interpretable customer churn modeling," Post-Print hal-03391564, HAL.
    2. Louis Geiler & Séverine Affeldt & Mohamed Nadif, 2022. "A survey on machine learning methods for churn prediction," Post-Print hal-03824873, HAL.
    3. Mohammad Sahabuddin & Junaina Muhammad & Mohamed Hisham Yahya & Sabarina Mohammed Shah & Md. Kausar Alam, 2019. "Digitalization, Innovation and Sustainable Development: An Evidence of Islamic Finance Perspective," International Journal of Asian Social Science, Asian Economic and Social Society, vol. 9(12), pages 651-656, December.
    4. Daniel Bjorkegren & Joshua E. Blumenstock & Samsun Knight, 2020. "Manipulation-Proof Machine Learning," Papers 2004.03865, arXiv.org.
    5. Shiqi Fang & Zexun Chen & Jake Ansell, 2024. "Peer-induced Fairness: A Causal Approach for Algorithmic Fairness Auditing," Papers 2408.02558, arXiv.org, revised Sep 2024.
    6. Sanjiv Das & Xin Huang & Soji Adeshina & Patrick Yang & Leonardo Bachega, 2023. "Credit Risk Modeling with Graph Machine Learning," INFORMS Joural on Data Science, INFORMS, vol. 2(2), pages 197-217, October.
    7. Nana Chai & Baofeng Shi & Bin Meng & Yizhe Dong, 2023. "Default Feature Selection in Credit Risk Modeling: Evidence From Chinese Small Enterprises," SAGE Open, , vol. 13(2), pages 21582440231, April.
    8. Jian Mou & J. Christopher Westland & Tuan Q. Phan & Tianhui Tan, 2020. "Microlending on mobile social credit platforms: an exploratory study using Philippine loan contracts," Electronic Commerce Research, Springer, vol. 20(1), pages 173-196, March.
    9. Coussement, Kristof & De Bock, Koen W., 2013. "Customer churn prediction in the online gambling industry: The beneficial effect of ensemble learning," Journal of Business Research, Elsevier, vol. 66(9), pages 1629-1636.
    10. P. Baecke & D. Van Den Poel, 2012. "Including Spatial Interdependence in Customer Acquisition Models: a Cross-Category Comparison," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 12/788, Ghent University, Faculty of Economics and Business Administration.
    11. João Paulo Coelho Ribeiro & Fábio Duarte & Ana Paula Matias Gama, 2022. "Does microfinance foster the development of its clients? A bibliometric analysis and systematic literature review," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-35, December.
    12. Waas, Bernd, 2023. "Künstliche Intelligenz und Arbeitsrecht," HSI-Schriftenreihe, Hugo Sinzheimer Institute for Labour and Social Security Law (HSI), Hans Böckler Foundation, volume 46, number 303122.
    13. Pinar Yildirim & Yanhao Wei & Christophe Bulte & Joy Lu, 2020. "Social network design for inducing effort," Quantitative Marketing and Economics (QME), Springer, vol. 18(4), pages 381-417, December.
    14. K. W. De Bock & D. Van Den Poel, 2011. "An empirical evaluation of rotation-based ensemble classifiers for customer churn prediction," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 11/717, Ghent University, Faculty of Economics and Business Administration.
    15. David A. Schweidel & Yakov Bart & J. Jeffrey Inman & Andrew T. Stephen & Barak Libai & Michelle Andrews & Ana Babić Rosario & Inyoung Chae & Zoey Chen & Daniella Kupor & Chiara Longoni & Felipe Thomaz, 2022. "How consumer digital signals are reshaping the customer journey," Journal of the Academy of Marketing Science, Springer, vol. 50(6), pages 1257-1276, November.
    16. Seungwook Kim & Daeyoung Choi & Eunjung Lee & Wonjong Rhee, 2017. "Churn prediction of mobile and online casual games using play log data," PLOS ONE, Public Library of Science, vol. 12(7), pages 1-19, July.
    17. Bryan Bollinger & Song Yao, 2018. "Risk transfer versus cost reduction on two-sided microfinance platforms," Quantitative Marketing and Economics (QME), Springer, vol. 16(3), pages 251-287, September.
    18. K. W. De Bock & D. Van Den Poel, 2012. "Reconciling Performance and Interpretability in Customer Churn Prediction using Ensemble Learning based on Generalized Additive Models," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 12/805, Ghent University, Faculty of Economics and Business Administration.
    19. Matthias Bogaert & Michel Ballings & Martijn Hosten & Dirk Van den Poel, 2017. "Identifying Soccer Players on Facebook Through Predictive Analytics," Decision Analysis, INFORMS, vol. 14(4), pages 274-297, December.
    20. Dimitris Andriosopoulos & Michalis Doumpos & Panos M. Pardalos & Constantin Zopounidis, 2019. "Computational approaches and data analytics in financial services: A literature review," Journal of the Operational Research Society, Taylor & Francis Journals, vol. 70(10), pages 1581-1599, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0303566. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.