IDEAS home Printed from https://ideas.repec.org/p/boa/wpaper/202532.html
   My bibliography  Save this paper

Machine learning and financial inclusion: Evidence from credit risk assessment of small-business loans in China

Author

Listed:
  • YANG, ZHANG

    (Department of Finance and Business Economics, Faculty of Business Administration / Asia-Pacific Academy of Economics and Management, University of Macau)

  • JIANXIONG LIN

    (QIFU Technology, China)

  • YIHE QIAN

    (Department of Finance and Business Economics, Faculty of Business Administration, University of Macau)

  • LIANJIE SHU

    (Faculty of Business Administration , University of Macau)

Abstract

MachiAs a key enabler of poverty alleviation and equitable growth, financial inclusion aims to expand access to credit and financial services for underserved individuals and small businesses. However, the elevated default risk and data scarcity in inclusive lending present major challenges to traditional credit assessment tools. This study evaluates whether machine learning (ML) techniques can improve default prediction for small-business loans,thereby enhancing the effectiveness and fairness of credit allocation. Using proprietary loan-level data from a city commercial bank in China, we compare eight classification models—Logistic Regression, Linear Discriminant Analysis (LDA), K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Decision Tree, Random Forest, XGBoost, and LightGBM—under three sampling strategies to address class imbalance. Our findings reveal that undersampling significantly enhances model performance, and tree-based ML models, particularly XGBoost and Decision Tree, outperform traditional classifiers. Feature importance and misclassification analyses suggest that documentation completeness, demographic traits, and credit utilization are critical predictors of default. By combining robust empirical validation with model interpretability, this study contributes to the growing literature at the intersection of machine learning, credit risk, and financial development. Our findings offer actionable insights for policymakers, financial institutions, and data scientists working to build fairer and more effective credit systems in emerging markets.

Suggested Citation

  • Yang, Zhang & Jianxiong Lin & Yihe Qian & Lianjie Shu, 2025. "Machine learning and financial inclusion: Evidence from credit risk assessment of small-business loans in China," Working Papers 202532, University of Macau, Faculty of Business Administration.
  • Handle: RePEc:boa:wpaper:202532
    as

    Download full text from publisher

    File URL: https://fba.um.edu.mo/wp-content/uploads/RePEc/doc/202532.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.
    2. Yung-Chia Chang & Kuei-Hu Chang & Heng-Hsuan Chu & Lee-Ing Tong, 2016. "Establishing decision tree-based short-term default credit risk assessment models," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 45(23), pages 6803-6815, December.
    3. Gambacorta, Leonardo & Huang, Yiping & Qiu, Han & Wang, Jingyi, 2024. "How do machine learning and non-traditional data affect credit scoring? New evidence from a Chinese fintech firm," Journal of Financial Stability, Elsevier, vol. 73(C).
    4. Andreas Fuster & Paul Goldsmith‐Pinkham & Tarun Ramadorai & Ansgar Walther, 2022. "Predictably Unequal? The Effects of Machine Learning on Credit Markets," Journal of Finance, American Finance Association, vol. 77(1), pages 5-47, February.
    5. Kuldeep Kumar & Sukanto Bhattacharya, 2006. "Artificial neural network vs linear discriminant analysis in credit ratings forecast: A comparative study of prediction performances," Review of Accounting and Finance, Emerald Group Publishing, vol. 5(3), pages 216-227, August.
    6. Butaru, Florentin & Chen, Qingqing & Clark, Brian & Das, Sanmay & Lo, Andrew W. & Siddique, Akhtar, 2016. "Risk and risk management in the credit card industry," Journal of Banking & Finance, Elsevier, vol. 72(C), pages 218-239.
    7. A Matuszyk & C Mues & L C Thomas, 2010. "Modelling LGD for unsecured personal loans: decision tree approach," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(3), pages 393-398, March.
    8. Julapa Jagtiani & Catharine Lemieux, 2019. "The roles of alternative data and machine learning in fintech lending: Evidence from the LendingClub consumer platform," Financial Management, Financial Management Association International, vol. 48(4), pages 1009-1029, December.
    9. Adair Morse, 2015. "Peer-to-Peer Crowdfunding: Information and the Potential for Disruption in Consumer Lending," Annual Review of Financial Economics, Annual Reviews, vol. 7(1), pages 463-482, December.
    10. Peter Gomber & Jascha-Alexander Koch & Michael Siering, 2017. "Digital Finance and FinTech: current research and future research directions," Journal of Business Economics, Springer, vol. 87(5), pages 537-580, July.
    11. Edward I. Altman & Gabriele Sabato, 2013. "MODELING CREDIT RISK FOR SMEs: EVIDENCE FROM THE US MARKET," World Scientific Book Chapters, in: Oliviero Roggi & Edward I Altman (ed.), Managing and Measuring Risk Emerging Global Standards and Regulations After the Financial Crisis, chapter 9, pages 251-279, World Scientific Publishing Co. Pte. Ltd..
    12. Aida Krichene Abdelmoula, 2015. "Bank Credit Risk Analysis with K-Nearest-Neighbor Classifier: Case of Tunisian Banks," Journal of Accounting and Management Information Systems, Faculty of Accounting and Management Information Systems, The Bucharest University of Economic Studies, vol. 14(1), pages 79-106, March.
    13. Thomas, Lyn C., 2009. "Consumer Credit Models: Pricing, Profit and Portfolios," OUP Catalogue, Oxford University Press, number 9780199232130, Decembrie.
    14. Adair Morse, 2015. "Peer-to-Peer Crowdfunding: Information and the Potential for Disruption in Consumer Lending," NBER Working Papers 20899, National Bureau of Economic Research, Inc.
    15. Jon Frost & Leonardo Gambacorta & Yi Huang & Hyun Song Shin & Pablo Zbinden, 2019. "BigTech and the changing structure of financial intermediation," Economic Policy, CEPR, CESifo, Sciences Po;CES;MSH, vol. 34(100), pages 761-799.
    16. Eliana Costa e Silva & Isabel Cristina Lopes & Aldina Correia & Susana Faria, 2020. "A logistic regression model for consumer default risk," Journal of Applied Statistics, Taylor & Francis Journals, vol. 47(13-15), pages 2879-2894, November.
    17. Miriam Bruhn & Inessa Love, 2014. "The Real Impact of Improved Access to Finance: Evidence from Mexico," Journal of Finance, American Finance Association, vol. 69(3), pages 1347-1376, June.
    18. B Baesens & T Van Gestel & S Viaene & M Stepanova & J Suykens & J Vanthienen, 2003. "Benchmarking state-of-the-art classification algorithms for credit scoring," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 54(6), pages 627-635, June.
    19. Majid Bazarbash, 2019. "FinTech in Financial Inclusion: Machine Learning Applications in Assessing Credit Risk," IMF Working Papers 2019/109, International Monetary Fund.
    20. Yiping Huang & Ms. Longmei Zhang & Zhenhua Li & Han Qiu & Tao Sun & Xue Wang, 2020. "Fintech Credit Risk Assessment for SMEs: Evidence from China," IMF Working Papers 2020/193, International Monetary Fund.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nicola Branzoli & Ilaria Supino, 2020. "FinTech credit: a critical review of empirical research," Questioni di Economia e Finanza (Occasional Papers) 549, Bank of Italy, Economic Research and International Relations Area.
    2. Tobias Berg & Andreas Fuster & Manju Puri, 2022. "FinTech Lending," Annual Review of Financial Economics, Annual Reviews, vol. 14(1), pages 187-207, November.
    3. Lei Lu & Jianxing Wei & Weixing Wu & Yi Zhou, 2023. "Pricing strategies in BigTech lending: Evidence from China," Financial Management, Financial Management Association International, vol. 52(2), pages 333-374, June.
    4. Thorsten Beck & Leonardo Gambacorta & Yiping Huang & Zhenhua Li & Han Qiu, 2022. "Big techs, QR code payments and financial inclusion," BIS Working Papers 1011, Bank for International Settlements.
    5. Zeynep Alraqeb & Peter Knaack & Camille Macaire, 2022. "Does FinTech Promote Entrepreneurship? Evidence from China," Working papers 895, Banque de France.
    6. Huang, Yiping & Li, Zhenhua & Qiu, Han & Tao, Sun & Wang, Xue & Zhang, Longmei, 2023. "BigTech credit risk assessment for SMEs," China Economic Review, Elsevier, vol. 81(C).
    7. Andrés Alonso Robisco & José Manuel Carbó Martínez, 2022. "Measuring the model risk-adjusted performance of machine learning algorithms in credit default prediction," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-35, December.
    8. Eccles, Peter & Grout, Paul & Siciliani, Paolo & Zalewska, Anna, 2021. "The impact of machine learning and big data on credit markets," Bank of England working papers 930, Bank of England.
    9. Dong, Yingwei & Gou, Qin & Qiu, Han, 2023. "Big tech credit score and default risk ——Evidence from loan-level data of a representative microfinance company in China," China Economic Review, Elsevier, vol. 81(C).
    10. Hui, Peng & Zhao, Hui & Liu, Desheng & Li, Ye, 2023. "How does digital finance affect regional innovation capacity? A spatial econometric analysis," Economic Modelling, Elsevier, vol. 122(C).
    11. Rishabh, Kumar & Schäublin, Jorma, 2021. "Payment Fintechs and Debt Enforcement," Working papers 2021/02, Faculty of Business and Economics - University of Basel.
    12. Leonardo Gambacorta & Yiping Huang & Zhenhua Li & Han Qiu & Shu Chen, 2020. "Data vs collateral," BIS Working Papers 881, Bank for International Settlements.
    13. Marcus Buckmann & Andy Haldane & Anne-Caroline Hüser, 2021. "Comparing minds and machines: implications for financial stability," Oxford Review of Economic Policy, Oxford University Press and Oxford Review of Economic Policy Limited, vol. 37(3), pages 479-508.
    14. Doumpos, Michalis & Zopounidis, Constantin & Gounopoulos, Dimitrios & Platanakis, Emmanouil & Zhang, Wenke, 2023. "Operational research and artificial intelligence methods in banking," European Journal of Operational Research, Elsevier, vol. 306(1), pages 1-16.
    15. Anita Mirchandani & Namrata Gupta & Esinath Ndiweni, 2020. "Understanding the Fintech Wave: A Search for a Theoretical Explanation," International Journal of Economics and Financial Issues, Econjournals, vol. 10(5), pages 331-343.
    16. Cecchetti, Stephen & Schoenholtz, Kermit L., 2020. "Finance and Technology: What is changing and what is not," CEPR Discussion Papers 15352, C.E.P.R. Discussion Papers.
    17. Kowalewski, Oskar & Pisany, Paweł, 2022. "Banks' consumer lending reaction to fintech and bigtech credit emergence in the context of soft versus hard credit information processing," International Review of Financial Analysis, Elsevier, vol. 81(C).
    18. Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
    19. Anagnostopoulos, Ioannis, 2018. "Fintech and regtech: Impact on regulators and banks," Journal of Economics and Business, Elsevier, vol. 100(C), pages 7-25.
    20. Xie, Xueyan & Zhu, Xiaoyang, 2022. "FinTech and capital allocation efficiency: Another equity-efficiency dilemma?," Global Finance Journal, Elsevier, vol. 53(C).

    More about this item

    Keywords

    machine learning; financial inclusion; small business; China; credit risk assessment;
    All these keywords.

    JEL classification:

    • G21 - Financial Economics - - Financial Institutions and Services - - - Banks; Other Depository Institutions; Micro Finance Institutions; Mortgages
    • G32 - Financial Economics - - Corporate Finance and Governance - - - Financing Policy; Financial Risk and Risk Management; Capital and Ownership Structure; Value of Firms; Goodwill
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
    • O16 - Economic Development, Innovation, Technological Change, and Growth - - Economic Development - - - Financial Markets; Saving and Capital Investment; Corporate Finance and Governance

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:boa:wpaper:202532. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Carla Leong (email available below). General contact details of provider: https://edirc.repec.org/data/fbmacmo.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.