IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2512.12783.html

Credit Risk Estimation with Non-Financial Features: Evidence from a Synthetic Istanbul Dataset

Author

Listed:
  • Atalay Denknalbant
  • Emre Sezdi
  • Zeki Furkan Kutlu

Abstract

Financial exclusion constrains entrepreneurship, increases income volatility, and widens wealth gaps. Underbanked consumers in Istanbul often have no bureau file because their earnings and payments flow through informal channels. To study how such borrowers can be evaluated we create a synthetic dataset of one hundred thousand Istanbul residents that reproduces first quarter 2025 T\"U\.IK (TURKSTAT) census marginals and telecom usage patterns. Retrieval augmented generation feeds these public statistics into the OpenAI o3 model, which synthesises realistic yet private records. Each profile contains seven socio demographic variables and nine alternative attributes that describe phone specifications, online shopping rhythm, subscription spend, car ownership, monthly rent, and a credit card flag. To test the impact of the alternative financial data CatBoost, LightGBM, and XGBoost are each trained in two versions. Demo models use only the socio demographic variables; Full models include both socio demographic and alternative attributes. Across five fold stratified validation the alternative block raises area under the curve by about one point three percentage and lifts balanced F 1 from roughly 0.84 to 0.95, a fourteen percent gain. We contribute an open Istanbul 2025 Q1 synthetic dataset, a fully reproducible modeling pipeline, and empirical evidence that a concise set of behavioural attributes can approach bureau level discrimination power while serving borrowers who lack formal credit records. These findings give lenders and regulators a transparent blueprint for extending fair and safe credit access to the underbanked.

Suggested Citation

  • Atalay Denknalbant & Emre Sezdi & Zeki Furkan Kutlu, 2025. "Credit Risk Estimation with Non-Financial Features: Evidence from a Synthetic Istanbul Dataset," Papers 2512.12783, arXiv.org, revised Feb 2026.
  • Handle: RePEc:arx:papers:2512.12783
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2512.12783
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Julapa Jagtiani & Catharine Lemieux, 2019. "The roles of alternative data and machine learning in fintech lending: Evidence from the LendingClub consumer platform," Financial Management, Financial Management Association International, vol. 48(4), pages 1009-1029, December.
    2. Arráiz,Irani & Bruhn,Miriam & Stucchi,Rodolfo Mario, 2015. "Psychometrics as a tool to improve screening and access to credit," Policy Research Working Paper Series 7506, The World Bank.
    3. Razavi, Rouzbeh & Elbahnasawy, Nasr G., 2025. "Unlocking credit access: Using non-CDR mobile data to enhance credit scoring for financial inclusion," Finance Research Letters, Elsevier, vol. 73(C).
    4. Yanhao Wei & Pinar Yildirim & Christophe Van den Bulte & Chrysanthos Dellarocas, 2016. "Credit Scoring with Social Network Data," Marketing Science, INFORMS, vol. 35(2), pages 234-258, March.
    5. Jiang, Jinglin & Liao, Li & Lu, Xi & Wang, Zhengwei & Xiang, Hongyu, 2021. "Deciphering big data in consumer credit evaluation," Journal of Empirical Finance, Elsevier, vol. 62(C), pages 28-45.
    6. Demirguc-Kunt,Asli & Klapper,Leora & Singer,Dorothe, 2017. "Financial inclusion and inclusive growth : a review of recent empirical evidence," Policy Research Working Paper Series 8040, The World Bank.
    7. Mingfeng Lin & Nagpurnanand R. Prabhala & Siva Viswanathan, 2013. "Judging Borrowers by the Company They Keep: Friendship Networks and Information Asymmetry in Online Peer-to-Peer Lending," Management Science, INFORMS, vol. 59(1), pages 17-35, August.
    8. Andry Alamsyah & Aufa Azhari Hafidh & Annisa Dwiyanti Mulya, 2025. "Innovative Credit Risk Assessment: Leveraging Social Media Data for Inclusive Credit Scoring in Indonesia’s Fintech Sector," JRFM, MDPI, vol. 18(2), pages 1-32, February.
    9. Dao Ha & Phuong Le & Duc Khuong Nguyen, 2025. "Financial inclusion and fintech: a state-of-the-art systematic literature review," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 11(1), pages 1-42, December.
    10. Marco Di Maggio & Dimuthu Ratnadiwakara & Don Carmichael, 2022. "Invisible Primes: Fintech Lending with Alternative Data," NBER Working Papers 29840, National Bureau of Economic Research, Inc.
    11. Tobias Berg & Valentin Burg & Ana Gombović & Manju Puri, 2020. "On the Rise of FinTechs: Credit Scoring Using Digital Footprints," The Review of Financial Studies, Society for Financial Studies, vol. 33(7), pages 2845-2897.
    12. Rivalani Hlongwane & Kutlwano K K M Ramaboa & Wilson Mongwe, 2024. "Enhancing credit scoring accuracy with a comprehensive evaluation of alternative data," PLOS ONE, Public Library of Science, vol. 19(5), pages 1-18, May.
    13. Arráiz,Irani & Bruhn,Miriam & Stucchi,Rodolfo Mario, 2015. "Psychometrics as a tool to improve screening and access to credit," Policy Research Working Paper Series 7506, The World Bank.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yidi Liu & Xin Li & Zhiqiang (Eric) Zheng, 2024. "Consequences of China’s 2018 Online Lending Regulation and the Promise of PolicyTech," Information Systems Research, INFORMS, vol. 35(3), pages 1235-1256, September.
    2. Brandon Goldstein & Julapa Jagtiani & Catharine Lemieux, 2023. "Did Fintech Loans Default More During the COVID-19 Pandemic? Were Fintech Firms “Cream-Skimming” the Best Borrowers?," Working Papers 23-26, Federal Reserve Bank of Philadelphia.
    3. Joseph P. Hughes & Julapa Jagtiani & Choon-Geol Moon, 2022. "Consumer lending efficiency: commercial banks versus a fintech lender," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-39, December.
    4. Loutfi, Ahmad Amine, 2022. "A framework for evaluating the business deployability of digital footprint based models for consumer credit," Journal of Business Research, Elsevier, vol. 152(C), pages 473-486.
    5. Tobias Berg & Andreas Fuster & Manju Puri, 2022. "FinTech Lending," Annual Review of Financial Economics, Annual Reviews, vol. 14(1), pages 187-207, November.
    6. Linhui Wang & Jianping Zhu & Chenlu Zheng & Zhiyuan Zhang, 2024. "Incorporating Digital Footprints into Credit-Scoring Models through Model Averaging," Mathematics, MDPI, vol. 12(18), pages 1-15, September.
    7. Gao, Mingze & Leung, Henry & Liu, Linhui & Qiu, Buhui, 2023. "Consumer behaviour and credit supply: Evidence from an Australian FinTech lender," Finance Research Letters, Elsevier, vol. 57(C).
    8. Khan, Habib Hussain & Ahmad, Mohammad Rais, 2025. "The fintech revolution: Exploring the potential of fintech finance in reducing corporate credit constraints," Research in International Business and Finance, Elsevier, vol. 79(C).
    9. Ao, Zhiming & Ji, Xinru, 2025. "Strategic cooperation in fintech field and efficiency of commercial banks," The North American Journal of Economics and Finance, Elsevier, vol. 76(C).
    10. Christa Gibbs & Benedict Guttman-Kenney & Donghoon Lee & Scott Nelson & Wilbert van der Klaauw & Jialan Wang, 2025. "Consumer Credit Reporting Data," Journal of Economic Literature, American Economic Association, vol. 63(2), pages 598-636, June.
    11. Bredice, Marilena & Formisano, Anna Vittoria & Kullafi, Sara & Palma, Pasquale, 2025. "Access to credit and fintech: A lexicon-based sentiment analysis application on Twitter data," Research in International Business and Finance, Elsevier, vol. 77(PA).
    12. Yidi Liu & Xin Li & Zhiqiang (Eric) Zheng, 2024. "Smart Natural Disaster Relief: Assisting Victims with Artificial Intelligence in Lending," Information Systems Research, INFORMS, vol. 35(2), pages 489-504, June.
    13. Tigges, Maximilian & Mestwerdt, Sönke & Tschirner, Sebastian & Mauer, René, 2024. "Who gets the money? A qualitative analysis of fintech lending and credit scoring through the adoption of AI and alternative data," Technological Forecasting and Social Change, Elsevier, vol. 205(C).
    14. Allen, Franklin & Gu, Xian & Jagtiani, Julapa, 2022. "Fintech, Cryptocurrencies, and CBDC: Financial Structural Transformation in China," Journal of International Money and Finance, Elsevier, vol. 124(C).
    15. Nam, Rachel J., 2022. "Open banking and customer data sharing: Implications for FinTech borrowers," SAFE Working Paper Series 364, Leibniz Institute for Financial Research SAFE.
    16. Pankaj Kumar Maskara & Emre Kuvvet & Gengxuan Chen, 2021. "The role of P2P platforms in enhancing financial inclusion in the United States: An analysis of peer‐to‐peer lending across the rural–urban divide," Financial Management, Financial Management Association International, vol. 50(3), pages 747-774, September.
    17. Peng, Hongfeng & Ji, Jiao & Sun, Hanwen & Xu, Haofeng, 2023. "Legal enforcement and fintech credit: International evidence," Journal of Empirical Finance, Elsevier, vol. 72(C), pages 214-231.
    18. Sumin Hu & Qi Zhu & Xia Zhao & Ziyue Xu, 2023. "Digital Finance and Corporate Sustainability Performance: Promoting or Restricting? Evidence from China’s Listed Companies," Sustainability, MDPI, vol. 15(13), pages 1-16, June.
    19. Doerr, Sebastian & Frost, Jon & Gambacorta, Leonardo & Shreeti, Vatsala, 2023. "Big techs in finance," CEPR Discussion Papers 18665, C.E.P.R. Discussion Papers.
    20. Lu, Yao & Li, Lu & Zhan, Huanqi & Zhan, Minghua, 2024. "Fintech, information heterogeneity, and the regional distribution effects of corporate financing constraints," Economic Modelling, Elsevier, vol. 137(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2512.12783. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.