IDEAS home Printed from https://ideas.repec.org/a/eee/quaeco/v100y2025ics1062976925000018.html
   My bibliography  Save this article

A novel credit model risk measure: Do more data lead to lower model risk?

Author

Listed:
  • Yoshida, Valter T.
  • Schiozer, Rafael
  • de Genaro, Alan
  • dos Santos, Toni R.E.

Abstract

Large databases and Machine Learning enhance our capacity to develop models with many observations and explanatory variables. While the literature has primarily focused on optimizing classifications, little attention has been given to model risk, especially originating from inadequate use. To address this gap, we introduce a new metric for assessing model risk in credit applications. We test the metric using cross-section LASSO default models, each incorporating 200 thousand loan observations from several banks and more than 100 explanatory variables. The results indicate that models that use loans from a single bank have lower model risk than models using loans from the entire financial system. Therefore, adding loans from different banks to increase the number of observations in a model is suboptimal, challenging the widely accepted assumption that more data leads to better predictions.

Suggested Citation

  • Yoshida, Valter T. & Schiozer, Rafael & de Genaro, Alan & dos Santos, Toni R.E., 2025. "A novel credit model risk measure: Do more data lead to lower model risk?," The Quarterly Review of Economics and Finance, Elsevier, vol. 100(C).
  • Handle: RePEc:eee:quaeco:v:100:y:2025:i:c:s1062976925000018
    DOI: 10.1016/j.qref.2025.101960
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1062976925000018
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.qref.2025.101960?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Bernard, Carole & Vanduffel, Steven, 2015. "A new approach to assessing model risk in high dimensions," Journal of Banking & Finance, Elsevier, vol. 58(C), pages 166-178.
    2. Bernardus Van Doornik & Dimas Fazio & David Schoenherr & Janis Skrastins, 2022. "Unemployment Insurance as a Subsidy to Risky Firms," The Review of Financial Studies, Society for Financial Studies, vol. 35(12), pages 5535-5595.
    3. Yiping Huang & Ms. Longmei Zhang & Zhenhua Li & Han Qiu & Tao Sun & Xue Wang, 2020. "Fintech Credit Risk Assessment for SMEs: Evidence from China," IMF Working Papers 2020/193, International Monetary Fund.
    4. Guillaume Coqueret & Bertrand Tavin, 2016. "An investigation of model risk in a market with jumps and stochastic volatility," Post-Print hal-02010659, HAL.
    5. Danielsson, Jon & James, Kevin R. & Valenzuela, Marcela & Zer, Ilknur, 2016. "Model risk of risk models," Journal of Financial Stability, Elsevier, vol. 23(C), pages 79-91.
    6. Martins, Theo Cotrim & Schiozer, Rafael & Linardi, Fernando de Menezes, 2023. "The information content from lending relationships across the supply chain," Journal of Financial Intermediation, Elsevier, vol. 56(C).
    7. Joshua D. Angrist & Jörn-Steffen Pischke, 2009. "Mostly Harmless Econometrics: An Empiricist's Companion," Economics Books, Princeton University Press, edition 1, number 8769.
    8. Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
    9. Boucher, Christophe M. & Daníelsson, Jón & Kouontchou, Patrick S. & Maillet, Bertrand B., 2014. "Risk models-at-risk," Journal of Banking & Finance, Elsevier, vol. 44(C), pages 72-92.
    10. Sigrist, Fabio & Leuenberger, Nicola, 2023. "Machine learning for corporate default risk: Multi-period prediction, frailty correlation, loan portfolios, and tail probabilities," European Journal of Operational Research, Elsevier, vol. 305(3), pages 1390-1406.
    11. Fernandes, Guilherme Barreto & Artes, Rinaldo, 2016. "Spatial dependence in credit risk and its improvement in credit scoring," European Journal of Operational Research, Elsevier, vol. 249(2), pages 517-524.
    12. Coqueret, Guillaume & Tavin, Bertrand, 2016. "An investigation of model risk in a market with jumps and stochastic volatility," European Journal of Operational Research, Elsevier, vol. 253(3), pages 648-658.
    13. Tomohiro Ando & Jushan Bai, 2016. "Panel Data Models with Grouped Factor Structure Under Unknown Group Membership," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 31(1), pages 163-191, January.
    14. Niels Pedersen & Sébastien Page & Fei He, 2014. "Asset Allocation: Risk Models for Alternative Investments," Financial Analysts Journal, Taylor & Francis Journals, vol. 70(3), pages 34-45, May.
    15. Andrés Alonso & José Manuel Carbó, 2021. "Understanding the performance of machine learning models to predict credit default: a novel approach for supervisory evaluation," Working Papers 2105, Banco de España.
    16. Johannes Hoelzemann & Gustavo Manso & Abhishek Nagaraj & Matteo Tranchero, 2024. "The Streetlight Effect in Data-Driven Exploration," NBER Working Papers 32401, National Bureau of Economic Research, Inc.
    17. Theo Cotrim Martins & Rafael Schiozer & Fernando de Menezes Linardi, 2023. "The Information Content from Lending Relationships Across the Supply Chain," Working Papers Series 577, Central Bank of Brazil, Research Department.
    18. Jacopo Ponticelli & Leonardo S. Alencar, 2016. "Court Enforcement, Bank Loans, and Firm Investment: Evidence from a Bankruptcy Reform in Brazil," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 131(3), pages 1365-1413.
    19. repec:hal:journl:hal-02313399 is not listed on IDEAS
    20. Barrieu, Pauline & Scandolo, Giacomo, 2015. "Assessing financial model risk," European Journal of Operational Research, Elsevier, vol. 242(2), pages 546-556.
    21. Wall, Larry D., 2018. "Some financial regulatory implications of artificial intelligence," Journal of Economics and Business, Elsevier, vol. 100(C), pages 55-63.
    22. Hong Wang & Qingsong Xu & Lifeng Zhou, 2015. "Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble," PLOS ONE, Public Library of Science, vol. 10(2), pages 1-20, February.
    23. Frederico A. Mourad & Rafael F. Schiozer & Toni R. E. dos Santos, 2020. "Bank Loan Forbearance: evidence from a million restructured loans," Working Papers Series 541, Central Bank of Brazil, Research Department.
    24. Schiozer, Rafael F. & Oliveira, Raquel de Freitas, 2016. "Asymmetric transmission of a bank liquidity shock," Journal of Financial Stability, Elsevier, vol. 25(C), pages 234-246.
    25. Wuyi Wang & Peter C. B. Phillips & Liangjun Su, 2018. "Homogeneity pursuit in panel data models: Theory and application," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 33(6), pages 797-815, September.
    26. Lazar, Emese & Zhang, Ning, 2019. "Model risk of expected shortfall," Journal of Banking & Finance, Elsevier, vol. 105(C), pages 74-93.
    27. Dimitrios Nikolaidis & Michalis Doumpos, 2022. "Credit Scoring with Drift Adaptation Using Local Regions of Competence," SN Operations Research Forum, Springer, vol. 3(4), pages 1-28, December.
    28. Jones, Stewart & Johnstone, David & Wilson, Roy, 2015. "An empirical evaluation of the performance of binary classifiers in the prediction of credit ratings changes," Journal of Banking & Finance, Elsevier, vol. 56(C), pages 72-85.
    29. Wendun Wang & Xinyu Zhang & Richard Paap, 2019. "To pool or not to pool: What is a good strategy for parameter estimation and forecasting in panel regressions?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 34(5), pages 724-745, August.
    30. Kerkhof, Jeroen & Melenberg, Bertrand & Schumacher, Hans, 2010. "Model risk and capital reserves," Journal of Banking & Finance, Elsevier, vol. 34(1), pages 267-279, January.
    31. Cosmin L. Ilut & Martin Schneider, 2022. "Modeling Uncertainty as Ambiguity: a Review," NBER Working Papers 29915, National Bureau of Economic Research, Inc.
    32. Rama Cont, 2006. "Model Uncertainty And Its Impact On The Pricing Of Derivative Instruments," Mathematical Finance, Wiley Blackwell, vol. 16(3), pages 519-547, July.
    33. Hainmueller, Jens & Mummolo, Jonathan & Xu, Yiqing, 2019. "How Much Should We Trust Estimates from Multiplicative Interaction Models? Simple Tools to Improve Empirical Practice," Political Analysis, Cambridge University Press, vol. 27(2), pages 163-192, April.
    34. Fonseca, Julia & Van Doornik, Bernardus, 2022. "Financial development and labor market outcomes: Evidence from Brazil," Journal of Financial Economics, Elsevier, vol. 143(1), pages 550-568.
    35. Schneider, Judith C. & Schweizer, Nikolaus, 2015. "Robust measurement of (heavy-tailed) risks: Theory and implementation," Journal of Economic Dynamics and Control, Elsevier, vol. 61(C), pages 183-203.
    36. Ligang Zhou & Kin Lai & Jerome Yen, 2014. "Bankruptcy prediction using SVM models with a new approach to combine features selection and parameter optimisation," International Journal of Systems Science, Taylor & Francis Journals, vol. 45(3), pages 241-253.
    37. Rama Cont, 2006. "Model uncertainty and its impact on the pricing of derivative instruments," Post-Print halshs-00002695, HAL.
    38. Ceylan Onay & Elif Öztürk, 2018. "A review of credit scoring research in the age of Big Data," Journal of Financial Regulation and Compliance, Emerald Group Publishing Limited, vol. 26(3), pages 382-405, July.
    39. Filippo Curti & Ibrahim Ergen & Minh Le & Marco Migueis & Rob T. Stewart, 2016. "Benchmarking Operational Risk Models," Finance and Economics Discussion Series 2016-070, Board of Governors of the Federal Reserve System (U.S.).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Valter T. Yoshida Jr & Alan de Genaro & Rafael Schiozer & Toni R. E. dos Santos, 2023. "A Novel Credit Model Risk Measure: does more data lead to lower model risk in credit scoring models?," Working Papers Series 582, Central Bank of Brazil, Research Department.
    2. Farkas, Walter & Fringuellotti, Fulvia & Tunaru, Radu, 2020. "A cost-benefit analysis of capital requirements adjusted for model risk," Journal of Corporate Finance, Elsevier, vol. 65(C).
    3. Valeriane Jokhadze & Wolfgang M. Schmidt, 2020. "Measuring Model Risk In Financial Risk Management And Pricing," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 23(02), pages 1-37, April.
    4. Mohammed Berkhouch & Fernanda Maria Müller & Ghizlane Lakhnati & Marcelo Brutti Righi, 2022. "Deviation-Based Model Risk Measures," Computational Economics, Springer;Society for Computational Economics, vol. 59(2), pages 527-547, February.
    5. Righi, Marcelo Brutti & Müller, Fernanda Maria & Moresco, Marlon Ruoso, 2020. "On a robust risk measurement approach for capital determination errors minimization," Insurance: Mathematics and Economics, Elsevier, vol. 95(C), pages 199-211.
    6. Lazar, Emese & Qi, Shuyuan, 2022. "Model risk in the over-the-counter market," European Journal of Operational Research, Elsevier, vol. 298(2), pages 769-784.
    7. Marcelo Brutti Righi, 2018. "A theory for combinations of risk measures," Papers 1807.01977, arXiv.org, revised May 2023.
    8. Claußen, Arndt & Rösch, Daniel & Schmelzle, Martin, 2019. "Hedging parameter risk," Journal of Banking & Finance, Elsevier, vol. 100(C), pages 111-121.
    9. Andrés Alonso Robisco & José Manuel Carbó Martínez, 2022. "Measuring the model risk-adjusted performance of machine learning algorithms in credit default prediction," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-35, December.
    10. Radu Tunaru, 2015. "Model Risk in Financial Markets:From Financial Engineering to Risk Management," World Scientific Books, World Scientific Publishing Co. Pte. Ltd., number 9524.
    11. Yu Feng, 2019. "Non-Parametric Robust Model Risk Measurement with Path-Dependent Loss Functions," Papers 1903.00590, arXiv.org.
    12. Emese Lazar & Shuyuan Qi & Radu Tunaru, 2020. "Measures of Model Risk in Continuous-time Finance Models," Papers 2010.08113, arXiv.org, revised Oct 2020.
    13. Mariano González-Sánchez & Eva M. Ibáñez Jiménez & Ana I. Segovia San Juan, 2022. "Market and model risks: a feasible joint estimate methodology," Risk Management, Palgrave Macmillan, vol. 24(3), pages 187-213, September.
    14. Lazar, Emese & Zhang, Ning, 2019. "Model risk of expected shortfall," Journal of Banking & Finance, Elsevier, vol. 105(C), pages 74-93.
    15. Yu Feng, 2019. "Theory and Application of Model Risk Quantification," PhD Thesis, Finance Discipline Group, UTS Business School, University of Technology, Sydney, number 3-2019, January-A.
    16. Danielsson, Jon & James, Kevin R. & Valenzuela, Marcela & Zer, Ilknur, 2016. "Model risk of risk models," Journal of Financial Stability, Elsevier, vol. 23(C), pages 79-91.
    17. Christophe Boucher & Gregory Jannin & Patrick Kouontchou & Bertrand Maillet, 2013. "An Economic Evaluation of Model Risk in Long-term Asset Allocations," Review of International Economics, Wiley Blackwell, vol. 21(3), pages 475-491, August.
    18. Li, Dan & Clements, Adam & Drovandi, Christopher, 2023. "A Bayesian approach for more reliable tail risk forecasts," Journal of Financial Stability, Elsevier, vol. 64(C).
    19. Mark Cummins & Fabian Gogolin & Fearghal Kearney & Greg Kiely & Bernard Murphy, 2023. "Practice-relevant model validation: distributional parameter risk analysis in financial model risk management," Annals of Operations Research, Springer, vol. 330(1), pages 431-455, November.
    20. Simon Fritzsch & Maike Timphus & Gregor Weiss, 2021. "Marginals Versus Copulas: Which Account For More Model Risk In Multivariate Risk Forecasting?," Papers 2109.10946, arXiv.org.

    More about this item

    Keywords

    Model risk; Model selection; Credit risk; Credit scoring; Big data; Machine learning;
    All these keywords.

    JEL classification:

    • C52 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Evaluation, Validation, and Selection
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:quaeco:v:100:y:2025:i:c:s1062976925000018. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/620167 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.