Shallow Self-Learning for Reject Inference in Credit Scoring

Author

Listed:

Nikita Kozodoi
Panagiotis Katsas
Stefan Lessmann
Luis Moreira-Matias
Konstantinos Papakonstantinou

Abstract

Credit scoring models support loan approval decisions in the financial services industry. Lenders train these models on data from previously granted credit applications, where the borrowers' repayment behavior has been observed. This approach creates sampling bias: the scoring model (i.e., classifier) is trained on accepted cases only, so applying it to screen credit applications from the population of all borrowers degrades its performance. Reject inference comprises techniques that overcome sampling bias by assigning labels to rejected cases. The paper makes two contributions. First, we propose a self-learning framework for reject inference. The framework is geared toward real-world credit scoring requirements by using distinct training regimes for iterative labeling and model training. Second, we introduce a new measure to assess the effectiveness of reject inference strategies. Our measure leverages domain knowledge to avoid artificial labeling of rejected cases during strategy evaluation. We demonstrate that this approach offers a robust and operational assessment of reject inference strategies. Experiments on a real-world credit scoring data set confirm the superiority of the adjusted self-learning framework over regular self-learning and previous reject inference strategies. We also find strong evidence that the proposed evaluation measure assesses reject inference strategies more reliably, which raises the performance of the eventual credit scoring model.
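
To make the idea of self-learning for reject inference more concrete, the following is a minimal sketch of generic self-training, not the adjusted framework proposed in the paper. It assumes a plain logistic regression fitted by gradient descent, a label convention of 1 for a bad (defaulting) loan, and confidence thresholds of 0.1 and 0.9. All function names, parameters, and the synthetic one-feature data are hypothetical and chosen only for illustration.

```python
import math
import random


def sigmoid(z):
    """Logistic function, written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_logistic(X, y, epochs=300, lr=0.1):
    """Fit a plain logistic regression by batch gradient descent."""
    n_features = len(X[0])
    w = [0.0] * n_features
    b = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
            grad_b += err
        w = [wj - lr * g / len(X) for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / len(X)
    return w, b


def predict_proba(model, x):
    """Predicted probability that applicant x is a bad loan (label 1)."""
    w, b = model
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)


def self_learning(X_acc, y_acc, X_rej, n_iter=3, low=0.1, high=0.9):
    """Generic self-training: label confident rejects, then retrain.

    The thresholds `low` and `high` and the iteration count are
    illustrative assumptions, not values taken from the paper.
    """
    X_train, y_train = list(X_acc), list(y_acc)
    unlabeled = list(X_rej)
    model = fit_logistic(X_train, y_train)
    for _ in range(n_iter):
        remaining = []
        added = 0
        for x in unlabeled:
            p = predict_proba(model, x)
            if p >= high:        # confidently bad: pseudo-label 1
                X_train.append(x)
                y_train.append(1)
                added += 1
            elif p <= low:       # confidently good: pseudo-label 0
                X_train.append(x)
                y_train.append(0)
                added += 1
            else:                # uncertain: keep unlabeled
                remaining.append(x)
        unlabeled = remaining
        if added == 0:           # nothing new to learn from
            break
        model = fit_logistic(X_train, y_train)
    return model, X_train, y_train, unlabeled


# Synthetic toy data: accepted applicants have higher feature values
# (lower risk) than rejected ones, which mimics the sampling bias.
random.seed(0)
X_acc = [[random.gauss(1.0, 1.0)] for _ in range(60)]
y_acc = [1 if random.random() < sigmoid(-2.0 * x[0]) else 0 for x in X_acc]
X_rej = [[random.gauss(-1.0, 1.0)] for _ in range(40)]

model, X_train, y_train, unlabeled = self_learning(X_acc, y_acc, X_rej)
print(len(X_train) - len(X_acc), "rejects pseudo-labeled;",
      len(unlabeled), "left unlabeled")
```

This sketch uses the same learner for labeling rejects and for the final scoring model. The abstract indicates that the proposed framework instead considers distinct training regimes for iterative labeling and model training, so the loop above should be read only as the baseline notion of "regular self-learning".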

Suggested Citation

Nikita Kozodoi & Panagiotis Katsas & Stefan Lessmann & Luis Moreira-Matias & Konstantinos Papakonstantinou, 2019. "Shallow Self-Learning for Reject Inference in Credit Scoring," Papers 1909.06108, arXiv.org.

Handle: RePEc:arx:papers:1909.06108

References listed on IDEAS

J Banasik & J Crook, 2005. "Credit scoring, augmentation and lean models," Journal of the Operational Research Society, Palgrave Macmillan; The OR Society, vol. 56(9), pages 1072-1081, September.
J Banasik & J Crook & L Thomas, 2003. "Sample selection bias in credit scoring models," Journal of the Operational Research Society, Palgrave Macmillan; The OR Society, vol. 54(8), pages 822-832, August.
J Banasik & J Crook, 2010. "Reject inference in survival analysis by augmentation," Journal of the Operational Research Society, Palgrave Macmillan; The OR Society, vol. 61(3), pages 473-485, March.

Citations

Citations are extracted by the CitEc Project.

Cited by:

Kozodoi, Nikita & Lessmann, Stefan & Alamgir, Morteza & Moreira-Matias, Luis & Papakonstantinou, Konstantinos, 2025. "Fighting sampling bias: A framework for training and evaluating credit scoring models," European Journal of Operational Research, Elsevier, vol. 324(2), pages 616-628.
Mengnan Song & Jiasong Wang & Suisui Su, 2022. "Towards a Better Microcredit Decision," Papers 2209.07574, arXiv.org.
Monir El Annas & Badreddine Benyacoub & Mohamed Ouzineb, 2023. "Semi-supervised adapted HMMs for P2P credit scoring systems with reject inference," Computational Statistics, Springer, vol. 38(1), pages 149-169, March.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Hussein A. Abdou & John Pointon, 2011. "Credit Scoring, Statistical Techniques And Evaluation Criteria: A Review Of The Literature," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 18(2-3), pages 59-88, April.
Tang, Haoxin & Liang, Decui, 2025. "Multi-view reject inference for semi-supervised credit scoring with consistency training and three-way decision," Omega, Elsevier, vol. 133(C).
Rogelio A. Mancisidor & Michael Kampffmeyer & Kjersti Aas & Robert Jenssen, 2019. "Deep Generative Models for Reject Inference in Credit Scoring," Papers 1904.11376, arXiv.org, revised Sep 2021.
Ha-Thu Nguyen, 2015. "How is credit scoring used to predict default in China?," EconomiX Working Papers 2015-1, University of Paris Nanterre, EconomiX.
Banasik, John & Crook, Jonathan, 2007. "Reject inference, augmentation, and sample selection," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1582-1594, December.
Kozodoi, Nikita & Lessmann, Stefan & Alamgir, Morteza & Moreira-Matias, Luis & Papakonstantinou, Konstantinos, 2025. "Fighting sampling bias: A framework for training and evaluating credit scoring models," European Journal of Operational Research, Elsevier, vol. 324(2), pages 616-628.
Ha Thu Nguyen, 2015. "How is credit scoring used to predict default in China?," Working Papers hal-04133309, HAL.
Monir El Annas & Badreddine Benyacoub & Mohamed Ouzineb, 2023. "Semi-supervised adapted HMMs for P2P credit scoring systems with reject inference," Computational Statistics, Springer, vol. 38(1), pages 149-169, March.
Calabrese, Raffaella & Osmetti, Silvia Angela & Zanin, Luca, 2024. "Sample selection bias in non-traditional lending: A copula-based approach for imbalanced data," Socio-Economic Planning Sciences, Elsevier, vol. 95(C).
Zhiyong Li & Xinyi Hu & Ke Li & Fanyin Zhou & Feng Shen, 2020. "Inferring the outcomes of rejected loans: an application of semisupervised clustering," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(2), pages 631-654, February.
J Banasik & J Crook, 2010. "Reject inference in survival analysis by augmentation," Journal of the Operational Research Society, Palgrave Macmillan; The OR Society, vol. 61(3), pages 473-485, March.
Ha-Thu Nguyen, 2016. "Reject inference in application scorecards: evidence from France," EconomiX Working Papers 2016-10, University of Paris Nanterre, EconomiX.
Karol Przanowski, 2014. "Credit acceptance process strategy case studies - the power of Credit Scoring," Papers 1403.6531, arXiv.org.
Dong-Her Shih & Ting-Wei Wu & Po-Yuan Shih & Nai-An Lu & Ming-Hung Shih, 2022. "A Framework of Global Credit-Scoring Modeling Using Outlier Detection and Machine Learning in a P2P Lending Platform," Mathematics, MDPI, vol. 10(13), pages 1-13, June.
Y Kim & S Y Sohn, 2007. "Technology scoring model considering rejected applicants and effect of reject inference," Journal of the Operational Research Society, Palgrave Macmillan; The OR Society, vol. 58(10), pages 1341-1347, October.
Evžen Kocenda & Martin Vojtek, 2011. "Default Predictors in Retail Credit Scoring: Evidence from Czech Banking Data," Emerging Markets Finance and Trade, Taylor & Francis Journals, vol. 47(6), pages 80-98, November.
- Evžen Kočenda & Martin Vojtek, 2009. "Default Predictors and Credit Scoring Models for Retail Banking," CESifo Working Paper Series 2862, CESifo.
- Evzen Kocenda & Martin Vojtek, 2011. "Default Predictors in Retail Credit Scoring: Evidence from Czech Banking Data," William Davidson Institute Working Papers Series wp1015, William Davidson Institute at the University of Michigan.
Ha Thu Nguyen, 2016. "Reject inference in application scorecards: evidence from France," Working Papers hal-04141601, HAL.
Silva, Diego M.B. & Pereira, Gustavo H.A. & Magalhães, Tiago M., 2022. "A class of categorization methods for credit scoring models," European Journal of Operational Research, Elsevier, vol. 296(1), pages 323-331.
Chen, Liao & Jia, Ning & Jiao, Zhixian & Zhao, Hongke & Cui, Runbang & Wang, Huimin, 2025. "A semi-supervised reject inference framework with hierarchical heterogeneous networks for credit scoring," International Journal of Forecasting, Elsevier, vol. 41(3), pages 920-939.
Crook, Jonathan N. & Edelman, David B. & Thomas, Lyn C., 2007. "Recent developments in consumer credit risk assessment," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1447-1465, December.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2019-09-23 (Big Data)

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.
