IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v133y2019icp180-194.html
   My bibliography  Save this article

A new approach for credit scoring by directly maximizing the Kolmogorov–Smirnov statistic

Author

Listed:
  • Fang, Fang
  • Chen, Yuanyuan

Abstract

Credit scoring plays a critical role in many areas such as business, finance, engineering and health. The Kolmogorov–Smirnov statistic is one of the most important performance evaluation criteria for scoring methods and has been widely used in practice. However, none of the existing scoring methods deals with the Kolmogorov–Smirnov statistic directly at the modeling stage. To fill the gap, a new credit scoring method that Directly Maximizes the Kolmogorov-Smirnov statistic (DMKS) is proposed. Theoretically, the consistency of the proposed DMKS estimator is proved. Computationally, an iterative marginal optimization algorithm and a smoothed pool-adjacent-violators algorithm are proposed to overcome the computational difficulties caused by the neither smooth nor continuous objective function. Empirically, results of simulation studies and two real business examples are presented. The proposed method compares favorably with the popular existing scoring methods considering the tradeoff among predictive ability in terms of KS, computational complexity and practical interpretability.

Suggested Citation

  • Fang, Fang & Chen, Yuanyuan, 2019. "A new approach for credit scoring by directly maximizing the Kolmogorov–Smirnov statistic," Computational Statistics & Data Analysis, Elsevier, vol. 133(C), pages 180-194.
  • Handle: RePEc:eee:csdana:v:133:y:2019:i:c:p:180-194
    DOI: 10.1016/j.csda.2018.10.004
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947318302494
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2018.10.004?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Wang, Hansheng, 2007. "A note on iterative marginal optimization: a simple algorithm for maximum rank correlation estimation," Computational Statistics & Data Analysis, Elsevier, vol. 51(6), pages 2803-2812, March.
    2. Anderson, Raymond, 2007. "The Credit Scoring Toolkit: Theory and Practice for Retail Credit Risk Management and Decision Automation," OUP Catalogue, Oxford University Press, number 9780199226405.
    3. Jianguo Xu, 2017. "China's Internet Finance: A Critical Review," China & World Economy, Institute of World Economics and Politics, Chinese Academy of Social Sciences, vol. 25(4), pages 78-92, July.
    4. Yanhao Wei & Pinar Yildirim & Christophe Van den Bulte & Chrysanthos Dellarocas, 2016. "Credit Scoring with Social Network Data," Marketing Science, INFORMS, vol. 35(2), pages 234-258, March.
    5. B Baesens & T Van Gestel & S Viaene & M Stepanova & J Suykens & J Vanthienen, 2003. "Benchmarking state-of-the-art classification algorithms for credit scoring," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 54(6), pages 627-635, June.
    6. D. J. Hand & W. E. Henley, 1997. "Statistical Classification Methods in Consumer Credit Scoring: a Review," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 160(3), pages 523-541, September.
    7. White, Halbert, 1982. "Maximum Likelihood Estimation of Misspecified Models," Econometrica, Econometric Society, vol. 50(1), pages 1-25, January.
    8. de Leeuw, Jan & Hornik, Kurt & Mair, Patrick, 2009. "Isotone Optimization in R: Pool-Adjacent-Violators Algorithm (PAVA) and Active Set Methods," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 32(i05).
    9. D J Hand, 2005. "Good practice in retail credit scorecard assessment," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 56(9), pages 1109-1117, September.
    10. Alireza Hooman & Govindan Marthandan & Wan Fadzilah Wan Yusoff & Mohana Omid & Sasan Karamizadeh, 2016. "Statistical and data mining methods in credit scoring," Journal of Developing Areas, Tennessee State University, College of Business, vol. 50(5), pages 371-381, Special I.
    11. Crook, Jonathan N. & Edelman, David B. & Thomas, Lyn C., 2007. "Recent developments in consumer credit risk assessment," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1447-1465, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Hamidreza Arian & Seyed Mohammad Sina Seyfi & Azin Sharifi, 2020. "Forecasting Probability of Default for Consumer Loan Management with Gaussian Mixture Models," Papers 2011.07906, arXiv.org.
    2. Zhang, Xinyu & Liu, Chu-An, 2023. "Model averaging prediction by K-fold cross-validation," Journal of Econometrics, Elsevier, vol. 235(1), pages 280-301.
    3. Siyi Wang & Xing Yan & Bangqi Zheng & Hu Wang & Wangli Xu & Nanbo Peng & Qi Wu, 2021. "Risk and return prediction for pricing portfolios of non-performing consumer credit," Papers 2110.15102, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Crone, Sven F. & Finlay, Steven, 2012. "Instance sampling in credit scoring: An empirical study of sample size and balancing," International Journal of Forecasting, Elsevier, vol. 28(1), pages 224-238.
    2. Finlay, Steven, 2010. "Credit scoring for profitability objectives," European Journal of Operational Research, Elsevier, vol. 202(2), pages 528-537, April.
    3. L C Thomas, 2010. "Consumer finance: challenges for operational research," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(1), pages 41-52, January.
    4. Finlay, Steven, 2011. "Multiple classifier architectures and their application to credit risk assessment," European Journal of Operational Research, Elsevier, vol. 210(2), pages 368-378, April.
    5. Trivedi, Shrawan Kumar, 2020. "A study on credit scoring modeling with different feature selection and machine learning approaches," Technology in Society, Elsevier, vol. 63(C).
    6. Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.
    7. Hussein A. Abdou & John Pointon, 2011. "Credit Scoring, Statistical Techniques And Evaluation Criteria: A Review Of The Literature," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 18(2-3), pages 59-88, April.
    8. Rais Ahmad Itoo & A. Selvarasu & José António Filipe, 2015. "Loan Products and Credit Scoring by Commercial Banks (India)," International Journal of Finance, Insurance and Risk Management, International Journal of Finance, Insurance and Risk Management, vol. 5(1), pages 851-851.
    9. Huei-Wen Teng & Michael Lee, 2019. "Estimation Procedures of Using Five Alternative Machine Learning Methods for Predicting Credit Card Default," Review of Pacific Basin Financial Markets and Policies (RPBFMP), World Scientific Publishing Co. Pte. Ltd., vol. 22(03), pages 1-27, September.
    10. Richard Chamboko & Jorge Miguel Bravo, 2020. "A Multi-State Approach to Modelling Intermediate Events and Multiple Mortgage Loan Outcomes," Risks, MDPI, vol. 8(2), pages 1-29, June.
    11. Maria Rocha Sousa & João Gama & Elísio Brandão, 2013. "Introducing time-changing economics into credit scoring," FEP Working Papers 513, Universidade do Porto, Faculdade de Economia do Porto.
    12. Martin Řezáč, 2015. "ESIS2: Information Value Estimator for Credit Scoring Models," Computational Economics, Springer;Society for Computational Economics, vol. 45(2), pages 303-322, February.
    13. Anna Stelzer, 2019. "Predicting credit default probabilities using machine learning techniques in the face of unequal class distributions," Papers 1907.12996, arXiv.org.
    14. Runchi Zhang & Zhiyi Qiu, 2020. "Optimizing hyper-parameters of neural networks with swarm intelligence: A novel framework for credit scoring," PLOS ONE, Public Library of Science, vol. 15(6), pages 1-35, June.
    15. Martin Řezáč, 2011. "Advanced empirical estimate of information value for credit scoring models," Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis, Mendel University Press, vol. 59(2), pages 267-274.
    16. José Willer Prado & Valderí Castro Alcântara & Francisval Melo Carvalho & Kelly Carvalho Vieira & Luiz Kennedy Cruz Machado & Dany Flávio Tonelli, 2016. "Multivariate analysis of credit risk and bankruptcy research data: a bibliometric study involving different knowledge fields (1968–2014)," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(3), pages 1007-1029, March.
    17. Martin Rezac & Frantisek Rezac, 2011. "How to Measure the Quality of Credit Scoring Models," Czech Journal of Economics and Finance (Finance a uver), Charles University Prague, Faculty of Social Sciences, vol. 61(5), pages 486-507, November.
    18. Rais Ahmad Itoo & A. Selvarasu, 2017. "Loan products and Credit Scoring Methods by Commercial Banks," International Journal of Finance, Insurance and Risk Management, International Journal of Finance, Insurance and Risk Management, vol. 7(1), pages 1297-1297.
    19. Dangxing Chen & Weicheng Ye & Jiahui Ye, 2022. "Interpretable Selective Learning in Credit Risk," Papers 2209.10127, arXiv.org.
    20. Jonathan K. Budd & Peter G. Taylor, 2015. "Calculating optimal limits for transacting credit card customers," Papers 1506.05376, arXiv.org, revised Aug 2015.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:133:y:2019:i:c:p:180-194. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.