IDEAS home Printed from https://ideas.repec.org/a/csb/stintr/v17y2016i2p265-280.html
   My bibliography  Save this article

Heteroscedastic Discriminant Analysis Combined With Feature Selection For Credit Scoring

Author

Listed:
  • Tomasz Smolarczyk
  • Katarzyna Stąpor
  • Piotr Fabian

Abstract

Credit granting is a fundamental question and one of the most complex tasks that every credit institution is faced with. Typically, credit scoring databases are often large and characterized by redundant and irrelevant features. An effective classification model will objectively help managers instead of intuitive experience. This study proposes an approach for building a credit scoring model based on the combination of heteroscedastic extension (Loog, Duin, 2002) of classical Fisher Linear Discriminant Analysis (Fisher, 1936, Krzyśko, 1990) and a feature selection algorithm that retains sufficient information for classification purpose. We have tested five feature subset selection algorithms: two filters and three wrappers. To evaluate the accuracy of the proposed credit scoring model and to compare it with the existing approaches we have used the German credit data set from the study (Chen, Li, 2010). The results of our study suggest that the proposed hybrid approach is an effective and promising method for building credit scoring models.

Suggested Citation

  • Tomasz Smolarczyk & Katarzyna Stąpor & Piotr Fabian, 2016. "Heteroscedastic Discriminant Analysis Combined With Feature Selection For Credit Scoring," Statistics in Transition new series, Główny Urząd Statystyczny (Polska), vol. 17(2), pages 265-280, June.
  • Handle: RePEc:csb:stintr:v:17:y:2016:i:2:p:265-280
    as

    Download full text from publisher

    File URL: http://index.stat.gov.pl/repec/files/csb/stintr/csb_stintr_v17_2016_i2_n8.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Thomas, Lyn C., 2000. "A survey of credit and behavioural scoring: forecasting financial risk of lending to consumers," International Journal of Forecasting, Elsevier, vol. 16(2), pages 149-172.
    2. Pacheco, Joaquin & Casado, Silvia & Nunez, Laura & Gomez, Olga, 2006. "Analysis of new variable selection methods for discriminant analysis," Computational Statistics & Data Analysis, Elsevier, vol. 51(3), pages 1463-1478, December.
    3. Crook, Jonathan N. & Edelman, David B. & Thomas, Lyn C., 2007. "Recent developments in consumer credit risk assessment," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1447-1465, December.
    4. L C Thomas & R W Oliver & D J Hand, 2005. "A survey of the issues in consumer credit modelling research," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 56(9), pages 1006-1015, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. A. Iduseri & J. E. Osemwenkhae, 2018. "A New Approach for Improving Classification Accuracy in Predictive Discriminant Analysis," Annals of Data Science, Springer, vol. 5(3), pages 339-357, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Stąpor Katarzyna & Smolarczyk Tomasz & Fabian Piotr, 2016. "Heteroscedastic Discriminant Analysis Combined with Feature Selection for Credit Scoring," Statistics in Transition New Series, Polish Statistical Association, vol. 17(2), pages 265-280, June.
    2. Katarzyna Stąpor & Tomasz Smolarczyk & Piotr Fabian, 2016. "Heteroscedastic Discriminant Analysis Combined With Feature Selection For Credit Scoring," Statistics in Transition New Series, Polish Statistical Association, vol. 17(2), pages 265-280, June.
    3. Crone, Sven F. & Finlay, Steven, 2012. "Instance sampling in credit scoring: An empirical study of sample size and balancing," International Journal of Forecasting, Elsevier, vol. 28(1), pages 224-238.
    4. L C Thomas, 2010. "Consumer finance: challenges for operational research," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(1), pages 41-52, January.
    5. Naveed Chehrazi & Peter W. Glynn & Thomas A. Weber, 2019. "Dynamic Credit-Collections Optimization," Management Science, INFORMS, vol. 67(6), pages 2737-2769, June.
    6. Thomas Wainwright, 2011. "Elite Knowledges: Framing Risk and the Geographies of Credit," Environment and Planning A, , vol. 43(3), pages 650-665, March.
    7. K Rajaratnam & P Beling & G Overstreet, 2010. "Scoring decisions in the context of economic uncertainty," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(3), pages 421-429, March.
    8. Huseyin Ince & Bora Aktan, 2009. "A comparison of data mining techniques for credit scoring in banking: A managerial perspective," Journal of Business Economics and Management, Taylor & Francis Journals, vol. 10(3), pages 233-240, March.
    9. Ting Sun & Miklos A. Vasarhelyi, 2018. "Predicting credit card delinquencies: An application of deep neural networks," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 25(4), pages 174-189, October.
    10. repec:syb:wpbsba:03/2013 is not listed on IDEAS
    11. Hussein A. Abdou & John Pointon, 2011. "Credit Scoring, Statistical Techniques And Evaluation Criteria: A Review Of The Literature," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 18(2-3), pages 59-88, April.
    12. Pulina, Manuela & Paba, Antonello, 2010. "A discrete choice approach to model credit card fraud," MPRA Paper 20019, University Library of Munich, Germany.
    13. Lobna Abid & Afif Masmoudi & Sonia Zouari-Ghorbel, 2018. "The Consumer Loan’s Payment Default Predictive Model: an Application of the Logistic Regression and the Discriminant Analysis in a Tunisian Commercial Bank," Journal of the Knowledge Economy, Springer;Portland International Center for Management of Engineering and Technology (PICMET), vol. 9(3), pages 948-962, September.
    14. Huei-Wen Teng & Michael Lee, 2019. "Estimation Procedures of Using Five Alternative Machine Learning Methods for Predicting Credit Card Default," Review of Pacific Basin Financial Markets and Policies (RPBFMP), World Scientific Publishing Co. Pte. Ltd., vol. 22(03), pages 1-27, September.
    15. Zhixin Liu & Ping He & Bo Chen, 2019. "A Markov decision model for consumer term-loan collections," Review of Quantitative Finance and Accounting, Springer, vol. 52(4), pages 1043-1064, May.
    16. Lu Gao & Kanshukan Rajaratnam & Peter Beling, 2016. "Loan origination decisions using a multinomial scorecard," Annals of Operations Research, Springer, vol. 243(1), pages 199-210, August.
    17. Finlay, Steven, 2010. "Credit scoring for profitability objectives," European Journal of Operational Research, Elsevier, vol. 202(2), pages 528-537, April.
    18. Akkoç, Soner, 2012. "An empirical comparison of conventional techniques, neural networks and the three stage hybrid Adaptive Neuro Fuzzy Inference System (ANFIS) model for credit scoring analysis: The case of Turkish cred," European Journal of Operational Research, Elsevier, vol. 222(1), pages 168-178.
    19. Naveed Chehrazi & Thomas A. Weber, 2015. "Dynamic Valuation of Delinquent Credit-Card Accounts," Management Science, INFORMS, vol. 61(12), pages 3077-3096, December.
    20. Tomáš Vaněk & David Hampel, 2017. "The Probability of Default Under IFRS 9: Multi-period Estimation and Macroeconomic Forecast," Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis, Mendel University Press, vol. 65(2), pages 759-776.
    21. Vivek Kumar Singh & Burcin Bozkaya & Alex Pentland, 2015. "Money Walks: Implicit Mobility Behavior and Financial Well-Being," PLOS ONE, Public Library of Science, vol. 10(8), pages 1-17, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:csb:stintr:v:17:y:2016:i:2:p:265-280. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Beata Witek (email available below). General contact details of provider: https://edirc.repec.org/data/gusgvpl.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.