IDEAS home Printed from https://ideas.repec.org/a/nbp/nbpbik/v53y2022i6p587-604.html

Symbolic data analysis as a tool for credit fraud detection

Authors

  • Andrzej Dudek

    (Uniwersytet Ekonomiczny we Wrocławiu)

  • Marcin Pełka

    (Uniwersytet Ekonomiczny we Wrocławiu)

Abstract

Money fraud is arguably as old as money itself. New technologies give criminals new ways to commit fraud, and also provide new methods to prevent it. Fraud detection is the process of deciding whether a newly authorised transaction is fraudulent or genuine (Maes et al. 2002). Many classical methods can be used to detect fraud. This paper proposes applying symbolic data analysis methods, which describe objects in a more precise and complex way, to the credit card fraud detection problem. The main hypothesis is that the decision tree for symbolic data is a better tool for credit card fraud detection than other methods. Unlike classical data analysis, symbolic data analysis makes it possible to take into account the variability and uncertainty in the data, and it provides suitable methods and techniques for dealing with such data (see: Bock, Diday 2000; Billard, Diday 2006). The first part introduces the problem of credit card fraud detection and reviews the literature on it. The second part presents the basic ideas of symbolic data analysis and describes the models applied in the empirical part (decision tree, logistic regression, k-nearest neighbour method and kernel discriminant analysis, each for symbolic data). The third part presents the results of credit card fraud detection. A data set of 284,807 card transactions, 492 of them fraudulent, is used to build all models. The results show that, for a single model, decision trees usually perform slightly better than the other methods in the symbolic data case. The last part presents the final remarks.
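
As a rough illustration of what describing objects "in a more complex way" can mean, the sketch below aggregates classical transaction rows into interval-valued symbolic objects, one per group, where each variable is stored as a (min, max) interval. This is only one common way of building symbolic data and is not taken from the paper, whose own computations were done in R. The function name to_interval_objects, the field names (card, amount, hour) and the toy rows are hypothetical. The last line only restates the class imbalance implied by the counts quoted in the abstract.

```python
from collections import defaultdict


def to_interval_objects(rows, key, variables):
    """Aggregate classical rows into interval-valued symbolic objects.

    Rows sharing the same `key` value form one symbolic object; each
    variable of that object is the interval (min, max) over the group.
    """
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row)
    return {
        group: {
            v: (min(r[v] for r in members), max(r[v] for r in members))
            for v in variables
        }
        for group, members in groups.items()
    }


# Hypothetical toy transactions (not from the paper's data set).
transactions = [
    {"card": "A", "amount": 12.5, "hour": 9},
    {"card": "A", "amount": 80.0, "hour": 21},
    {"card": "B", "amount": 5.0, "hour": 14},
]

symbolic = to_interval_objects(transactions, "card", ["amount", "hour"])

# Share of fraudulent transactions implied by the abstract's counts.
fraud_share = 492 / 284807
```

Here symbolic["A"]["amount"] holds the whole range of amounts seen for card A instead of a single average, which is the kind of within-object variability that symbolic methods are designed to keep. The fraud share works out to well under one percent, so any classifier built on such data faces a strongly imbalanced problem.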

Suggested Citation

  • Andrzej Dudek & Marcin Pełka, 2022. "Symbolic data analysis as a tool for credit fraud detection," Bank i Kredyt, Narodowy Bank Polski, vol. 53(6), pages 587-604.
  • Handle: RePEc:nbp:nbpbik:v:53:y:2022:i:6:p:587-604

    Download full text from publisher

    File URL: https://bankikredyt.nbp.pl/content/2022/06/bik_06_2022_02.pdf
    Download Restriction: no

    More about this item

    Keywords

    credit card; fraud detection; symbolic data; machine learning; R software;

    JEL classification:

    • G2 - Financial Economics - - Financial Institutions and Services
    • C02 - Mathematical and Quantitative Methods - - General - - - Mathematical Economics
    • C19 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Other
    • C38 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Classification Methods; Cluster Analysis; Principal Components; Factor Analysis
