IDEAS home Printed from https://ideas.repec.org/a/psc/journl/v12y2020i3p301-316.html
   My bibliography  Save this article

Feature Engineering for Anti-Fraud Models Based on Anomaly Detection

Author

Listed:
  • Damian Przekop

    (Warsaw School of Economics)

Abstract

The paper presents two algorithms as a solution to the problem of identifying fraud intentions of a customer. Their purpose is to generate variables that contribute to fraud models’ predictive power improvement. In this article, a novel approach to the feature engineering, based on anomaly detection, is presented. As the choice of statistical model used in the research improves predictive capabilities of a solution to some extent, most of the attention should be paid to the choice of proper predictors. The main finding of the research is that model enrichment with additional predictors leads to the further improvement of predictive power and better interpretability of anti-fraud model. The paper is a contribution to the fraud prediction problem but the method presented may generate variable input to every tool equipped with variableselection algorithm. The cost is the increased complexity of the models obtained. The approach is illustrated on a dataset from one of the European banks.

Suggested Citation

  • Damian Przekop, 2020. "Feature Engineering for Anti-Fraud Models Based on Anomaly Detection," Central European Journal of Economic Modelling and Econometrics, Central European Journal of Economic Modelling and Econometrics, vol. 12(3), pages 301-316, September.
  • Handle: RePEc:psc:journl:v:12:y:2020:i:3:p:301-316
    as

    Download full text from publisher

    File URL: http://cejeme.org/publishedarticles/2020-54-10-637353356678060736-9142.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Douglas M. Hawkins, 1980. "Critical Values for Identifying Outliers," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 29(1), pages 95-96, March.
    2. Hartmann-Wendels, Thomas & Mählmann, Thomas & Versen, Tobias, 2009. "Determinants of banks' risk exposure to new account fraud - Evidence from Germany," Journal of Banking & Finance, Elsevier, vol. 33(2), pages 347-357, February.
    3. Yufei Jin & Roderick Rejesus & Bertis Little, 2005. "Binary choice models for rare events data: a crop insurance fraud application," Applied Economics, Taylor & Francis Journals, vol. 37(7), pages 841-848.
    4. Belinna Bai & Jerome Yen & Xiaoguang Yang, 2008. "False Financial Statements: Characteristics Of China'S Listed Companies And Cart Detecting Approach," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 7(02), pages 339-359.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Blackman, Allen & Guerrero, Santiago, 2012. "What drives voluntary eco-certification in Mexico?," Journal of Comparative Economics, Elsevier, vol. 40(2), pages 256-268.
    2. Ashok Mishra & Barry Goodwin, 2006. "Revenue insurance purchase decisions of farmers," Applied Economics, Taylor & Francis Journals, vol. 38(2), pages 149-159.
    3. Qimeng Pan & Lysa Porth & Hong Li, 2022. "Assessing the Effectiveness of the Actuaries Climate Index for Estimating the Impact of Extreme Weather on Crop Yield and Insurance Applications," Sustainability, MDPI, vol. 14(11), pages 1-24, June.
    4. Schlueter, Tobias & Sievers, Soenke & Hartmann-Wendels, Thomas, 2015. "Bank funding stability, pricing strategies and the guidance of depositors," Journal of Banking & Finance, Elsevier, vol. 51(C), pages 43-61.
    5. Francesca Ieva & Anna Maria Paganoni, 2020. "Component-wise outlier detection methods for robustifying multivariate functional samples," Statistical Papers, Springer, vol. 61(2), pages 595-614, April.
    6. Maria Figueroa-Armijos & John P. Berns, 2022. "Vulnerable Populations and Individual Social Responsibility in Prosocial Crowdfunding: Does the Framing Matter for Female and Rural Entrepreneurs?," Journal of Business Ethics, Springer, vol. 177(2), pages 377-394, May.
    7. Gaucher, Solenne & Klopp, Olga & Robin, Geneviève, 2021. "Outlier detection in networks with missing links," Computational Statistics & Data Analysis, Elsevier, vol. 164(C).
    8. Andrzej Chmielowiec, 2021. "Algorithm for error-free determination of the variance of all contiguous subsequences and fixed-length contiguous subsequences for a sequence of industrial measurement data," Computational Statistics, Springer, vol. 36(4), pages 2813-2840, December.
    9. Marc Chataigner & Stéphane Crépey & Jiang Pu, 2020. "Nowcasting Networks," Post-Print hal-03910123, HAL.
    10. Greco, Salvatore & Ishizaka, Alessio & Tasiou, Menelaos & Torrisi, Gianpiero, 2019. "Sigma-Mu efficiency analysis: A methodology for evaluating units through composite indicators," European Journal of Operational Research, Elsevier, vol. 278(3), pages 942-960.
    11. David Juárez-Varón & Victoria Tur-Viñes & Alejandro Rabasa-Dolado & Kristina Polotskaya, 2020. "An Adaptive Machine Learning Methodology Applied to Neuromarketing Analysis: Prediction of Consumer Behaviour Regarding the Key Elements of the Packaging Design of an Educational Toy," Social Sciences, MDPI, vol. 9(9), pages 1-23, September.
    12. Stéphane Crépey & Lehdili Noureddine & Nisrine Madhar & Maud Thomas, 2022. "Anomaly Detection on Financial Time Series by Principal Component Analysis and Neural Networks," Working Papers hal-03777995, HAL.
    13. Sharma, Sankalp & Walters, Cory G., 2018. "Influence of Farm and Lease Type on Crop Insurance Returns," 2018 Annual Meeting, August 5-7, Washington, D.C. 273881, Agricultural and Applied Economics Association.
    14. Zhongqiu Wang & Guan Yuan & Haoran Pei & Yanmei Zhang & Xiao Liu, 2020. "Unsupervised learning trajectory anomaly detection algorithm based on deep representation," International Journal of Distributed Sensor Networks, , vol. 16(12), pages 15501477209, December.
    15. M. Ritter & O. Mußhoff & M. Odening, 2014. "Minimizing Geographical Basis Risk of Weather Derivatives Using A Multi-Site Rainfall Model," Computational Economics, Springer;Society for Computational Economics, vol. 44(1), pages 67-86, June.
    16. Pulina, Manuela & Paba, Antonello, 2010. "A discrete choice approach to model credit card fraud," MPRA Paper 20019, University Library of Munich, Germany.
    17. Arata, Linda & Fabrizi, Enrico & Sckokai, Paolo, 2020. "A worldwide analysis of trend in crop yields and yield variability: Evidence from FAO data," Economic Modelling, Elsevier, vol. 90(C), pages 190-208.
    18. Wentao Yang & Huaxi He & Dongsheng Wei & Hao Chen, 2022. "Generating pseudo-absence samples of invasive species based on outlier detection in the geographical characteristic space," Journal of Geographical Systems, Springer, vol. 24(2), pages 261-279, April.
    19. Elias Zavitsanos & Dimitris Mavroeidis & Konstantinos Bougiatiotis & Eirini Spyropoulou & Lefteris Loukas & Georgios Paliouras, 2023. "Financial misstatement detection: a realistic evaluation," Papers 2305.17457, arXiv.org.
    20. Bellucci, Andrea & Borisov, Alexander & Zazzaro, Alberto, 2010. "Does gender matter in bank-firm relationships? Evidence from small business lending," Journal of Banking & Finance, Elsevier, vol. 34(12), pages 2968-2984, December.

    More about this item

    Keywords

    fraud detection; application fraud; feature engineering; anomaly detection; risk modeling;
    All these keywords.

    JEL classification:

    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:psc:journl:v:12:y:2020:i:3:p:301-316. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Damian Jelito (email available below). General contact details of provider: http://cejeme.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.