Printed from https://ideas.repec.org/a/gam/jmathe/v13y2025i4p642-d1592301.html

Advanced Tax Fraud Detection: A Soft-Voting Ensemble Based on GAN and Encoder Architecture

Authors
  • Masad A. Alrasheedi

    (Department of Management Information Systems, College of Business Administration, Taibah University, Al-Madinah Al-Munawara 42353, Saudi Arabia)

  • Samia Ijaz

    (Department of Computer Science, HITEC University, Taxila 47080, Pakistan)

  • Ayed M. Alrashdi

    (Department of Electrical Engineering, College of Engineering, University of Ha’il, Ha’il 81441, Saudi Arabia)

  • Seung-Won Lee

    (Department of Precision Medicine, Sungkyunkwan University School of Medicine, Suwon 16419, Republic of Korea;
    Department of Metabiohealth, Sungkyunkwan University, Suwon 16419, Republic of Korea;
    Personalized Cancer Immunotherapy Research Center, Sungkyunkwan University School of Medicine, Suwon 16419, Republic of Korea;
    Department of Artificial Intelligence, Sungkyunkwan University, Suwon 16419, Republic of Korea)

Abstract

Authorized and fraudulent transactions coexist worldwide, which makes the two difficult to tell apart. Because fraudulent transactions are only a small percentage of the total, they also give rise to the class imbalance problem. Tax systems therefore need a sufficiently robust fraud detection mechanism to avoid collapse. Datasets, and tax return datasets in particular, have become significantly harder to obtain because of the rising importance of privacy and a general reluctance to share personal information. We therefore synthesize our dataset from publicly available data and enhance it with Correlational Generative Adversarial Networks (CGANs) and the Synthetic Minority Oversampling Technique (SMOTE). The proposed method includes a preprocessing stage that denoises the data, identifies anomalies and outliers, and performs dimensionality reduction. The data are then enhanced using SMOTE and the proposed CGAN technique. A unique encoder design is proposed to expose the hidden patterns among legitimate and fraudulent records. This research found that anomalous deductions, income inconsistencies, recurrent transaction manipulations, and irregular filing practices distinguish fraudulent from valid tax records. These patterns are identified by encoder-based feature extraction and synthetic data augmentation. Several machine learning classifiers, along with a voting ensemble technique, were used both with and without data augmentation. Experimental results show that the proposed Soft-Voting technique outperformed the baseline classifiers used without an ensemble method.
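
Two of the techniques named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the CGAN and the encoder are omitted, and the function names, the example probabilities, and the two-class layout [P(legitimate), P(fraudulent)] are assumptions made for illustration only. The first function shows the interpolation step at the core of SMOTE; the second shows soft voting, which averages the class probabilities of several classifiers instead of counting their hard votes.

```python
import random


def smote_like_sample(a, b, rng):
    """Create one synthetic minority point on the segment between a and b.

    This is only the interpolation step of SMOTE; choosing b among the
    nearest minority-class neighbours of a is left out of this sketch.
    """
    gap = rng.random()  # random position along the segment, in [0, 1)
    return [x + gap * (y - x) for x, y in zip(a, b)]


def soft_vote(probabilities):
    """Average per-class probabilities across classifiers.

    probabilities: one list per classifier, each holding one
    probability per class. Returns (predicted_class, averaged_probs).
    """
    n_models = len(probabilities)
    n_classes = len(probabilities[0])
    averaged = [
        sum(p[c] for p in probabilities) / n_models for c in range(n_classes)
    ]
    label = max(range(n_classes), key=lambda c: averaged[c])
    return label, averaged


# Hypothetical outputs of three classifiers for a single tax record,
# ordered as [P(legitimate), P(fraudulent)].
model_probs = [[0.55, 0.45], [0.52, 0.48], [0.10, 0.90]]
label, averaged = soft_vote(model_probs)

# Hypothetical minority-class (fraudulent) feature vectors.
synthetic = smote_like_sample([0.0, 2.0], [1.0, 4.0], random.Random(0))
```

In this made-up example two of the three classifiers lean slightly towards "legitimate", so a majority (hard) vote would pick class 0, while the averaged probabilities favour class 1 because the third classifier is far more confident. That sensitivity to confidence is the usual motivation for soft voting.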

Suggested Citation

  • Masad A. Alrasheedi & Samia Ijaz & Ayed M. Alrashdi & Seung-Won Lee, 2025. "Advanced Tax Fraud Detection: A Soft-Voting Ensemble Based on GAN and Encoder Architecture," Mathematics, MDPI, vol. 13(4), pages 1-29, February.
  • Handle: RePEc:gam:jmathe:v:13:y:2025:i:4:p:642-:d:1592301

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/4/642/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/4/642/
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. John R. J. Thompson & Longlong Feng & R. Mark Reesor & Chuck Grace, 2021. "Know Your Clients’ Behaviours: A Cluster Analysis of Financial Transactions," JRFM, MDPI, vol. 14(2), pages 1-29, January.
    2. R.D. Asanka Maithreerathna & P. Chamika Mummullage & Athula Naranpanawa & Chandika Gunasinghe, 2019. "An Empirical Analysis of the Impact of Total Debt on the Economic Growth of Sri Lanka," Discussion Papers in Economics economics:201903, Griffith University, Department of Accounting, Finance and Economics.
    3. Dellas, Harris & Malliaropulos, Dimitris & Papageorgiou, Dimitris & Vourvachaki, Evangelia, 2024. "Fiscal policy with an informal sector," Journal of Economic Dynamics and Control, Elsevier, vol. 160(C).
    4. Guilherme Bandeira & Evi Pappa & Rana Sajedi & Eugenia Vella, 2018. "Fiscal Consolidation in a Low-Inflation Environment: Pay Cuts versus Lost Jobs," International Journal of Central Banking, International Journal of Central Banking, vol. 14(3), pages 7-52, June.
    5. Keita, Kady & Rabaud, Isabelle & Turcu, Camelia, 2023. "Fiscal outcomes, current account imbalances, and institutions in Europe: Exploring nonlinearities," International Economics, Elsevier, vol. 175(C), pages 121-134.
    6. Bacha, Radia & Gasmi, Farid, 2022. "The broadband diffusion process and its determinants in Algeria: A simultaneous estimation," TSE Working Papers 22-1309, Toulouse School of Economics (TSE).
    7. Freitas, Bruno, 2020. "Labour Share Heterogeneity and Fiscal Consolidation Programs," MPRA Paper 98973, University Library of Munich, Germany.
    8. Xu, Chang & Jin, Long, 2024. "Effects of government digitalization on firm investment efficiency: Evidence from China," International Review of Economics & Finance, Elsevier, vol. 92(C), pages 819-834.
    9. Yanos Zylberberg & Francesco Pappada, 2014. "Austerity plans and tax evasion : theory and evidence from Greece," 2014 Meeting Papers 1031, Society for Economic Dynamics.
    10. Thibault Lemaire, 2020. "Fiscal Consolidations and Informality in Latin America and the Caribbean," Post-Print halshs-02492309, HAL.
    11. Emilio Colombo & Davide Furceri & Pietro Pizzuto & Patrizio Tirelli, 2022. "Fiscal Multipliers and Informality," DISEIS - Quaderni del Dipartimento di Economia internazionale, delle istituzioni e dello sviluppo dis2201, Università Cattolica del Sacro Cuore, Dipartimento di Economia internazionale, delle istituzioni e dello sviluppo (DISEIS).
    12. Habib Saragih, Arfah & Ali, Syaiful & Suwardi, Eko & Utomo, Hargo, 2024. "Finding the missing pieces to an optimal corporate tax savings: Information technology governance and internal information quality," International Journal of Accounting Information Systems, Elsevier, vol. 52(C).
    13. Bandeira, Guilherme & Caballé, Jordi & Vella, Eugenia, 2022. "Emigration and fiscal austerity in a depression," Journal of Economic Dynamics and Control, Elsevier, vol. 144(C).
    14. Sakkas, Stelios & Varthalitis, Petros, 2018. "The (intertemporal) equity-efficiency trade-off of fiscal consolidation," MPRA Paper 90983, University Library of Munich, Germany.
    15. Langot, François & Merola, Rossana & Oh, Samil, 2022. "Can taxes help ensure a fair globalization?," International Economics, Elsevier, vol. 171(C), pages 191-213.
    16. Keerthana Sivamayil & Elakkiya Rajasekar & Belqasem Aljafari & Srete Nikolovski & Subramaniyaswamy Vairavasundaram & Indragandhi Vairavasundaram, 2023. "A Systematic Study on Reinforcement Learning Based Applications," Energies, MDPI, vol. 16(3), pages 1-23, February.
    17. Bauer, Kevin & Nofer, Michael & Abdel-Karim, Benjamin M. & Hinz, Oliver, 2022. "The effects of discontinuing machine learning decision support," SAFE Working Paper Series 370, Leibniz Institute for Financial Research SAFE.
    18. Pejman Peykani & Mostafa Sargolzaei & Mohammad Hashem Botshekan & Camelia Oprean-Stan & Amir Takaloo, 2023. "Optimization of Asset and Liability Management of Banks with Minimum Possible Changes," Mathematics, MDPI, vol. 11(12), pages 1-24, June.
    19. Matteo Salto, 2016. "Fiscal Policy after the Crisis – Workshop Proceedings," European Economy - Discussion Papers 035, Directorate General Economic and Financial Affairs (DG ECFIN), European Commission.
    20. Dmytro Kovalenko & Olga Afanasieva & Nani Zabuta & Tetiana Boiko & Rosen Rosenov Baltov, 2021. "Model of Assessing the Overdue Debts in a Commercial Bank Using Neuro-Fuzzy Technologies," JRFM, MDPI, vol. 14(5), pages 1-20, May.
