IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v13y2025i15p2446-d1712885.html
   My bibliography  Save this article

Statistical Data-Generative Machine Learning-Based Credit Card Fraud Detection Systems

Author

Listed:
  • Xiaomei Feng

    (Faculty of Applied Sciences, Macao Polytechnic University, R. de Luis Gonzaga Gomes, Macao SAR, China)

  • Song-Kyoo Kim

    (Faculty of Applied Sciences, Macao Polytechnic University, R. de Luis Gonzaga Gomes, Macao SAR, China)

Abstract

This study addresses the challenges of data imbalance and missing values in credit card transaction datasets by employing mode-based imputation and various machine learning models. We analyzed two distinct datasets: one consisting of European cardholders and the other from American Express, applying multiple machine learning algorithms, including Artificial Neural Networks, Convolutional Neural Networks, and Gradient Boosted Decision Trees, as well as others. Notably, the Gradient Boosted Decision Tree demonstrated superior predictive performance, with accuracy increasing by 4.53%, reaching 96.92% on the European cardholders dataset. Mode imputation significantly improved data quality, enabling stable and reliable analysis of merged datasets with up to 50% missing values. Hypothesis testing confirmed that the performance of the merged dataset was statistically significant compared to the original datasets. This study highlights the importance of robust data handling techniques in developing effective fraud detection systems, setting the stage for future research on combining different datasets and improving predictive accuracy in the financial sector.

Suggested Citation

  • Xiaomei Feng & Song-Kyoo Kim, 2025. "Statistical Data-Generative Machine Learning-Based Credit Card Fraud Detection Systems," Mathematics, MDPI, vol. 13(15), pages 1-15, July.
  • Handle: RePEc:gam:jmathe:v:13:y:2025:i:15:p:2446-:d:1712885
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/15/2446/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/15/2446/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Xiaomei Feng & Song-Kyoo Kim, 2024. "Novel Machine Learning Based Credit Card Fraud Detection Systems," Mathematics, MDPI, vol. 12(12), pages 1-11, June.
    2. E. Nur Ozkan‐Gunay & Mehmed Ozkan, 2007. "Prediction of bank failures in emerging financial markets: an ANN approach," Journal of Risk Finance, Emerald Group Publishing Limited, vol. 8(5), pages 465-480, November.
    3. Sima Siami-Namini & Akbar Siami Namin, 2018. "Forecasting Economics and Financial Time Series: ARIMA vs. LSTM," Papers 1803.06386, arXiv.org.
    4. Esraa Faisal Malik & Khai Wah Khaw & Bahari Belaton & Wai Peng Wong & XinYing Chew, 2022. "Credit Card Fraud Detection Using a New Hybrid Machine Learning Architecture," Mathematics, MDPI, vol. 10(9), pages 1-16, April.
    5. Eling, Martin & Schuhmacher, Frank, 2007. "Does the choice of performance measure influence the evaluation of hedge funds?," Journal of Banking & Finance, Elsevier, vol. 31(9), pages 2632-2647, September.
    6. Tang, Qihe & Tong, Zhiwei & Yang, Yang, 2021. "Large portfolio losses in a turbulent market," European Journal of Operational Research, Elsevier, vol. 292(2), pages 755-769.
    7. Hisham AbouGrad & Lakshmi Sankuru, 2025. "Online Banking Fraud Detection Model: Decentralized Machine Learning Framework to Enhance Effectiveness and Compliance with Data Privacy Regulations," Mathematics, MDPI, vol. 13(13), pages 1-19, June.
    8. E. Nur Ozkan-Gunay & Mehmed Ozkan, 2007. "Prediction of bank failures in emerging financial markets: an ANN approach," Journal of Risk Finance, Emerald Group Publishing, vol. 8(5), pages 465-480, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Petre Caraiani & Alina Mihaela Dima & Cristian Păun & Tănase Stamule & Madalina Vanesa Vargas, 2024. "Production networks and resilience: How dense production networks shield economies in financial crisis," PLOS ONE, Public Library of Science, vol. 19(4), pages 1-14, April.
    2. Pirgaip, Burak & Arslan-Ayaydin, Özgür & Karan, Mehmet Baha, 2021. "Do Sukuk provide diversification benefits to conventional bond investors? Evidence from Turkey," Global Finance Journal, Elsevier, vol. 50(C).
    3. José Eduardo Gómez González & Ines Paola Orozco Hinojosa, 2010. "Un modelo de alerta temprana para el sistema financiero colombiano," Revista ESPE - Ensayos sobre Política Económica, Banco de la Republica de Colombia, vol. 28(62), pages 124-147, June.
    4. Martin Eling, 2006. "Performance measurement of hedge funds using data envelopment analysis," Financial Markets and Portfolio Management, Springer;Swiss Society for Financial Market Research, vol. 20(4), pages 442-471, December.
    5. Masum, Mohammad & Masud, M.A. & Adnan, Muhaiminul Islam & Shahriar, Hossain & Kim, Sangil, 2022. "Comparative study of a mathematical epidemic model, statistical modeling, and deep learning for COVID-19 forecasting and management," Socio-Economic Planning Sciences, Elsevier, vol. 80(C).
    6. Amparo Soler Domínguez & Juan Carlos Matallín Sáez & Emili Tortosa Ausina, 2011. "On the informativeness of persistence for mutual funds' performance evaluation using partial frontiers," Working Papers. Serie EC 2011-08, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
    7. Tavakoli Baghdadabad, Mohammad Reza, 2014. "Average drawdown risk reduction and risk tolerances," Research in Economics, Elsevier, vol. 68(3), pages 264-276.
    8. A. Hoffmann, 2012. "Determinants of carry trades in Central and Eastern Europe," Applied Financial Economics, Taylor & Francis Journals, vol. 22(18), pages 1479-1490, September.
    9. Sima Siami‐Namini & Darren Hudson & Adao Alexandre Trindade & Conrad Lyford, 2019. "Commodity price volatility and U.S. monetary policy: Commodity price overshooting revisited," Agribusiness, John Wiley & Sons, Ltd., vol. 35(2), pages 200-218, April.
    10. Schuhmacher, Frank & Eling, Martin, 2012. "A decision-theoretic foundation for reward-to-risk performance measures," Journal of Banking & Finance, Elsevier, vol. 36(7), pages 2077-2082.
    11. Karstanje, Dennis & Sojli, Elvira & Tham, Wing Wah & van der Wel, Michel, 2013. "Economic valuation of liquidity timing," Journal of Banking & Finance, Elsevier, vol. 37(12), pages 5073-5087.
    12. Selim Mankaï & Aymen Belgacem, 2013. "Interactions Between Risk-Taking, Capital, and Reinsurance for Property-Liability Insurance Firms," EconomiX Working Papers 2013-23, University of Paris Nanterre, EconomiX.
    13. Aniruddha Dutta & Saket Kumar & Meheli Basu, 2020. "A Gated Recurrent Unit Approach to Bitcoin Price Prediction," JRFM, MDPI, vol. 13(2), pages 1-16, February.
    14. Lassance, Nathan & Vrins, Frédéric, 2021. "Portfolio selection with parsimonious higher comoments estimation," Journal of Banking & Finance, Elsevier, vol. 126(C).
    15. Balder, Sven & Schweizer, Nikolaus, 2017. "Risk aversion vs. the Omega ratio: Consistency results," Finance Research Letters, Elsevier, vol. 21(C), pages 78-84.
    16. Fischer, Thomas & Lundtofte, Frederik, 2020. "Unequal returns: Using the Atkinson index to measure financial risk," Journal of Banking & Finance, Elsevier, vol. 116(C).
    17. Benjamin Auer, 2013. "The low return distortion of the Sharpe ratio," Financial Markets and Portfolio Management, Springer;Swiss Society for Financial Market Research, vol. 27(3), pages 299-306, September.
    18. Guo, Biao & Xiao, Yugu, 2016. "A note on why doesn't the choice of performance measure matter?," Finance Research Letters, Elsevier, vol. 16(C), pages 248-254.
    19. Eling, Martin & Gatzert, Nadine & Schmeiser, Hato, 2009. "Minimum standards for investment performance: A new perspective on non-life insurer solvency," Insurance: Mathematics and Economics, Elsevier, vol. 45(1), pages 113-122, August.
    20. Yang, Fan & Havranek, Tomas & Irsova, Zuzana & Novak, Jiri, 2022. "Hedge Fund Performance: A Quantitative Survey," EconStor Preprints 260612, ZBW - Leibniz Information Centre for Economics.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:13:y:2025:i:15:p:2446-:d:1712885. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.