IDEAS home Printed from https://ideas.repec.org/a/kap/compec/v62y2023i4d10.1007_s10614-022-10314-x.html
   My bibliography  Save this article

Performance of Different Machine Learning Algorithms in Detecting Financial Fraud

Author

Listed:
  • Alhanouf Abdulrahman Saleh Alsuwailem

    (Imam Mohammad Ibn Saud Islamic University (IMSIU))

  • Emad Salem

    (Institute of Public Administration)

  • Abdul Khader Jilani Saudagar

    (Imam Mohammad Ibn Saud Islamic University (IMSIU))

Abstract

This research investigates how the problem of money laundering (ML) can be detected in Saudi Arabia with supervised machine learning, specifically at two levels: the establishment-level means that each establishment in the dataset only has one unique record, while the annual level means each establishment has four main records for each year from 2016 to 2019. The main contribution of this study is to show how effective applying machine learning is in detecting ML activities in establishments. It helps to improve the detection process to be in a proactive manner. This research also considers the significance of machine learning techniques in improving the work of the Financial Intelligent Unit, lowering the risks and consequences of financial crime, and fulfilling the Financial Action Task Force’s priorities. The Saudi General Organization for Social Insurance contributed the data used in this study from 2016 to 2019. The data pertains to medium and small establishments, it is classified using supervised machine learning algorithms [Random Forest (RF), Decision Tree (DT), Gradient Boosting (GB), and Nearest Neighbor (KNN)]. Each classifier’s performance was assessed in terms of accuracy, precision, recall, fi-measure, and area under the curve. The main findings show that the RF classifier provided the best result with 93% accuracy for the establishment level by classifying the establishments and assigning classes for them based on risk levels. Then, the DT achieved an accuracy of 90%, GB and KNN are 74% and 87%, respectively. While at the annual level, the DT and RF are both achieved the same accuracy with 98%, then GB with 92% and 97% for KNN. This research was written due to its importance in improving the investigation process in Saudi Arabia and performing a deep analysis for the establishments that play the main role in passing illegal activities including ML under their umbrella.

Suggested Citation

  • Alhanouf Abdulrahman Saleh Alsuwailem & Emad Salem & Abdul Khader Jilani Saudagar, 2023. "Performance of Different Machine Learning Algorithms in Detecting Financial Fraud," Computational Economics, Springer;Society for Computational Economics, vol. 62(4), pages 1631-1667, December.
  • Handle: RePEc:kap:compec:v:62:y:2023:i:4:d:10.1007_s10614-022-10314-x
    DOI: 10.1007/s10614-022-10314-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10614-022-10314-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10614-022-10314-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Niklas Bussmann & Paolo Giudici & Dimitri Marinelli & Jochen Papenbrock, 2021. "Explainable Machine Learning in Credit Risk Management," Computational Economics, Springer;Society for Computational Economics, vol. 57(1), pages 203-216, January.
    2. Mohammed Ahmad Naheem, 2019. "Saudi Arabia’s efforts on combating money laundering and terrorist financing," Journal of Money Laundering Control, Emerald Group Publishing Limited, vol. 22(2), pages 233-246, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dangxing Chen, 2023. "Can I Trust the Explanations? Investigating Explainable Machine Learning Methods for Monotonic Models," Papers 2309.13246, arXiv.org.
    2. Stephen T. Onifade & Bright A. Gyamfi & Ilham Haouas & Simplice A. Asongu, 2023. "Extending the Frontiers of Financial Development for Sustainability of the MENA States: The Roles of Resource Abundance and Institutional Quality," Working Papers 23/055, European Xtramile Centre of African Studies (EXCAS).
    3. Bastos, João A. & Matos, Sara M., 2022. "Explainable models of credit losses," European Journal of Operational Research, Elsevier, vol. 301(1), pages 386-394.
    4. Kim Long Tran & Hoang Anh Le & Thanh Hien Nguyen & Duc Trung Nguyen, 2022. "Explainable Machine Learning for Financial Distress Prediction: Evidence from Vietnam," Data, MDPI, vol. 7(11), pages 1-12, November.
    5. Zhang, Tianjiao & Zhu, Weidong & Wu, Yong & Wu, Zihao & Zhang, Chao & Hu, Xue, 2023. "An explainable financial risk early warning model based on the DS-XGBoost model," Finance Research Letters, Elsevier, vol. 56(C).
    6. Chen, Dangxing & Ye, Jiahui & Ye, Weicheng, 2023. "Interpretable selective learning in credit risk," Research in International Business and Finance, Elsevier, vol. 65(C).
    7. Babaei, Golnoosh & Giudici, Paolo & Raffinetti, Emanuela, 2022. "Explainable artificial intelligence for crypto asset allocation," Finance Research Letters, Elsevier, vol. 47(PB).
    8. Alex Gramegna & Paolo Giudici, 2020. "Why to Buy Insurance? An Explainable Artificial Intelligence Approach," Risks, MDPI, vol. 8(4), pages 1-9, December.
    9. Julien Chevallier & Dominique Guégan & Stéphane Goutte, 2021. "Is It Possible to Forecast the Price of Bitcoin?," Forecasting, MDPI, vol. 3(2), pages 1-44, May.
    10. Lisa Crosato & Caterina Liberati & Marco Repetto, 2021. "Look Who's Talking: Interpretable Machine Learning for Assessing Italian SMEs Credit Default," Papers 2108.13914, arXiv.org, revised Sep 2021.
    11. Elena Ivona DUMITRESCU & Sullivan HUE & Christophe HURLIN & Sessi TOKPAVI, 2020. "Machine Learning or Econometrics for Credit Scoring: Let’s Get the Best of Both Worlds," LEO Working Papers / DR LEO 2839, Orleans Economics Laboratory / Laboratoire d'Economie d'Orleans (LEO), University of Orleans.
    12. Sunghyon Kyeong & Daehee Kim & Jinho Shin, 2021. "Can System Log Data Enhance the Performance of Credit Scoring?—Evidence from an Internet Bank in Korea," Sustainability, MDPI, vol. 14(1), pages 1-12, December.
    13. Wei Jie Yeo & Wihan van der Heever & Rui Mao & Erik Cambria & Ranjan Satapathy & Gianmarco Mengaldo, 2023. "A Comprehensive Review on Financial Explainable AI," Papers 2309.11960, arXiv.org.
    14. Md Shajalal & Alexander Boden & Gunnar Stevens, 2022. "Explainable product backorder prediction exploiting CNN: Introducing explainable models in businesses," Electronic Markets, Springer;IIM University of St. Gallen, vol. 32(4), pages 2107-2122, December.
    15. Marc Wildi & Branka Hadji Misheva, 2022. "A Time Series Approach to Explainability for Neural Nets with Applications to Risk-Management and Fraud Detection," Papers 2212.02906, arXiv.org.
    16. Ahelegbey, Daniel & Giudici, Paolo & Pediroda, Valentino, 2023. "A network based fintech inclusion platform," Socio-Economic Planning Sciences, Elsevier, vol. 87(PB).
    17. David Mhlanga, 2021. "Financial Inclusion in Emerging Economies: The Application of Machine Learning and Artificial Intelligence in Credit Risk Assessment," IJFS, MDPI, vol. 9(3), pages 1-16, July.
    18. Babaei, Golnoosh & Giudici, Paolo & Raffinetti, Emanuela, 2023. "Explainable FinTech lending," Journal of Economics and Business, Elsevier, vol. 125.
    19. Yanhui Shen, 2023. "American Option Pricing using Self-Attention GRU and Shapley Value Interpretation," Papers 2310.12500, arXiv.org.
    20. Chen, Yujia & Calabrese, Raffaella & Martin-Barragan, Belen, 2024. "Interpretable machine learning for imbalanced credit scoring datasets," European Journal of Operational Research, Elsevier, vol. 312(1), pages 357-372.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:compec:v:62:y:2023:i:4:d:10.1007_s10614-022-10314-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.