IDEAS home Printed from https://ideas.repec.org/a/spr/aodasc/v10y2023i3d10.1007_s40745-021-00362-9.html
   My bibliography  Save this article

A Comprehensive Survey of Anomaly Detection Algorithms

Author

Listed:
  • Durgesh Samariya

    (Federation University
    Charotar University of Science and Technology (CHARUSAT))

  • Amit Thakkar

    (Charotar University of Science and Technology (CHARUSAT))

Abstract

Anomaly or outlier detection is consider as one of the vital application of data mining, which deals with anomalies or outliers. Anomalies are considered as data points that are dramatically different from the rest of the data points. In this survey, we comprehensively present anomaly detection algorithms in an organized manner. We begin this survey with the definition of anomaly, then provide essential elements of anomaly detection, such as different types of anomaly, different application domains, and evaluation measures. Such anomaly detection algorithms are categorized in seven categories based on their working mechanisms, which includes total of 52 algorithms. The categories are anomaly detection algorithms based on statistics, density, distance, clustering, isolation, ensemble and subspace. For each category, we provide the time complexity of each algorithm and their general advantages and disadvantages. In the end, we compared all discussed anomaly detection algorithms in detail.

Suggested Citation

  • Durgesh Samariya & Amit Thakkar, 2023. "A Comprehensive Survey of Anomaly Detection Algorithms," Annals of Data Science, Springer, vol. 10(3), pages 829-850, June.
  • Handle: RePEc:spr:aodasc:v:10:y:2023:i:3:d:10.1007_s40745-021-00362-9
    DOI: 10.1007/s40745-021-00362-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s40745-021-00362-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s40745-021-00362-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. James M. Tien, 2017. "Internet of Things, Real-Time Decision Making, and Artificial Intelligence," Annals of Data Science, Springer, vol. 4(2), pages 149-178, June.
    2. Douglas M. Hawkins, 1980. "Critical Values for Identifying Outliers," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 29(1), pages 95-96, March.
    3. Mohiuddin Ahmed, 2018. "Collective Anomaly Detection Techniques for Network Traffic Analysis," Annals of Data Science, Springer, vol. 5(4), pages 497-512, December.
    4. Markus Goldstein & Seiichi Uchida, 2016. "A Comparative Evaluation of Unsupervised Anomaly Detection Algorithms for Multivariate Data," PLOS ONE, Public Library of Science, vol. 11(4), pages 1-31, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Taha Yehia & Ali Wahba & Sondos Mostafa & Omar Mahmoud, 2022. "Suitability of Different Machine Learning Outlier Detection Algorithms to Improve Shale Gas Production Data for Effective Decline Curve Analysis," Energies, MDPI, vol. 15(23), pages 1-25, November.
    2. Aidin Zehtab-Salmasi & Ali-Reza Feizi-Derakhshi & Narjes Nikzad-Khasmakhi & Meysam Asgari-Chenaghlu & Saeideh Nabipour, 2023. "Multimodal Price Prediction," Annals of Data Science, Springer, vol. 10(3), pages 619-635, June.
    3. Vinicius Francisco Rofatto & Marcelo Tomio Matsuoka & Ivandro Klein & Maurício Roberto Veronez & Luiz Gonzaga da Silveira Junior, 2020. "On the effects of hard and soft equality constraints in the iterative outlier elimination procedure," PLOS ONE, Public Library of Science, vol. 15(8), pages 1-29, August.
    4. Yin, Sihua & Yang, Haidong & Xu, Kangkang & Zhu, Chengjiu & Zhang, Shaqing & Liu, Guosheng, 2022. "Dynamic real–time abnormal energy consumption detection and energy efficiency optimization analysis considering uncertainty," Applied Energy, Elsevier, vol. 307(C).
    5. Mohiuddin Ahmed & A. K. M. Najmul Islam, 2020. "Deep Learning: Hope or Hype," Annals of Data Science, Springer, vol. 7(3), pages 427-432, September.
    6. Heba Soltan Mohamed & M. Masoom Ali & Haitham M. Yousof, 2023. "The Lindley Gompertz Model for Estimating the Survival Rates: Properties and Applications in Insurance," Annals of Data Science, Springer, vol. 10(5), pages 1199-1216, October.
    7. Patrick Osatohanmwen & Eferhonore Efe-Eyefia & Francis O. Oyegue & Joseph E. Osemwenkhae & Sunday M. Ogbonmwan & Benson A. Afere, 2022. "The Exponentiated Gumbel–Weibull {Logistic} Distribution with Application to Nigeria’s COVID-19 Infections Data," Annals of Data Science, Springer, vol. 9(5), pages 909-943, October.
    8. Petar Radanliev & David Roure & Rob Walton & Max Kleek & Omar Santos & La’Treall Maddox, 2022. "What Country, University, or Research Institute, Performed the Best on Covid-19 During the First Wave of the Pandemic?," Annals of Data Science, Springer, vol. 9(5), pages 1049-1067, October.
    9. Roberto Moro-Visconti & Salvador Cruz Rambaud & Joaquín López Pascual, 2023. "Artificial intelligence-driven scalability and its impact on the sustainability and valuation of traditional firms," Palgrave Communications, Palgrave Macmillan, vol. 10(1), pages 1-14, December.
    10. Adele Ravagnani & Fabrizio Lillo & Paola Deriu & Piero Mazzarisi & Francesca Medda & Antonio Russo, 2024. "Dimensionality reduction techniques to support insider trading detection," Papers 2403.00707, arXiv.org, revised May 2024.
    11. Davide Nicola Continanza & Andrea del Monaco & Marco di Lucido & Daniele Figoli & Pasquale Maddaloni & Filippo Quarta & Giuseppe Turturiello, 2023. "Stacking machine learning models for anomaly detection: comparing AnaCredit to other banking data sets," IFC Bulletins chapters, in: Bank for International Settlements (ed.), Data science in central banking: applications and tools, volume 59, Bank for International Settlements.
    12. Anjan Mukherjee & Abhik Mukherjee, 2022. "Interval-Valued Intuitionistic Fuzzy Soft Rough Approximation Operators and Their Applications in Decision Making Problem," Annals of Data Science, Springer, vol. 9(3), pages 611-625, June.
    13. Mansoureh Beheshti Nejad & Seyed Mahmoud Zanjirchi & Seyed Mojtaba Hosseini Bamakan & Negar Jalilian, 2024. "Blockchain Adoption in Operations Management: A Systematic Literature Review of 14 Years of Research," Annals of Data Science, Springer, vol. 11(4), pages 1361-1389, August.
    14. M. Sridharan, 2023. "Generalized Regression Neural Network Model Based Estimation of Global Solar Energy Using Meteorological Parameters," Annals of Data Science, Springer, vol. 10(4), pages 1107-1125, August.
    15. Sisman, S. & Aydinoglu, A.C., 2022. "Improving performance of mass real estate valuation through application of the dataset optimization and Spatially Constrained Multivariate Clustering Analysis," Land Use Policy, Elsevier, vol. 119(C).
    16. Guangrui Tang & Neng Fan, 2022. "A Survey of Solution Path Algorithms for Regression and Classification Models," Annals of Data Science, Springer, vol. 9(4), pages 749-789, August.
    17. Gasser, Patrick, 2020. "A review on energy security indices to compare country performances," Energy Policy, Elsevier, vol. 139(C).
    18. Amaal Elsayed Mubarak & Ehab Mohamed Almetwally, 2024. "Modelling and Forecasting of Covid-19 Using Periodical ARIMA Models," Annals of Data Science, Springer, vol. 11(4), pages 1483-1502, August.
    19. Xueyan Xu & Fusheng Yu & Runjun Wan, 2023. "A Determining Degree-Based Method for Classification Problems with Interval-Valued Attributes," Annals of Data Science, Springer, vol. 10(2), pages 393-413, April.
    20. Qinghua Zheng & Chutong Yang & Haijun Yang & Jianhe Zhou, 2020. "A Fast Exact Algorithm for Deployment of Sensor Nodes for Internet of Things," Information Systems Frontiers, Springer, vol. 22(4), pages 829-842, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:aodasc:v:10:y:2023:i:3:d:10.1007_s40745-021-00362-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.