IDEAS home Printed from https://ideas.repec.org/a/sae/intdis/v18y2022i7p15501329221105159.html
   My bibliography  Save this article

The robustness of popular multiclass machine learning models against poisoning attacks: Lessons and insights

Author

Listed:
  • Majdi Maabreh
  • Arwa Maabreh
  • Basheer Qolomany
  • Ala Al-Fuqaha

Abstract

Despite the encouraging outcomes of machine learning and artificial intelligence applications, the safety of artificial intelligence–based systems is one of the most severe challenges that need further exploration. Data set poisoning is a severe problem that may lead to the corruption of machine learning models. The attacker injects data into the data set that are faulty or mislabeled by flipping the actual labels into the incorrect ones. The word “robustness†refers to a machine learning algorithm’s ability to cope with hostile situations. Here, instead of flipping the labels randomly, we use the clustering approach to choose the training samples for label changes to influence the classifiers’ performance and the distance-based anomaly detection capacity in quarantining the poisoned samples. According to our experiments on a benchmark data set, random label flipping may have a short-term negative impact on the classifier’s accuracy. Yet, an anomaly filter would discover on average 63% of them. On the contrary, the proposed clustering-based flipping might inject dormant poisoned samples until the number of poisoned samples is enough to influence the classifiers’ performance severely; on average, the same anomaly filter would discover 25% of them. We also highlight important lessons and observations during this experiment about the performance and robustness of popular multiclass learners against training data set–poisoning attacks that include: trade-offs, complexity, categories, poisoning resistance, and hyperparameter optimization.

Suggested Citation

  • Majdi Maabreh & Arwa Maabreh & Basheer Qolomany & Ala Al-Fuqaha, 2022. "The robustness of popular multiclass machine learning models against poisoning attacks: Lessons and insights," International Journal of Distributed Sensor Networks, , vol. 18(7), pages 15501329221, July.
  • Handle: RePEc:sae:intdis:v:18:y:2022:i:7:p:15501329221105159
    DOI: 10.1177/15501329221105159
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/15501329221105159
    Download Restriction: no

    File URL: https://libkey.io/10.1177/15501329221105159?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:intdis:v:18:y:2022:i:7:p:15501329221105159. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.