Detection of Adversarial Training Examples in Poisoning Attacks through Anomaly Detection
Andrea Paudice, Luis Mu\~noz-Gonz\'alez, Andras Gyorgy, Emil C. Lupu

TL;DR
This paper introduces an anomaly detection-based defense mechanism to identify and mitigate adversarial poisoning attacks in machine learning by detecting malicious training samples that differ from genuine data.
Contribution
It proposes a novel outlier detection approach to identify adversarial poisoning samples, improving the robustness of machine learning models against such attacks.
Findings
Adversarial poisoning samples are distinguishable from genuine data.
Pre-filtering training data can effectively detect malicious samples.
The method enhances model robustness against optimal poisoning attacks.
Abstract
Machine learning has become an important component for many systems and applications including computer vision, spam filtering, malware and network intrusion detection, among others. Despite the capabilities of machine learning algorithms to extract valuable information from data and produce accurate predictions, it has been shown that these algorithms are vulnerable to attacks. Data poisoning is one of the most relevant security threats against machine learning systems, where attackers can subvert the learning process by injecting malicious samples in the training data. Recent work in adversarial machine learning has shown that the so-called optimal attack strategies can successfully poison linear classifiers, degrading the performance of the system dramatically after compromising a small fraction of the training dataset. In this paper we propose a defence mechanism to mitigate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Network Security and Intrusion Detection
