Detection of Adversarial Training Examples in Poisoning Attacks through   Anomaly Detection

Andrea Paudice; Luis Mu\~noz-Gonz\'alez; Andras Gyorgy; Emil C. Lupu

arXiv:1802.03041·stat.ML·February 14, 2018·113 cites

Detection of Adversarial Training Examples in Poisoning Attacks through Anomaly Detection

Andrea Paudice, Luis Mu\~noz-Gonz\'alez, Andras Gyorgy, Emil C. Lupu

PDF

Open Access 1 Repo

TL;DR

This paper introduces an anomaly detection-based defense mechanism to identify and mitigate adversarial poisoning attacks in machine learning by detecting malicious training samples that differ from genuine data.

Contribution

It proposes a novel outlier detection approach to identify adversarial poisoning samples, improving the robustness of machine learning models against such attacks.

Findings

01

Adversarial poisoning samples are distinguishable from genuine data.

02

Pre-filtering training data can effectively detect malicious samples.

03

The method enhances model robustness against optimal poisoning attacks.

Abstract

Machine learning has become an important component for many systems and applications including computer vision, spam filtering, malware and network intrusion detection, among others. Despite the capabilities of machine learning algorithms to extract valuable information from data and produce accurate predictions, it has been shown that these algorithms are vulnerable to attacks. Data poisoning is one of the most relevant security threats against machine learning systems, where attackers can subvert the learning process by injecting malicious samples in the training data. Recent work in adversarial machine learning has shown that the so-called optimal attack strategies can successfully poison linear classifiers, degrading the performance of the system dramatically after compromising a small fraction of the training dataset. In this paper we propose a defence mechanism to mitigate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lmunoz-gonzalez/Poisoning-Attacks-with-Back-gradient-Optimization
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Network Security and Intrusion Detection