Poisoning Attacks with Generative Adversarial Nets

Luis Mu\~noz-Gonz\'alez; Bjarne Pfitzner; Matteo Russo; Javier; Carnerero-Cano; Emil C. Lupu

arXiv:1906.07773·cs.LG·September 26, 2019·39 cites

Poisoning Attacks with Generative Adversarial Nets

Luis Mu\~noz-Gonz\'alez, Bjarne Pfitzner, Matteo Russo, Javier, Carnerero-Cano, Emil C. Lupu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel generative adversarial network-based method to craft effective poisoning attacks on machine learning classifiers, including deep networks, by generating adversarial training samples that degrade model performance.

Contribution

The paper presents a new GAN-based approach for systematic poisoning attacks that models detectability constraints and identifies vulnerable data regions, improving attack realism and effectiveness.

Findings

01

Effective attack on deep networks demonstrated

02

Identifies vulnerable data regions for poisoning

03

Models detectability constraints realistically

Abstract

Machine learning algorithms are vulnerable to poisoning attacks: An adversary can inject malicious points in the training dataset to influence the learning process and degrade the algorithm's performance. Optimal poisoning attacks have already been proposed to evaluate worst-case scenarios, modelling attacks as a bi-level optimization problem. Solving these problems is computationally demanding and has limited applicability for some models such as deep networks. In this paper we introduce a novel generative model to craft systematic poisoning attacks against machine learning classifiers generating adversarial training examples, i.e. samples that look like genuine data points but that degrade the classifier's accuracy when used for training. We propose a Generative Adversarial Net with three components: generator, discriminator, and the target classifier. This approach allows us to model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lmunoz-gonzalez/Poisoning-Attacks-with-Back-gradient-Optimization
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis