Constructing Unrestricted Adversarial Examples with Generative Models

Yang Song; Rui Shu; Nate Kushman; Stefano Ermon

arXiv:1805.07894·cs.LG·December 4, 2018·125 cites

Constructing Unrestricted Adversarial Examples with Generative Models

Yang Song, Rui Shu, Nate Kushman, Stefano Ermon

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new type of adversarial example generated from scratch using generative models, which can bypass existing defenses against traditional perturbation-based attacks.

Contribution

It proposes a novel threat model and method for creating unrestricted adversarial examples with generative models, expanding the scope of adversarial attack research.

Findings

01

Unrestricted adversarial examples can bypass strong defenses.

02

Generated examples are legitimate and belong to the target class.

03

Method works across multiple datasets like MNIST, SVHN, and CelebA.

Abstract

Adversarial examples are typically constructed by perturbing an existing data point within a small matrix norm, and current defense methods are focused on guarding against this type of attack. In this paper, we propose unrestricted adversarial examples, a new threat model where the attackers are not restricted to small norm-bounded perturbations. Different from perturbation-based attacks, we propose to synthesize unrestricted adversarial examples entirely from scratch using conditional generative models. Specifically, we first train an Auxiliary Classifier Generative Adversarial Network (AC-GAN) to model the class-conditional distribution over data samples. Then, conditioned on a desired class, we search over the AC-GAN latent space to find images that are likely under the generative model and are misclassified by a target classifier. We demonstrate through human evaluation that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ermongroup/generative_adversary
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Physical Unclonable Functions (PUFs) and Hardware Security