Adversarial Transformation Networks: Learning to Generate Adversarial   Examples

Shumeet Baluja; Ian Fischer

arXiv:1703.09387·cs.NE·March 29, 2017·221 cites

Adversarial Transformation Networks: Learning to Generate Adversarial Examples

Shumeet Baluja, Ian Fischer

PDF

Open Access 2 Repos

TL;DR

This paper introduces Adversarial Transformation Networks (ATNs), a fast and diverse method for generating adversarial examples by training neural networks to produce targeted modifications that fool classifiers.

Contribution

The paper proposes a novel, efficient approach to generate adversarial examples using trained feed-forward networks, enabling rapid and diverse attacks against various classifiers.

Findings

01

ATNs can generate effective adversarial examples quickly.

02

ATNs produce diverse adversarial outputs.

03

ATNs successfully attack MNIST and ImageNet classifiers.

Abstract

Multiple different approaches of generating adversarial examples have been proposed to attack deep neural networks. These approaches involve either directly computing gradients with respect to the image pixels, or directly solving an optimization on the image pixels. In this work, we present a fundamentally new method for generating adversarial examples that is fast to execute and provides exceptional diversity of output. We efficiently train feed-forward neural networks in a self-supervised manner to generate adversarial examples against a target network or set of networks. We call such a network an Adversarial Transformation Network (ATN). ATNs are trained to generate adversarial examples that minimally modify the classifier's outputs given the original input, while constraining the new classification to match an adversarial target class. We present methods to train ATNs and analyze…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Anomaly Detection Techniques and Applications

MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Residual Connection