Delving into Transferable Adversarial Examples and Black-box Attacks

Yanpei Liu; Xinyun Chen; Chang Liu; Dawn Song

arXiv:1611.02770·cs.LG·February 8, 2017·820 cites

Delving into Transferable Adversarial Examples and Black-box Attacks

Yanpei Liu, Xinyun Chen, Chang Liu, Dawn Song

PDF

Open Access 2 Repos

TL;DR

This paper conducts a large-scale study on the transferability of adversarial examples in deep neural networks, introduces ensemble-based methods to generate transferable targeted adversarial examples, and demonstrates their effectiveness against black-box systems.

Contribution

It is the first to analyze transferability on large models and datasets, and to generate targeted adversarial examples that transfer successfully using novel ensemble-based approaches.

Findings

01

Transferable non-targeted adversarial examples are easy to generate.

02

Targeted adversarial examples rarely transfer with their target labels using existing methods.

03

Ensemble-based approaches significantly improve transferability of targeted adversarial examples.

Abstract

An intriguing property of deep neural networks is the existence of adversarial examples, which can transfer among different architectures. These transferable adversarial examples may severely hinder deep neural network-based applications. Previous works mostly study the transferability using small scale datasets. In this work, we are the first to conduct an extensive study of the transferability over large models and a large scale dataset, and we are also the first to study the transferability of targeted adversarial examples with their target labels. We study both non-targeted and targeted adversarial examples, and show that while transferable non-targeted adversarial examples are easy to find, targeted adversarial examples generated using existing approaches almost never transfer with their target labels. Therefore, we propose novel ensemble-based approaches to generating transferable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Security and Verification in Computing · Physical Unclonable Functions (PUFs) and Hardware Security