NATTACK: Learning the Distributions of Adversarial Examples for an   Improved Black-Box Attack on Deep Neural Networks

Yandong Li; Lijun Li; Liqiang Wang; Tong Zhang; Boqing Gong

arXiv:1905.00441·cs.LG·December 11, 2019·45 cites

NATTACK: Learning the Distributions of Adversarial Examples for an Improved Black-Box Attack on Deep Neural Networks

Yandong Li, Lijun Li, Liqiang Wang, Tong Zhang, Boqing Gong

PDF

Open Access 1 Repo

TL;DR

This paper introduces NATTACK, a black-box adversarial attack method that learns the distribution of adversarial examples around inputs, effectively attacking various neural networks and revealing insights into defense robustness.

Contribution

NATTACK is a novel black-box attack algorithm that models the distribution of adversarial examples, outperforming existing methods and applicable across different neural network architectures.

Findings

01

NATTACK outperforms state-of-the-art attack methods on multiple DNNs.

02

Adversarial training remains effective against NATTACK.

03

Adversarial examples are less transferable across defended DNNs than vanilla ones.

Abstract

Powerful adversarial attack methods are vital for understanding how to construct robust deep neural networks (DNNs) and for thoroughly testing defense techniques. In this paper, we propose a black-box adversarial attack algorithm that can defeat both vanilla DNNs and those generated by various defense techniques developed recently. Instead of searching for an "optimal" adversarial example for a benign input to a targeted DNN, our algorithm finds a probability density distribution over a small region centered around the input, such that a sample drawn from this distribution is likely an adversarial example, without the need of accessing the DNN's internal layers or weights. Our approach is universal as it can successfully attack different neural networks by a single algorithm. It is also strong; according to the testing against 2 vanilla DNNs and 13 defended ones, it outperforms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Cold-Winter/Nattack
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Malware Detection Techniques