Adversarial Example Defenses: Ensembles of Weak Defenses are not Strong

Warren He; James Wei; Xinyun Chen; Nicholas Carlini; Dawn; Song

arXiv:1706.04701·cs.LG·June 16, 2017·80 cites

Adversarial Example Defenses: Ensembles of Weak Defenses are not Strong

Warren He, James Wei, Xinyun Chen, Nicholas Carlini, Dawn, Song

PDF

Open Access

TL;DR

This paper evaluates whether combining multiple weak defenses can create a robust adversarial defense, concluding that such ensembles are ineffective against adaptive adversaries.

Contribution

It systematically tests ensemble defenses, demonstrating that combining weak defenses does not prevent successful adversarial attacks.

Findings

01

Ensemble of weak defenses fails against adaptive attacks

02

Two recent defenses are ineffective when combined

03

Combining three independent defenses does not improve robustness

Abstract

Ongoing research has proposed several methods to defend neural networks against adversarial examples, many of which researchers have shown to be ineffective. We ask whether a strong defense can be created by combining multiple (possibly weak) defenses. To answer this question, we study three defenses that follow this approach. Two of these are recently proposed defenses that intentionally combine components designed to work well together. A third defense combines three independent defenses. For all the components of these defenses and the combined defenses themselves, we show that an adaptive adversary can create adversarial examples successfully with low distortion. Thus, our work implies that ensemble of weak defenses is not sufficient to provide strong defense against adversarial examples.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Physical Unclonable Functions (PUFs) and Hardware Security