Are adversarial examples inevitable?

Ali Shafahi; W. Ronny Huang; Christoph Studer; Soheil Feizi; Tom; Goldstein

arXiv:1809.02104·cs.LG·February 5, 2020

Are adversarial examples inevitable?

Ali Shafahi, W. Ronny Huang, Christoph Studer, Soheil Feizi, Tom, Goldstein

PDF

TL;DR

This paper investigates whether adversarial examples are unavoidable in neural networks by providing theoretical bounds on their susceptibility and exploring how problem complexity affects robustness.

Contribution

It offers a theoretical analysis demonstrating the inevitability of adversarial examples for certain problem classes and examines practical implications for real-world classifiers.

Findings

01

Adversarial examples are inescapable for some problem classes.

02

Dimensionality and image complexity limit classifier robustness.

03

Theoretical bounds inform the fundamental limits of adversarial defenses.

Abstract

A wide range of defenses have been proposed to harden neural networks against adversarial attacks. However, a pattern has emerged in which the majority of adversarial defenses are quickly broken by new attacks. Given the lack of success at generating robust defenses, we are led to ask a fundamental question: Are adversarial attacks inevitable? This paper analyzes adversarial examples from a theoretical perspective, and identifies fundamental bounds on the susceptibility of a classifier to adversarial attacks. We show that, for certain classes of problems, adversarial examples are inescapable. Using experiments, we explore the implications of theoretical guarantees for real-world problems and discuss how factors such as dimensionality and image complexity limit a classifier's robustness against adversarial examples.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.