A Bayes-Optimal View on Adversarial Examples
Eitan Richardson, Yair Weiss

TL;DR
This paper analyzes adversarial examples through the lens of Bayes-Optimal classifiers, demonstrating that optimal classifiers can be robust while CNNs often remain vulnerable, suggesting adversarial examples are an avoidable flaw.
Contribution
It introduces a framework for understanding adversarial robustness using Bayes-Optimal classifiers and provides conditions under which classifiers are provably robust in high dimensions.
Findings
Bayes-Optimal classifiers can be robust against adversarial attacks.
CNNs trained on the same data are often vulnerable despite optimality.
RBF SVMs trained on the same data are consistently robust.
Abstract
Since the discovery of adversarial examples - the ability to fool modern CNN classifiers with tiny perturbations of the input, there has been much discussion whether they are a "bug" that is specific to current neural architectures and training methods or an inevitable "feature" of high dimensional geometry. In this paper, we argue for examining adversarial examples from the perspective of Bayes-Optimal classification. We construct realistic image datasets for which the Bayes-Optimal classifier can be efficiently computed and derive analytic conditions on the distributions under which these classifiers are provably robust against any adversarial attack even in high dimensions. Our results show that even when these "gold standard" optimal classifiers are robust, CNNs trained on the same datasets consistently learn a vulnerable classifier, indicating that adversarial examples are often an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Malware Detection Techniques
