Solving adversarial examples requires solving exponential misalignment
Alessandro Salvatore, Stanislav Fort, Surya Ganguli

TL;DR
This paper investigates the geometric and dimensional properties of neural network perceptual manifolds, revealing exponential misalignment with human concepts as a key factor behind adversarial vulnerability.
Contribution
It introduces the concept of perceptual manifolds, analyzes their high dimensionality, and links this to adversarial examples, providing a new geometric perspective on robustness.
Findings
Neural network perceptual manifolds have much higher dimensionality than human concepts.
Exponential misalignment exists between machine and human perceptual manifolds.
Robust networks still exhibit exponential misalignment, limiting adversarial robustness.
Abstract
Adversarial attacks - input perturbations imperceptible to humans that fool neural networks - remain both a persistent failure mode in machine learning, and a phenomenon with mysterious origins. To shed light, we define and analyze a network's perceptual manifold (PM) for a class concept as the space of all inputs confidently assigned to that class by the network. We find, strikingly, that the dimensionalities of neural network PMs are orders of magnitude higher than those of natural human concepts. Since volume typically grows exponentially with dimension, this suggests exponential misalignment between machines and humans, with exponentially many inputs confidently assigned to concepts by machines but not humans. Furthermore, this provides a natural geometric hypothesis for the origin of adversarial examples: because a network's PM fills such a large region of input space, any input…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI
