Solving adversarial examples requires solving exponential misalignment

Alessandro Salvatore; Stanislav Fort; Surya Ganguli

arXiv:2603.03507·cs.LG·March 12, 2026

Solving adversarial examples requires solving exponential misalignment

Alessandro Salvatore, Stanislav Fort, Surya Ganguli

PDF

Open Access

TL;DR

This paper investigates the geometric and dimensional properties of neural network perceptual manifolds, revealing exponential misalignment with human concepts as a key factor behind adversarial vulnerability.

Contribution

It introduces the concept of perceptual manifolds, analyzes their high dimensionality, and links this to adversarial examples, providing a new geometric perspective on robustness.

Findings

01

Neural network perceptual manifolds have much higher dimensionality than human concepts.

02

Exponential misalignment exists between machine and human perceptual manifolds.

03

Robust networks still exhibit exponential misalignment, limiting adversarial robustness.

Abstract

Adversarial attacks - input perturbations imperceptible to humans that fool neural networks - remain both a persistent failure mode in machine learning, and a phenomenon with mysterious origins. To shed light, we define and analyze a network's perceptual manifold (PM) for a class concept as the space of all inputs confidently assigned to that class by the network. We find, strikingly, that the dimensionalities of neural network PMs are orders of magnitude higher than those of natural human concepts. Since volume typically grows exponentially with dimension, this suggests exponential misalignment between machines and humans, with exponentially many inputs confidently assigned to concepts by machines but not humans. Furthermore, this provides a natural geometric hypothesis for the origin of adversarial examples: because a network's PM fills such a large region of input space, any input…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI