Relationship between manifold smoothness and adversarial vulnerability in deep learning with local errors
Zijian Jiang, Jianwen Zhou, and Haiping Huang

TL;DR
This paper investigates how the geometric properties of hidden representations in deep neural networks relate to their vulnerability to adversarial attacks, revealing that manifold smoothness impacts generalization and robustness.
Contribution
It establishes a fundamental link between manifold geometry and adversarial vulnerability in deep networks trained with local errors, providing new insights into robustness mechanisms.
Findings
High generalization accuracy correlates with fast eigen-spectrum decay.
Monotonic relationship between accuracy and eigen-spectrum exponent under Gaussian noise.
Non-monotonic behavior observed under FGSM adversarial attacks.
Abstract
Artificial neural networks can achieve impressive performances, and even outperform humans in some specific tasks. Nevertheless, unlike biological brains, the artificial neural networks suffer from tiny perturbations in sensory input, under various kinds of adversarial attacks. It is therefore necessary to study the origin of the adversarial vulnerability. Here, we establish a fundamental relationship between geometry of hidden representations (manifold perspective) and the generalization capability of the deep networks. For this purpose, we choose a deep neural network trained by local errors, and then analyze emergent properties of trained networks through the manifold dimensionality, manifold smoothness, and the generalization capability. To explore effects of adversarial examples, we consider independent Gaussian noise attacks and fast-gradient-sign-method (FGSM) attacks. Our study…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
