Relationship between manifold smoothness and adversarial vulnerability   in deep learning with local errors

Zijian Jiang; Jianwen Zhou; and Haiping Huang

arXiv:2007.02047·cs.LG·March 24, 2021

Relationship between manifold smoothness and adversarial vulnerability in deep learning with local errors

Zijian Jiang, Jianwen Zhou, and Haiping Huang

PDF

TL;DR

This paper investigates how the geometric properties of hidden representations in deep neural networks relate to their vulnerability to adversarial attacks, revealing that manifold smoothness impacts generalization and robustness.

Contribution

It establishes a fundamental link between manifold geometry and adversarial vulnerability in deep networks trained with local errors, providing new insights into robustness mechanisms.

Findings

01

High generalization accuracy correlates with fast eigen-spectrum decay.

02

Monotonic relationship between accuracy and eigen-spectrum exponent under Gaussian noise.

03

Non-monotonic behavior observed under FGSM adversarial attacks.

Abstract

Artificial neural networks can achieve impressive performances, and even outperform humans in some specific tasks. Nevertheless, unlike biological brains, the artificial neural networks suffer from tiny perturbations in sensory input, under various kinds of adversarial attacks. It is therefore necessary to study the origin of the adversarial vulnerability. Here, we establish a fundamental relationship between geometry of hidden representations (manifold perspective) and the generalization capability of the deep networks. For this purpose, we choose a deep neural network trained by local errors, and then analyze emergent properties of trained networks through the manifold dimensionality, manifold smoothness, and the generalization capability. To explore effects of adversarial examples, we consider independent Gaussian noise attacks and fast-gradient-sign-method (FGSM) attacks. Our study…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.