Adversarial Examples in Random Neural Networks with General Activations
Andrea Montanari, Yuchen Wu

TL;DR
This paper proves that adversarial examples are common in random neural networks with general activation functions, extending previous results to networks of any width and activation type using Gaussian conditioning techniques.
Contribution
It generalizes the theoretical understanding of adversarial examples to all widths and activation functions in random neural networks, beyond ReLU and smooth activations.
Findings
Adversarial examples exist with high probability along gradient directions.
The proof uses Gaussian conditioning to analyze joint distributions.
Results apply to networks with arbitrary width and locally Lipschitz activations.
Abstract
A substantial body of empirical work documents the lack of robustness in deep learning models to adversarial examples. Recent theoretical work proved that adversarial examples are ubiquitous in two-layers networks with sub-exponential width and ReLU or smooth activations, and multi-layer ReLU networks with sub-exponential width. We present a result of the same type, with no restriction on width and for general locally Lipschitz continuous activations. More precisely, given a neural network with random weights , and feature vector , we show that an adversarial example can be found with high probability along the direction of the gradient . Our proof is based on a Gaussian conditioning technique. Instead of proving that is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Nuclear reactor physics and engineering · Stochastic Gradient Optimization Techniques
