Adversarial Examples in Random Neural Networks with General Activations

Andrea Montanari; Yuchen Wu

arXiv:2203.17209·cs.LG·January 24, 2023

Adversarial Examples in Random Neural Networks with General Activations

Andrea Montanari, Yuchen Wu

PDF

Open Access

TL;DR

This paper proves that adversarial examples are common in random neural networks with general activation functions, extending previous results to networks of any width and activation type using Gaussian conditioning techniques.

Contribution

It generalizes the theoretical understanding of adversarial examples to all widths and activation functions in random neural networks, beyond ReLU and smooth activations.

Findings

01

Adversarial examples exist with high probability along gradient directions.

02

The proof uses Gaussian conditioning to analyze joint distributions.

03

Results apply to networks with arbitrary width and locally Lipschitz activations.

Abstract

A substantial body of empirical work documents the lack of robustness in deep learning models to adversarial examples. Recent theoretical work proved that adversarial examples are ubiquitous in two-layers networks with sub-exponential width and ReLU or smooth activations, and multi-layer ReLU networks with sub-exponential width. We present a result of the same type, with no restriction on width and for general locally Lipschitz continuous activations. More precisely, given a neural network $f (\cdot; θ)$ with random weights $θ$ , and feature vector $x$ , we show that an adversarial example $x^{'}$ can be found with high probability along the direction of the gradient $\nabla_{x} f (x; θ)$ . Our proof is based on a Gaussian conditioning technique. Instead of proving that $f$ is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Nuclear reactor physics and engineering · Stochastic Gradient Optimization Techniques