Uniform Convergence, Adversarial Spheres and a Simple Remedy
Gregor Bachmann, Seyed-Mohsen Moosavi-Dezfooli, Thomas Hofmann

TL;DR
This paper investigates the limitations of uniform convergence in explaining neural network generalization, especially against adversarial examples, and proposes a bias adjustment as a simple remedy supported by theoretical and empirical evidence.
Contribution
It provides a theoretical analysis of the adversarial phenomenon in neural networks and NTKs, revealing the role of output bias and proposing a mitigation strategy.
Findings
Uniform convergence bounds are vacuous for adversarial sets.
Adjusting output bias mitigates adversarial misclassification.
Critical sample sizes exist where adversarial effects vanish.
Abstract
Previous work has cast doubt on the general framework of uniform convergence and its ability to explain generalization in neural networks. By considering a specific dataset, it was observed that a neural network completely misclassifies a projection of the training data (adversarial set), rendering any existing generalization bound based on uniform convergence vacuous. We provide an extensive theoretical investigation of the previously studied data setting through the lens of infinitely-wide models. We prove that the Neural Tangent Kernel (NTK) also suffers from the same phenomenon and we uncover its origin. We highlight the important role of the output bias and show theoretically as well as empirically how a sensible choice completely mitigates the problem. We identify sharp phase transitions in the accuracy on the adversarial set and study its dependency on the training sample size.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Model Reduction and Neural Networks · Neural Networks and Applications
