Evolution of Neural Tangent Kernels under Benign and Adversarial Training
Noel Loo, Ramin Hasani, Alexander Amini, Daniela Rus

TL;DR
This paper empirically studies how neural tangent kernels evolve during training, revealing that adversarial training leads to a different kernel that enhances robustness even when combined with non-robust training.
Contribution
It provides the first empirical analysis of NTK evolution under adversarial training, showing how it differs from standard training and improves robustness.
Findings
Adversarial training causes the NTK to rapidly converge to a different kernel.
The new kernel obtained through adversarial training offers robustness against PGD attacks.
Adversarial training on a fixed kernel achieves 76.1% robust accuracy on CIFAR-10.
Abstract
Two key challenges facing modern deep learning are mitigating deep networks' vulnerability to adversarial attacks and understanding deep learning's generalization capabilities. Towards the first issue, many defense strategies have been developed, with the most common being Adversarial Training (AT). Towards the second challenge, one of the dominant theories that has emerged is the Neural Tangent Kernel (NTK) -- a characterization of neural network behavior in the infinite-width limit. In this limit, the kernel is frozen, and the underlying feature map is fixed. In finite widths, however, there is evidence that feature learning happens at the earlier stages of the training (kernel learning) before a second phase where the kernel remains fixed (lazy training). While prior work has aimed at studying adversarial vulnerability through the lens of the frozen infinite-width NTK, there is no…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning
MethodsNeural Tangent Kernel
