Understanding the Evolution of the Neural Tangent Kernel at the Edge of Stability
Kaiqi Jiang, Jeremy Cohen, Yuanzhi Li

TL;DR
This paper investigates how the eigenvectors of Neural Tangent Kernels evolve during the Edge of Stability in gradient descent, revealing their alignment with training targets and providing theoretical insights for linear networks.
Contribution
It offers the first detailed analysis of NTK eigenvector behavior during EoS, linking learning rates to eigenvector alignment and providing a theoretical model for linear networks.
Findings
Larger learning rates increase eigenvector alignment with targets.
NTK eigenvectors oscillate around the EoS during training.
Theoretical analysis explains eigenvector dynamics in linear networks.
Abstract
The study of Neural Tangent Kernels (NTKs) in deep learning has drawn increasing attention in recent years. NTKs typically actively change during training and are related to feature learning. In parallel, recent work on Gradient Descent (GD) has found a phenomenon called Edge of Stability (EoS), in which the largest eigenvalue of the NTK oscillates around a value inversely proportional to the step size. However, although follow-up works have explored the underlying mechanism of such eigenvalue behavior in depth, the understanding of the behavior of the NTK eigenvectors during EoS is still missing. This paper examines the dynamics of NTK eigenvectors during EoS in detail. Across different architectures, we observe that larger learning rates cause the leading eigenvectors of the final NTK, as well as the full NTK matrix, to have greater alignment with the training target. We then study…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNeural dynamics and brain function
