A Fine-Grained Spectral Perspective on Neural Networks
Greg Yang, Hadi Salman

TL;DR
This paper uses spectral analysis of Conjugate Kernel and Neural Tangent Kernel to understand neural network properties like bias, depth effects, and training strategies, supported by theoretical insights and extensive experiments.
Contribution
It introduces a spectral perspective to analyze neural networks, providing new insights into their training dynamics and generalization, with efficient algorithms for spectral computation.
Findings
Spectra of CK and NTK reveal network bias toward simple functions.
Eigenvalues inform about the impact of depth and training strategies.
Spectral properties are consistent across high-dimensional data distributions.
Abstract
Are neural networks biased toward simple functions? Does depth always help learn more complex features? Is training the last layer of a network as good as training all layers? How to set the range for learning rate tuning? These questions seem unrelated at face value, but in this work we give all of them a common treatment from the spectral perspective. We will study the spectra of the *Conjugate Kernel, CK,* (also called the *Neural Network-Gaussian Process Kernel*), and the *Neural Tangent Kernel, NTK*. Roughly, the CK and the NTK tell us respectively "what a network looks like at initialization" and "what a network looks like during and after training." Their spectra then encode valuable information about the initial distribution and the training and generalization properties of neural networks. By analyzing the eigenvalues, we lend novel insights into the questions put forth at the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Neural Networks and Applications · Advanced Neural Network Applications
MethodsNeural Tangent Kernel
