MLPs at the EOC: Spectrum of the NTK
D\'avid Terj\'ek, Diego Gonz\'alez-S\'anchez

TL;DR
This paper analyzes the spectral properties of the Neural Tangent Kernel for infinitely wide MLPs with specific activations at the Edge Of Chaos, revealing how the kernel's condition number depends on activation parameters and network depth.
Contribution
It introduces a novel approximation of NTK entries via inverse cosine distances and derives spectral bounds, linking activation parameters to kernel conditioning at the EOC.
Findings
Inverse cosine distance approximates NTK entries better with increasing depth.
Spectral bounds depend on the activation parameter ratio , affecting the NTK condition number.
Absolute value activation (=1) yields faster convergence of the NTK condition number.
Abstract
We study the properties of the Neural Tangent Kernel (NTK) corresponding to infinitely wide -layer Multilayer Perceptrons (MLPs) taking inputs from to outputs in equipped with activation functions for some and initialized at the Edge Of Chaos (EOC). We find that the entries can be approximated by the inverses of the cosine distances of the activations corresponding to and increasingly better as the depth increases. By quantifying these inverse cosine distances and the spectrum of the matrix containing them, we obtain tight spectral bounds for the NTK matrix $\overset{\scriptscriptstyle\infty}{K} = [\frac{1}{n}…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsNeural Tangent Kernel · *Communicated@Fast*How Do I Communicate to Expedia?
