MLPs at the EOC: Concentration of the NTK

D\'avid Terj\'ek; Diego Gonz\'alez-S\'anchez

arXiv:2501.14724·cs.LG·January 27, 2025

MLPs at the EOC: Concentration of the NTK

D\'avid Terj\'ek, Diego Gonz\'alez-S\'anchez

PDF

Open Access

TL;DR

This paper analyzes the concentration of the Neural Tangent Kernel (NTK) in multilayer perceptrons initialized at the Edge Of Chaos, showing finite-width conditions for the NTK to approximate its infinite-width limit without relying on gradient independence.

Contribution

It proves that the NTK concentrates around its limit at finite width for MLPs with specific activation functions, without assuming linear overparameterization, and identifies quadratic hidden layer width growth as sufficient.

Findings

01

NTK concentrates around its infinite-width limit at finite width.

02

Activation functions with certain parameters improve NTK concentration.

03

Quadratic growth in hidden layer widths ensures accurate NTK approximation.

Abstract

We study the concentration of the Neural Tangent Kernel (NTK) $K_{θ} : R^{m_{0}} \times R^{m_{0}} \to R^{m_{l} \times m_{l}}$ of $l$ -layer Multilayer Perceptrons (MLPs) $N : R^{m_{0}} \times Θ \to R^{m_{l}}$ equipped with activation functions $ϕ (s) = a s + b ∣ s ∣$ for some $a, b \in R$ with the parameter $θ \in Θ$ being initialized at the Edge Of Chaos (EOC). Without relying on the gradient independence assumption that has only been shown to hold asymptotically in the infinitely wide limit, we prove that an approximate version of gradient independence holds at finite width. Showing that the NTK entries $K_{θ} (x_{i_{1}}, x_{i_{2}})$ for $i_{1}, i_{2} \in [1 : n]$ over a dataset ${x_{1}, \dots, x_{n}} \subset R^{m_{0}}$ concentrate simultaneously via maximal inequalities, we prove that the NTK matrix $K(\theta) =…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMedical Imaging and Pathology Studies

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Neural Tangent Kernel