Largest Eigenvalues of the Conjugate Kernel of Single-Layered Neural   Networks

Lucas Benigni; Sandrine P\'ech\'e

arXiv:2201.04753·math.PR·January 14, 2022·1 cites

Largest Eigenvalues of the Conjugate Kernel of Single-Layered Neural Networks

Lucas Benigni, Sandrine P\'ech\'e

PDF

Open Access

TL;DR

This paper analyzes the asymptotic behavior of the largest eigenvalues of the conjugate kernel in single-layer neural networks, revealing phase transitions and connections to well-known random matrix models.

Contribution

It establishes the limiting distribution of the largest eigenvalue for nonlinear random matrices from neural networks, linking it to classical models and identifying phase transitions.

Findings

01

Largest eigenvalue converges to a known limit in probability

02

Identifies phase transition depending on activation function and data distribution

03

Connects neural network conjugate kernel eigenvalues to information-plus-noise models

Abstract

This paper is concerned with the asymptotic distribution of the largest eigenvalues for some nonlinear random matrix ensemble stemming from the study of neural networks. More precisely we consider $M = \frac{1}{m} Y Y^{⊤}$ with $Y = f (W X)$ where $W$ and $X$ are random rectangular matrices with i.i.d. centered entries. This models the data covariance matrix or the Conjugate Kernel of a single layered random Feed-Forward Neural Network. The function $f$ is applied entrywise and can be seen as the activation function of the neural network. We show that the largest eigenvalue has the same limit (in probability) as that of some well-known linear random matrix ensembles. In particular, we relate the asymptotic limit of the largest eigenvalue for the nonlinear model to that of an information-plus-noise random matrix, establishing a possible phase transition depending on the function $f$ and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMatrix Theory and Algorithms · Random Matrices and Applications · Statistical Mechanics and Entropy