Global law of conjugate kernel random matrices with heavy-tailed weights
Alice Guionnet, Vanessa Piccolo

TL;DR
This paper analyzes the spectral distribution of conjugate kernel matrices in neural networks with heavy-tailed weight distributions, revealing unique spectral behaviors due to heavy tails.
Contribution
It extends the understanding of neural network spectral properties to heavy-tailed weight matrices, including stable laws and sparse matrices, with new theoretical results.
Findings
Heavy-tailed weights cause strong correlations in $Y$ entries.
Spectral distribution differs significantly from light-tailed cases.
Derived moments characterize the limiting eigenvalue distribution.
Abstract
We study the asymptotic spectral distribution of the conjugate kernel random matrix , where arises from a two-layer neural network model. We consider the setting where and are random rectangular matrices with i.i.d.\ entries, where the entries of follow a heavy-tailed distribution, while those of have light tails. Our assumptions on include a broad class of heavy-tailed distributions, such as symmetric -stable laws with and sparse matrices with nonzero entries per row. The activation function , applied entrywise, is bounded, smooth, odd, and nonlinear. We compute the limiting eigenvalue distribution of through its moments and show that heavy-tailed weights induce strong correlations between the entries of , resulting in richer and fundamentally different spectral behavior compared to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Advanced Algebra and Geometry · advanced mathematical theories
