On Random Matrices Arising in Deep Neural Networks. Gaussian Case
Leonid Pastur

TL;DR
This paper analyzes the distribution of singular values of products of Gaussian random matrices in deep neural networks, providing theoretical justification for free probability approaches and extending results to broader data distributions.
Contribution
It offers a rigorous justification for using free probability in analyzing neural network matrices with Gaussian data and extends universality results to non-Gaussian data.
Findings
Distribution of singular values characterized for Gaussian data matrices.
Justification of free probability methods in this context.
Extension of universality to non-Gaussian data with finite moments.
Abstract
The paper deals with distribution of singular values of product of random matrices arising in the analysis of deep neural networks. The matrices resemble the product analogs of the sample covariance matrices, however, an important difference is that the population covariance matrices, which are assumed to be non-random in the standard setting of statistics and random matrix theory, are now random, moreover, are certain functions of random data matrices. The problem has been considered in recent work [21] by using the techniques of free probability theory. Since, however, free probability theory deals with population matrices which are independent of the data matrices, its applicability in this case requires an additional justification. We present this justification by using a version of the standard techniques of random matrix theory under the assumption that the entries of data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy · Random Matrices and Applications · Neural Networks and Applications
