On Random Matrices Arising in Deep Neural Networks: General I.I.D. Case
L. Pastur, V. Slavin

TL;DR
This paper analyzes the singular value distribution of products of random matrices in deep neural networks, extending previous results to more general i.i.d. entries and validating the universality of these properties.
Contribution
It introduces a streamlined random matrix theory approach to generalize singular value distribution results to i.i.d. matrices with finite fourth moments in neural network models.
Findings
Extended universality results to i.i.d. matrices with finite fourth moments.
Validated the applicability of free probability techniques in neural network analysis.
Provided a more general framework for understanding singular values in deep learning models.
Abstract
We study the distribution of singular values of product of random matrices pertinent to the analysis of deep neural networks. The matrices resemble the product of the sample covariance matrices, however, an important difference is that the population covariance matrices assumed to be non-random or random but independent of the random data matrix in statistics and random matrix theory are now certain functions of random data matrices (synaptic weight matrices in the deep neural network terminology). The problem has been treated in recent work [25, 13] by using the techniques of free probability theory. Since, however, free probability theory deals with population covariance matrices which are independent of the data matrices, its applicability has to be justified. The justification has been given in [22] for Gaussian data matrices with independent entries, a standard analytical model of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Statistical Mechanics and Entropy · Random Matrices and Applications
