Eigenvalue Distribution of Large Random Matrices Arising in Deep Neural   Networks: Orthogonal Case

Leonid Pastur

arXiv:2201.04543·stat.ML·July 13, 2022

Eigenvalue Distribution of Large Random Matrices Arising in Deep Neural Networks: Orthogonal Case

Leonid Pastur

PDF

TL;DR

This paper investigates the singular value distribution of Jacobians in deep neural networks with orthogonal weight matrices, providing a rigorous proof using random matrix theory techniques for the Haar distributed case.

Contribution

It offers a new rigorous proof for the eigenvalue distribution of Jacobians in deep networks with orthogonal weights, extending previous results to the Haar distributed case.

Findings

01

Eigenvalue distribution matches that of simplified diagonal matrix models.

02

Rigorous proof established for Haar distributed orthogonal weights.

03

Supports the universality of spectral properties in deep network Jacobians.

Abstract

The paper deals with the distribution of singular values of the input-output Jacobian of deep untrained neural networks in the limit of their infinite width. The Jacobian is the product of random matrices where the independent rectangular weight matrices alternate with diagonal matrices whose entries depend on the corresponding column of the nearest neighbor weight matrix. The problem was considered in \cite{Pe-Co:18} for the Gaussian weights and biases and also for the weights that are Haar distributed orthogonal matrices and Gaussian biases. Basing on a free probability argument, it was claimed that in these cases the singular value distribution of the Jacobian in the limit of infinite width (matrix size) coincides with that of the analog of the Jacobian with special random but weight independent diagonal matrices, the case well known in random matrix theory. The claim was rigorously…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.