Random weights of DNNs and emergence of fixed points
L. Berlyand, O. Krupchytskyi, V. Slavin

TL;DR
This paper investigates how the distribution of random weights in deep neural networks affects the number and stability of fixed points, revealing different behaviors for light and heavy-tail distributions and their dependence on network architecture.
Contribution
It provides a novel analysis of fixed points in randomly initialized DNNs, highlighting the impact of weight distribution tails and network depth on fixed point properties.
Findings
Light tails lead to a single stable fixed point across architectures.
Heavy tails result in multiple stable fixed points.
Number of fixed points varies non-monotonically with network depth.
Abstract
This paper is concerned with a special class of deep neural networks (DNNs) where the input and the output vectors have the same dimension. Such DNNs are widely used in applications, e.g., autoencoders. The training of such networks can be characterized by their fixed points (FPs). We are concerned with the dependence of the FPs number and their stability on the distribution of randomly initialized DNNs' weight matrices. Specifically, we consider the i.i.d. random weights with heavy and light-tail distributions. Our objectives are twofold. First, the dependence of FPs number and stability of FPs on the type of the distribution tail. Second, the dependence of the number of FPs on the DNNs' architecture. We perform extensive simulations and show that for light tails (e.g., Gaussian), which are typically used for initialization, a single stable FP exists for broad types of architectures.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
