Random weights of DNNs and emergence of fixed points

L. Berlyand; O. Krupchytskyi; V. Slavin

arXiv:2501.04182·cs.LG·July 8, 2025

Random weights of DNNs and emergence of fixed points

L. Berlyand, O. Krupchytskyi, V. Slavin

PDF

Open Access

TL;DR

This paper investigates how the distribution of random weights in deep neural networks affects the number and stability of fixed points, revealing different behaviors for light and heavy-tail distributions and their dependence on network architecture.

Contribution

It provides a novel analysis of fixed points in randomly initialized DNNs, highlighting the impact of weight distribution tails and network depth on fixed point properties.

Findings

01

Light tails lead to a single stable fixed point across architectures.

02

Heavy tails result in multiple stable fixed points.

03

Number of fixed points varies non-monotonically with network depth.

Abstract

This paper is concerned with a special class of deep neural networks (DNNs) where the input and the output vectors have the same dimension. Such DNNs are widely used in applications, e.g., autoencoders. The training of such networks can be characterized by their fixed points (FPs). We are concerned with the dependence of the FPs number and their stability on the distribution of randomly initialized DNNs' weight matrices. Specifically, we consider the i.i.d. random weights with heavy and light-tail distributions. Our objectives are twofold. First, the dependence of FPs number and stability of FPs on the type of the distribution tail. Second, the dependence of the number of FPs on the DNNs' architecture. We perform extensive simulations and show that for light tails (e.g., Gaussian), which are typically used for initialization, a single stable FP exists for broad types of architectures.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications