Entropic Regularization in the Deep Linear Network

Alan Chen; Tejas Kotwal; Govind Menon

arXiv:2512.06137·cs.NE·December 9, 2025

Entropic Regularization in the Deep Linear Network

Alan Chen, Tejas Kotwal, Govind Menon

PDF

Open Access

TL;DR

This paper analyzes the effects of entropic regularization on deep linear networks, characterizing equilibria, gradient flow dynamics, and the concavity of entropy in different geometric settings.

Contribution

It provides a detailed theoretical analysis of entropic regularization in DLNs, including explicit solutions for gradient flow and insights into the geometry of the entropy landscape.

Findings

01

Equilibria are minimizers forming an orthogonal orbit.

02

Gradient flow reduces to a one-dimensional ODE with explicit relaxation rates.

03

Entropy is strictly concave in Euclidean geometry but not in Riemannian geometry.

Abstract

We study regularization for the deep linear network (DLN) using the entropy formula introduced in arXiv:2509.09088. The equilibria and gradient flow of the free energy on the Riemannian manifold of end-to-end maps of the DLN are characterized for energies that depend symmetrically on the singular values of the end-to-end matrix. The only equilibria are minimizers and the set of minimizers is an orbit of the orthogonal group. In contrast with random matrix theory there is no singular value repulsion. The corresponding gradient flow reduces to a one-dimensional ordinary differential equation whose solution gives explicit relaxation rates toward the minimizers. We also study the concavity of the entropy in the chamber of singular values. The entropy is shown to be strictly concave in the Euclidean geometry on the chamber but not in the Riemannian geometry defined by the DLN metric.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Quantum many-body systems · Statistical Mechanics and Entropy