Characterizing Dynamical Stability of Stochastic Gradient Descent in Overparameterized Learning

Dennis Chemnitz; Maximilian Engel

arXiv:2407.20209·cs.LG·July 18, 2025

Characterizing Dynamical Stability of Stochastic Gradient Descent in Overparameterized Learning

Dennis Chemnitz, Maximilian Engel

PDF

Open Access

TL;DR

This paper analyzes the stability of global minima in overparameterized machine learning models under stochastic gradient descent, introducing a Lyapunov exponent to predict convergence behavior.

Contribution

It introduces a Lyapunov exponent to characterize the stability of minima and rigorously links its sign to SGD's ability to converge to those minima.

Findings

01

Lyapunov exponent determines stability of minima under SGD.

02

Unstable minima cannot be accumulated by SGD.

03

Provides a theoretical framework for understanding convergence in overparameterized models.

Abstract

For overparameterized optimization tasks, such as those found in modern machine learning, global minima are generally not unique. In order to understand generalization in these settings, it is vital to study to which minimum an optimization algorithm converges. The possibility of having minima that are unstable under the dynamics imposed by the optimization algorithm limits the potential minima that the algorithm can find. In this paper, we characterize the global minima that are dynamically stable/unstable for both deterministic and stochastic gradient descent (SGD). In particular, we introduce a characteristic Lyapunov exponent that depends on the local dynamics around a global minimum and rigorously prove that the sign of this Lyapunov exponent determines whether SGD can accumulate at the respective global minimum.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Neural Networks and Applications · Advanced Bandit Algorithms Research

MethodsStochastic Gradient Descent