High-Dimensional Limit of Stochastic Gradient Flow via Dynamical Mean-Field Theory
Sota Nishiyama, Masaaki Imaizumi

TL;DR
This paper develops a dynamical mean-field theory framework to analyze the high-dimensional asymptotic behavior of stochastic gradient flow, providing a unifying perspective on SGD dynamics in complex models.
Contribution
It introduces a novel DMFT-based analytical approach to characterize the high-dimensional limit of stochastic gradient flow for nonlinear models, extending existing theories.
Findings
Derives low-dimensional equations describing SGF in high dimensions
Recovers existing high-dimensional SGD descriptions as special cases
Provides a unified theoretical framework for analyzing SGD dynamics
Abstract
Modern machine learning models are typically trained via multi-pass stochastic gradient descent (SGD) with small batch sizes, and understanding their dynamics in high dimensions is of great interest. However, an analytical framework for describing the high-dimensional asymptotic behavior of multi-pass SGD with small batch sizes for nonlinear models is currently missing. In this study, we address this gap by analyzing the high-dimensional dynamics of a stochastic differential equation called a \emph{stochastic gradient flow} (SGF), which approximates multi-pass SGD in this regime. In the limit where the number of data samples and the dimension grow proportionally, we derive a closed system of low-dimensional and continuous-time equations and prove that it characterizes the asymptotic distribution of the SGF parameters. Our theory is based on the dynamical mean-field theory (DMFT)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Model Reduction and Neural Networks · Markov Chains and Monte Carlo Methods
