From Stability to Chaos: Analyzing Gradient Descent Dynamics in Quadratic Regression
Xuxing Chen, Krishnakumar Balasubramanian, Promit Ghosal, Bhavya, Agrawalla

TL;DR
This paper analyzes the complex dynamics of gradient descent in quadratic regression, revealing five distinct training phases including chaos, and explores their implications for generalization and stability.
Contribution
It introduces a cubic map model for gradient descent dynamics and characterizes five training phases through bifurcation analysis, extending insights to neural networks and data types.
Findings
Identification of five training phases: monotonic, catapult, periodic, chaotic, divergent.
Demonstration of phase boundaries via bifurcation analysis.
Empirical evidence that phase behavior persists with non-orthogonal data and affects generalization.
Abstract
We conduct a comprehensive investigation into the dynamics of gradient descent using large-order constant step-sizes in the context of quadratic regression models. Within this framework, we reveal that the dynamics can be encapsulated by a specific cubic map, naturally parameterized by the step-size. Through a fine-grained bifurcation analysis concerning the step-size parameter, we delineate five distinct training phases: (1) monotonic, (2) catapult, (3) periodic, (4) chaotic, and (5) divergent, precisely demarcating the boundaries of each phase. As illustrations, we provide examples involving phase retrieval and two-layer neural networks employing quadratic activation functions and constant outer-layers, utilizing orthogonal training data. Our simulations indicate that these five phases also manifest with generic non-orthogonal data. We also empirically investigate the generalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced X-ray Imaging Techniques · Machine Learning in Materials Science · Model Reduction and Neural Networks
