One-dimensional System Arising in Stochastic Gradient Descent
Konstantinos Karatapanis

TL;DR
This paper analyzes stochastic differential equations modeling one-dimensional stochastic gradient descent, identifying conditions under which the process converges to zero or escapes saddle points, depending on the decay rate parameter.
Contribution
It establishes a threshold for the decay parameter in SDEs related to SGD, determining convergence or escape from saddle points based on the function's behavior.
Findings
Existence of a threshold for determining convergence to zero.
Extension of results to discrete processes with bounded martingale differences.
Ability to escape saddle points for functions with polynomial order second derivatives.
Abstract
We consider SDEs of the form , where behaves comparably to in a neighborhood of the origin, for . We show that there exists a threshold value for , depending on , such that when then , and for the rest of the permissible values . The previous results extend for discrete processes that satisfy . Here, are martingale differences that are a.s. bounded. This result shows that for a function , whose second derivative at degenerate saddle points is of polynomial order, it is always possible to escape saddle points via the iteration for a suitable choice of .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
