Last-Iterate Convergence of Randomized Kaczmarz and SGD with Greedy Step Size
Micha{\l} Derezi\'nski, Xiaoyu Dong

TL;DR
This paper analyzes the last-iterate convergence rate of SGD with greedy step size in solving smooth quadratic problems, improving upon previous guarantees and introducing a novel analytical framework.
Contribution
It establishes an $O(1/t^{3/4})$ convergence rate for last-iterate SGD with greedy step size, advancing understanding of iterative linear solvers.
Findings
Achieves a faster $O(1/t^{3/4})$ convergence rate compared to previous $O(1/t^{1/2})$ guarantees.
Introduces stochastic contraction processes and a discrete-to-continuous analysis approach.
Addresses a previously open question in the convergence behavior of these algorithms.
Abstract
We study last-iterate convergence of SGD with greedy step size over smooth quadratics in the interpolation regime, a setting which captures the classical Randomized Kaczmarz algorithm as well as other popular iterative linear system solvers. For these methods, we show that the -th iterate attains an convergence rate, addressing a question posed by Attia, Schliserman, Sherman, and Koren, who gave an guarantee for this setting. In the proof, we introduce the family of stochastic contraction processes, whose behavior can be described by the evolution of a certain deterministic eigenvalue equation, which we analyze via a careful discrete-to-continuous reduction.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
