Last-Iterate Convergence of Randomized Kaczmarz and SGD with Greedy Step Size

Micha{\l} Derezi\'nski; Xiaoyu Dong

arXiv:2604.09909·cs.LG·April 14, 2026

Last-Iterate Convergence of Randomized Kaczmarz and SGD with Greedy Step Size

Micha{\l} Derezi\'nski, Xiaoyu Dong

PDF

TL;DR

This paper analyzes the last-iterate convergence rate of SGD with greedy step size in solving smooth quadratic problems, improving upon previous guarantees and introducing a novel analytical framework.

Contribution

It establishes an $O(1/t^{3/4})$ convergence rate for last-iterate SGD with greedy step size, advancing understanding of iterative linear solvers.

Findings

01

Achieves a faster $O(1/t^{3/4})$ convergence rate compared to previous $O(1/t^{1/2})$ guarantees.

02

Introduces stochastic contraction processes and a discrete-to-continuous analysis approach.

03

Addresses a previously open question in the convergence behavior of these algorithms.

Abstract

We study last-iterate convergence of SGD with greedy step size over smooth quadratics in the interpolation regime, a setting which captures the classical Randomized Kaczmarz algorithm as well as other popular iterative linear system solvers. For these methods, we show that the $t$ -th iterate attains an $O (1/ t^{3/4})$ convergence rate, addressing a question posed by Attia, Schliserman, Sherman, and Koren, who gave an $O (1/ t^{1/2})$ guarantee for this setting. In the proof, we introduce the family of stochastic contraction processes, whose behavior can be described by the evolution of a certain deterministic eigenvalue equation, which we analyze via a careful discrete-to-continuous reduction.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.