Last-Iterate Guarantees for Learning in Co-coercive Games

Siddharth Chandak; Ramanan Tamizholi; Nicholas Bambos

arXiv:2604.19065·cs.GT·April 22, 2026

Last-Iterate Guarantees for Learning in Co-coercive Games

Siddharth Chandak, Ramanan Tamizholi, Nicholas Bambos

PDF

TL;DR

This paper proves finite-time last-iterate convergence guarantees for stochastic gradient descent in a broad class of co-coercive games under realistic noise conditions, extending prior results.

Contribution

It introduces the first last-iterate convergence bound for co-coercive games with non-vanishing noise, broadening understanding of learning dynamics.

Findings

01

Last-iterate bound of order O(log(t)/t^{1/3}) under general noise

02

Almost sure convergence of iterates to Nash equilibria

03

Time-average convergence guarantees

Abstract

We establish finite-time last-iterate guarantees for vanilla stochastic gradient descent in co-coercive games under noisy feedback. This is a broad class of games that is more general than strongly monotone games, allows for multiple Nash equilibria, and includes examples such as quadratic games with negative semidefinite interaction matrices and potential games with smooth concave potentials. Prior work in this setting has relied on relative noise models, where the noise vanishes as iterates approach equilibrium, an assumption that is often unrealistic in practice. We work instead under a substantially more general noise model in which the second moment of the noise is allowed to scale affinely with the squared norm of the iterates, an assumption natural in learning with unbounded action spaces. Under this model, we prove a last-iterate bound of order $O (lo g (t) / t^{1/3})$ , the first…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.