Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games
Tianyi Lin, Zhengyuan Zhou, Panayotis Mertikopoulos, Michael I., Jordan

TL;DR
This paper establishes finite-time last-iterate convergence rates for multi-agent online gradient descent in $\lambda$-cocoercive games, introduces a fully adaptive algorithm, and extends results to noisy feedback scenarios, filling key gaps in the literature.
Contribution
It provides the first finite-time convergence analysis for adaptive multi-agent learning algorithms in $\lambda$-cocoercive games, including noisy feedback cases.
Findings
Finite-time last-iterate convergence rates established.
Adaptive OGD algorithm matches non-adaptive convergence.
Results extend to noisy gradient feedback with non-decreasing step-sizes.
Abstract
In this paper, we consider multi-agent learning via online gradient descent in a class of games called -cocoercive games, a fairly broad class of games that admits many Nash equilibria and that properly includes unconstrained strongly monotone games. We characterize the finite-time last-iterate convergence rate for joint OGD learning on -cocoercive games; further, building on this result, we develop a fully adaptive OGD learning algorithm that does not require any knowledge of problem parameter (e.g. cocoercive constant ) and show, via a novel double-stopping time technique, that this adaptive algorithm achieves same finite-time last-iterate convergence rate as non-adaptive counterpart. Subsequently, we extend OGD learning to the noisy gradient feedback case and establish last-iterate convergence results -- first qualitative almost sure convergence, then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Adaptive Dynamic Programming Control
