Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games

Tianyi Lin; Zhengyuan Zhou; Panayotis Mertikopoulos; Michael I.; Jordan

arXiv:2002.09806·math.OC·July 20, 2021·6 cites

Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games

Tianyi Lin, Zhengyuan Zhou, Panayotis Mertikopoulos, Michael I., Jordan

PDF

Open Access 1 Video

TL;DR

This paper establishes finite-time last-iterate convergence rates for multi-agent online gradient descent in $\lambda$-cocoercive games, introduces a fully adaptive algorithm, and extends results to noisy feedback scenarios, filling key gaps in the literature.

Contribution

It provides the first finite-time convergence analysis for adaptive multi-agent learning algorithms in $\lambda$-cocoercive games, including noisy feedback cases.

Findings

01

Finite-time last-iterate convergence rates established.

02

Adaptive OGD algorithm matches non-adaptive convergence.

03

Results extend to noisy gradient feedback with non-decreasing step-sizes.

Abstract

In this paper, we consider multi-agent learning via online gradient descent in a class of games called $λ$ -cocoercive games, a fairly broad class of games that admits many Nash equilibria and that properly includes unconstrained strongly monotone games. We characterize the finite-time last-iterate convergence rate for joint OGD learning on $λ$ -cocoercive games; further, building on this result, we develop a fully adaptive OGD learning algorithm that does not require any knowledge of problem parameter (e.g. cocoercive constant $λ$ ) and show, via a novel double-stopping time technique, that this adaptive algorithm achieves same finite-time last-iterate convergence rate as non-adaptive counterpart. Subsequently, we extend OGD learning to the noisy gradient feedback case and establish last-iterate convergence results -- first qualitative almost sure convergence, then…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Adaptive Dynamic Programming Control