Regret Matching+: (In)Stability and Fast Convergence in Games

Gabriele Farina; Julien Grand-Cl\'ement; Christian Kroer and; Chung-Wei Lee; Haipeng Luo

arXiv:2305.14709·cs.GT·May 25, 2023·1 cites

Regret Matching+: (In)Stability and Fast Convergence in Games

Gabriele Farina, Julien Grand-Cl\'ement, Christian Kroer and, Chung-Wei Lee, Haipeng Luo

PDF

Open Access 1 Video

TL;DR

This paper investigates the stability and convergence properties of Regret Matching+ (RM+) in large-scale games, identifies instability issues, and proposes fixes that improve regret bounds and practical performance.

Contribution

The paper reveals instability in RM+ algorithms, introduces fixes like restarting and chopping, and proves improved regret bounds and stability in game settings.

Findings

01

RM+ can be unstable in practice

02

Proposed fixes achieve $O(T^{1/4})$ individual regret

03

Algorithms outperform vanilla RM+ in experiments

Abstract

Regret Matching+ (RM+) and its variants are important algorithms for solving large-scale games. However, a theoretical understanding of their success in practice is still a mystery. Moreover, recent advances on fast convergence in games are limited to no-regret algorithms such as online mirror descent, which satisfy stability. In this paper, we first give counterexamples showing that RM+ and its predictive version can be unstable, which might cause other players to suffer large regret. We then provide two fixes: restarting and chopping off the positive orthant that RM+ works in. We show that these fixes are sufficient to get $O (T^{1/4})$ individual regret and $O (1)$ social regret in normal-form games via RM+ with predictions. We also apply our stabilizing techniques to clairvoyant updates in the uncoupled learning setting for RM+ and prove desirable results akin to recent works for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Regret Matching+: (In)Stability and Fast Convergence in Games· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Quantum many-body systems · Stochastic Gradient Optimization Techniques