Regret Matching+: (In)Stability and Fast Convergence in Games
Gabriele Farina, Julien Grand-Cl\'ement, Christian Kroer and, Chung-Wei Lee, Haipeng Luo

TL;DR
This paper investigates the stability and convergence properties of Regret Matching+ (RM+) in large-scale games, identifies instability issues, and proposes fixes that improve regret bounds and practical performance.
Contribution
The paper reveals instability in RM+ algorithms, introduces fixes like restarting and chopping, and proves improved regret bounds and stability in game settings.
Findings
RM+ can be unstable in practice
Proposed fixes achieve $O(T^{1/4})$ individual regret
Algorithms outperform vanilla RM+ in experiments
Abstract
Regret Matching+ (RM+) and its variants are important algorithms for solving large-scale games. However, a theoretical understanding of their success in practice is still a mystery. Moreover, recent advances on fast convergence in games are limited to no-regret algorithms such as online mirror descent, which satisfy stability. In this paper, we first give counterexamples showing that RM+ and its predictive version can be unstable, which might cause other players to suffer large regret. We then provide two fixes: restarting and chopping off the positive orthant that RM+ works in. We show that these fixes are sufficient to get individual regret and social regret in normal-form games via RM+ with predictions. We also apply our stabilizing techniques to clairvoyant updates in the uncoupled learning setting for RM+ and prove desirable results akin to recent works for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Quantum many-body systems · Stochastic Gradient Optimization Techniques
