Hedging in games: Faster convergence of external and swap regrets

Xi Chen; Binghui Peng

arXiv:2006.04953 · cs.GT · October 21, 2020 · 6 cites

TL;DR

This paper tightens regret bounds for players running Hedge in repeated n-action games: optimistic Hedge attains faster regret decay than previously known, vanilla Hedge provably cannot match it, and the upper bounds imply faster convergence to equilibria.

Contribution

It provides new regret decay rates for optimistic Hedge, clarifies the limitations of vanilla Hedge, and extends results to multi-player games with faster convergence to equilibria.

Findings

01

In two-player games, the regret of optimistic Hedge decays at rate $\tilde{O}(1/T^{5/6})$, improving the previous $O(1/T^{3/4})$ bound.

02

The regret of vanilla Hedge decays no faster than $\tilde{\Omega}(1/T^{1/2})$.

03

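In general m-player games, each player's swap regret decays at rate $\tilde{O}(m^{1/2}(n/T)^{3/4})$ when optimistic Hedge is combined with the external-to-internal reduction of Blum and Mansour.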

Abstract

We consider the setting where players run the Hedge algorithm or its optimistic variant to play an $n$-action game repeatedly for $T$ rounds. 1) For two-player games, we show that the regret of optimistic Hedge decays at $\tilde{O}(1/T^{5/6})$, improving the previous bound $O(1/T^{3/4})$ by Syrgkanis, Agarwal, Luo and Schapire (NIPS'15). 2) In contrast, we show that the convergence rate of vanilla Hedge is no better than $\tilde{\Omega}(1/\sqrt{T})$, addressing an open question posed in Syrgkanis, Agarwal, Luo and Schapire (NIPS'15). For general $m$-player games, we show that the swap regret of each player decays at rate $\tilde{O}(m^{1/2}(n/T)^{3/4})$ when they combine optimistic Hedge with the classical external-to-internal reduction of Blum and Mansour (JMLR'07). The algorithm can also be modified to achieve the same rate against itself and a rate of $\tilde{O}(\sqrt{n/T})$ …
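To make the setting in the abstract concrete, the following is a minimal sketch of two players running Hedge or optimistic Hedge against each other and measuring their average external regret. It is an illustration under stated assumptions, not the authors' code: the function names `hedge_distribution` and `self_play`, the fixed step size `eta`, and the 3×3 loss matrix are all hypothetical choices, and the paper's step-size tuning is not reproduced here. Hedge plays each action with probability proportional to the exponential of minus `eta` times its cumulative loss; the optimistic variant additionally counts the most recent loss vector once more as a prediction of the next one.

```python
import math


def hedge_distribution(scores, eta):
    """Distribution with probability proportional to exp(-eta * score)."""
    base = min(scores)  # shift scores so the largest weight is exp(0) = 1
    weights = [math.exp(-eta * (s - base)) for s in scores]
    total = sum(weights)
    return [w / total for w in weights]


def self_play(loss_row, loss_col, T, eta, optimistic):
    """Run (optimistic) Hedge for both players; return their average external regrets.

    loss_row[i][j] and loss_col[i][j] are the losses, in [0, 1], of the row and
    column player when the row player picks i and the column player picks j.
    """
    n_row, n_col = len(loss_row), len(loss_row[0])
    cum_x, cum_y = [0.0] * n_row, [0.0] * n_col    # cumulative loss vectors
    last_x, last_y = [0.0] * n_row, [0.0] * n_col  # most recent loss vectors
    paid_x = paid_y = 0.0                          # expected loss actually incurred
    bonus = 1.0 if optimistic else 0.0

    for _ in range(T):
        # Optimistic Hedge counts the most recent loss vector twice, using it
        # as a prediction of the next one; vanilla Hedge uses the plain sum.
        x = hedge_distribution([c + bonus * l for c, l in zip(cum_x, last_x)], eta)
        y = hedge_distribution([c + bonus * l for c, l in zip(cum_y, last_y)], eta)

        # Expected loss of each pure action against the opponent's mixed strategy.
        last_x = [sum(loss_row[i][j] * y[j] for j in range(n_col)) for i in range(n_row)]
        last_y = [sum(loss_col[i][j] * x[i] for i in range(n_row)) for j in range(n_col)]

        paid_x += sum(p * l for p, l in zip(x, last_x))
        paid_y += sum(p * l for p, l in zip(y, last_y))
        cum_x = [c + l for c, l in zip(cum_x, last_x)]
        cum_y = [c + l for c, l in zip(cum_y, last_y)]

    # External regret: incurred loss minus the loss of the best fixed action.
    return (paid_x - min(cum_x)) / T, (paid_y - min(cum_y)) / T


if __name__ == "__main__":
    # Hypothetical zero-sum example: the column player's loss is 1 - row loss.
    A = [[0.0, 1.0, 0.3], [0.6, 0.2, 1.0], [1.0, 0.4, 0.0]]
    B = [[1.0 - a for a in row] for row in A]
    for optimistic in (False, True):
        print(optimistic, self_play(A, B, T=2000, eta=0.1, optimistic=optimistic))
```

A single run with one fixed `eta` only shows the mechanics of the two updates; it does not verify the asymptotic rates stated in the abstract, which depend on how the step size is chosen as a function of $T$.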

Videos

Hedging in games: Faster convergence of external and swap regrets · SlidesLive

Taxonomy

Topics: Advanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Game Theory and Applications