Let's be Honest: An Optimal No-Regret Framework for Zero-Sum Games

Ehsan Asadi Kangarshahi; Ya-Ping Hsieh; Mehmet Fatih Sahin; Volkan; Cevher

arXiv:1802.04221·cs.GT·June 7, 2018

Let's be Honest: An Optimal No-Regret Framework for Zero-Sum Games

Ehsan Asadi Kangarshahi, Ya-Ping Hsieh, Mehmet Fatih Sahin, Volkan, Cevher

PDF

Open Access

TL;DR

This paper introduces a unified framework for solving two-player zero-sum games that achieves optimal regret bounds and removes previous convergence limitations, applicable in both honest and adversarial settings.

Contribution

It presents a novel analysis of optimistic mirror descent, proposes the robust optimistic mirror descent algorithm, and introduces a signaling scheme to combine their advantages.

Findings

01

Achieves fast convergence for honest regret and game value.

02

Attains optimal adversarial regret without prior horizon knowledge.

03

Numerical results show competitive performance of the proposed algorithms.

Abstract

We revisit the problem of solving two-player zero-sum games in the decentralized setting. We propose a simple algorithmic framework that simultaneously achieves the best rates for honest regret as well as adversarial regret, and in addition resolves the open problem of removing the logarithmic terms in convergence to the value of the game. We achieve this goal in three steps. First, we provide a novel analysis of the optimistic mirror descent (OMD), showing that it can be modified to guarantee fast convergence for both honest regret and value of the game, when the players are playing collaboratively. Second, we propose a new algorithm, dubbed as robust optimistic mirror descent (ROMD), which attains optimal adversarial regret without knowing the time horizon beforehand. Finally, we propose a simple signaling scheme, which enables us to bridge OMD and ROMD to achieve the best of both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Quantum Computing Algorithms and Architecture · Stochastic Gradient Optimization Techniques