Efficient Regret Minimization in Non-Convex Games

Elad Hazan; Karan Singh; Cyril Zhang

arXiv:1708.00075·cs.LG·November 6, 2017·20 cites

Efficient Regret Minimization in Non-Convex Games

Elad Hazan, Karan Singh, Cyril Zhang

PDF

Open Access

TL;DR

This paper introduces an efficient approach for regret minimization in non-convex games, enabling convergence to equilibrium through gradient-based methods despite the computational challenges of standard regret minimization.

Contribution

It proposes a new notion of regret suitable for non-convex games and provides gradient-based algorithms that achieve optimal regret and convergence guarantees.

Findings

01

Achieves optimal regret bounds in non-convex settings

02

Guarantees convergence to equilibrium using the proposed methods

03

Introduces a computationally feasible regret notion for non-convex games

Abstract

We consider regret minimization in repeated games with non-convex loss functions. Minimizing the standard notion of regret is computationally intractable. Thus, we define a natural notion of regret which permits efficient optimization and generalizes offline guarantees for convergence to an approximate local optimum. We give gradient-based methods that achieve optimal regret, which in turn guarantee convergence to equilibrium in this framework.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Machine Learning and Algorithms