Approachability in unknown games: Online learning meets multi-objective   optimization

Shie Mannor (EE-Technion); Vianney Perchet; Gilles Stoltz (GREGH)

arXiv:1402.2043·stat.ML·June 20, 2016·2 cites

Approachability in unknown games: Online learning meets multi-objective optimization

Shie Mannor (EE-Technion), Vianney Perchet, Gilles Stoltz (GREGH)

PDF

Open Access

TL;DR

This paper extends the classical approachability framework to an online learning setting where the game structure is unknown, proposing strategies to approach the best possible set in hindsight despite inherent limitations.

Contribution

It introduces a novel approach to approachability without prior knowledge of the game, proposing achievable goals and strategies in an unknown, multi-objective online learning context.

Findings

01

Impossible to approach the best target set in general

02

Proposed a switching strategy between scalar regret minimizers

03

Applications demonstrated in cost minimization and constrained approachability

Abstract

In the standard setting of approachability there are two players and a target set. The players play repeatedly a known vector-valued game where the first player wants to have the average vector-valued payoff converge to the target set which the other player tries to exclude it from this set. We revisit this setting in the spirit of online learning and do not assume that the first player knows the game structure: she receives an arbitrary vector-valued reward vector at every round. She wishes to approach the smallest ("best") possible set given the observed average payoffs in hindsight. This extension of the standard setting has implications even when the original target set is not approachable and when it is not obvious which expansion of it should be approached instead. We show that it is impossible, in general, to approach the best target set in hindsight and propose achievable though…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Reinforcement Learning in Robotics