Response-Based Approachability and its Application to Generalized   No-Regret Algorithms

Andrey Bernstein; Nahum Shimkin

arXiv:1312.7658·cs.LG·December 31, 2013·5 cites

Response-Based Approachability and its Application to Generalized No-Regret Algorithms

Andrey Bernstein, Nahum Shimkin

PDF

Open Access

TL;DR

This paper introduces a novel approachability algorithm based on Blackwell's dual condition, enabling efficient response computations for generalized regret minimization problems in online learning.

Contribution

The paper proposes a response-based approachability algorithm leveraging Blackwell's dual condition, simplifying computations in complex regret minimization scenarios.

Findings

01

The new algorithm effectively handles generalized regret minimization problems.

02

Response-based method reduces computational complexity compared to projection-based approaches.

03

Demonstrated applicability to problems with side constraints and global cost functions.

Abstract

Approachability theory, introduced by Blackwell (1956), provides fundamental results on repeated games with vector-valued payoffs, and has been usefully applied since in the theory of learning in games and to learning algorithms in the online adversarial setup. Given a repeated game with vector payoffs, a target set $S$ is approachable by a certain player (the agent) if he can ensure that the average payoff vector converges to that set no matter what his adversary opponent does. Blackwell provided two equivalent sets of conditions for a convex set to be approachable. The first (primary) condition is a geometric separation condition, while the second (dual) condition requires that the set be {\em non-excludable}, namely that for every mixed action of the opponent there exists a mixed action of the agent (a {\em response}) such that the resulting payoff vector belongs to $S$ . Existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Machine Learning and Algorithms