Response-Based Approachability and its Application to Generalized No-Regret Algorithms
Andrey Bernstein, Nahum Shimkin

TL;DR
This paper introduces a novel approachability algorithm based on Blackwell's dual condition, enabling efficient response computations for generalized regret minimization problems in online learning.
Contribution
The paper proposes a response-based approachability algorithm leveraging Blackwell's dual condition, simplifying computations in complex regret minimization scenarios.
Findings
The new algorithm effectively handles generalized regret minimization problems.
Response-based method reduces computational complexity compared to projection-based approaches.
Demonstrated applicability to problems with side constraints and global cost functions.
Abstract
Approachability theory, introduced by Blackwell (1956), provides fundamental results on repeated games with vector-valued payoffs, and has been usefully applied since in the theory of learning in games and to learning algorithms in the online adversarial setup. Given a repeated game with vector payoffs, a target set is approachable by a certain player (the agent) if he can ensure that the average payoff vector converges to that set no matter what his adversary opponent does. Blackwell provided two equivalent sets of conditions for a convex set to be approachable. The first (primary) condition is a geometric separation condition, while the second (dual) condition requires that the set be {\em non-excludable}, namely that for every mixed action of the opponent there exists a mixed action of the agent (a {\em response}) such that the resulting payoff vector belongs to . Existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Machine Learning and Algorithms
