An Online Convex Optimization Approach to Blackwell's Approachability

Nahum Shimkin

arXiv:1503.00255·cs.GT·March 3, 2015·1 cites

An Online Convex Optimization Approach to Blackwell's Approachability

Nahum Shimkin

PDF

Open Access

TL;DR

This paper introduces a new approachability algorithm based on online convex optimization, generalizing Blackwell's method and providing a more direct formulation using the support function of convex sets.

Contribution

It presents a novel approachability algorithm leveraging online convex optimization and support functions, extending Blackwell's approach to general convex sets.

Findings

01

The new algorithm generalizes Blackwell's approachability method.

02

It demonstrates convergence to the target set under the proposed framework.

03

The approach unifies and extends previous algorithms for convex target sets.

Abstract

The notion of approachability in repeated games with vector payoffs was introduced by Blackwell in the 1950s, along with geometric conditions for approachability and corresponding strategies that rely on computing {\em steering directions} as projections from the current average payoff vector to the (convex) target set. Recently, Abernethy, Batlett and Hazan (2011) proposed a class of approachability algorithms that rely on the no-regret properties of Online Linear Programming for computing a suitable sequence of steering directions. This is first carried out for target sets that are convex cones, and then generalized to any convex set by embedding it in a higher-dimensional convex cone. In this paper we present a more direct formulation that relies on the support function of the set, along with suitable Online Convex Optimization algorithms, which leads to a general class of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Reinforcement Learning in Robotics