Volumetric Spanners: an Efficient Exploration Basis for Learning

Elad Hazan; Zohar Karnin; Raghu Mehka

arXiv:1312.6214·cs.LG·May 27, 2014·5 cites

Volumetric Spanners: an Efficient Exploration Basis for Learning

Elad Hazan, Zohar Karnin, Raghu Mehka

PDF

Open Access

TL;DR

This paper introduces volumetric spanners, a new geometric exploration basis with low variance, enabling the first efficient and optimal regret algorithm for bandit linear optimization over general convex sets.

Contribution

It defines volumetric spanners and provides algorithms to construct them, extending regret minimization results to broader convex sets beyond special cases.

Findings

01

Efficient algorithms for constructing volumetric spanners.

02

First optimal regret algorithm for bandit linear optimization over general convex sets.

03

Extension of previous results to broader convex geometries.

Abstract

Numerous machine learning problems require an exploration basis - a mechanism to explore the action space. We define a novel geometric notion of exploration basis with low variance, called volumetric spanners, and give efficient algorithms to construct such a basis. We show how efficient volumetric spanners give rise to the first efficient and optimal regret algorithm for bandit linear optimization over general convex sets. Previously such results were known only for specific convex sets, or under special conditions such as the existence of an efficient self-concordant barrier for the underlying set.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Reinforcement Learning in Robotics