Sharp Dichotomies for Regret Minimization in Metric Spaces

Robert Kleinberg; Aleksandrs Slivkins

arXiv:0911.1174·cs.DS·November 9, 2009

Sharp Dichotomies for Regret Minimization in Metric Spaces

Robert Kleinberg, Aleksandrs Slivkins

PDF

Open Access

TL;DR

This paper establishes a universal dichotomy in regret bounds for Lipschitz multi-armed bandit problems across all metric spaces, linking the bounds to topological properties like compactness and countability.

Contribution

It proves a universal dichotomy for regret bounds in Lipschitz MAB problems, connecting online learning bounds with classical topology.

Findings

01

Regret is either logarithmic or sublinear, depending on the metric space's topology.

02

The dichotomy depends on whether the metric space's completion is compact and countable.

03

A similar dichotomy is shown for the full-feedback (best-expert) setting.

Abstract

The Lipschitz multi-armed bandit (MAB) problem generalizes the classical multi-armed bandit problem by assuming one is given side information consisting of a priori upper bounds on the difference in expected payoff between certain pairs of strategies. Classical results of (Lai and Robbins 1985) and (Auer et al. 2002) imply a logarithmic regret bound for the Lipschitz MAB problem on finite metric spaces. Recent results on continuum-armed bandit problems and their generalizations imply lower bounds of $t$ , or stronger, for many infinite metric spaces such as the unit interval. Is this dichotomy universal? We prove that the answer is yes: for every metric space, the optimal regret of a Lipschitz MAB algorithm is either bounded above by any $f \in ω (lo g t)$ , or bounded below by any $g \in o (t)$ . Perhaps surprisingly, this dichotomy does not coincide with the distinction…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms