Constant Regret, Generalized Mixability, and Mirror Descent

Zakaria Mhammedi; Robert C. Williamson

arXiv:1802.06965·cs.LG·November 1, 2018·1 cites

Constant Regret, Generalized Mixability, and Mirror Descent

Zakaria Mhammedi, Robert C. Williamson

PDF

Open Access

TL;DR

This paper characterizes which loss functions allow for constant regret in prediction with expert advice using generalized mixability, showing the fundamental role of Shannon entropy and proposing an adaptive algorithm with regret guarantees.

Contribution

It fully characterizes -mixability, establishes the fundamental role of Shannon entropy, and introduces an adaptive GAA with regret analysis.

Findings

01

-mixability is fully characterized.

02

Shannon entropy is fundamental for mixability.

03

The adaptive GAA achieves provable regret bounds.

Abstract

We consider the setting of prediction with expert advice; a learner makes predictions by aggregating those of a group of experts. Under this setting, and for the right choice of loss function and "mixing" algorithm, it is possible for the learner to achieve a constant regret regardless of the number of prediction rounds. For example, a constant regret can be achieved for \emph{mixable} losses using the \emph{aggregating algorithm}. The \emph{Generalized Aggregating Algorithm} (GAA) is a name for a family of algorithms parameterized by convex functions on simplices (entropies), which reduce to the aggregating algorithm when using the \emph{Shannon entropy} $S$ . For a given entropy $Φ$ , losses for which a constant regret is possible using the \textsc{GAA} are called $Φ$ -mixable. Which losses are $Φ$ -mixable was previously left as an open question. We fully…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Sparse and Compressive Sensing Techniques