Constant Regret, Generalized Mixability, and Mirror Descent
Zakaria Mhammedi, Robert C. Williamson

TL;DR
This paper characterizes which loss functions allow for constant regret in prediction with expert advice using generalized mixability, showing the fundamental role of Shannon entropy and proposing an adaptive algorithm with regret guarantees.
Contribution
It fully characterizes -mixability, establishes the fundamental role of Shannon entropy, and introduces an adaptive GAA with regret analysis.
Findings
-mixability is fully characterized.
Shannon entropy is fundamental for mixability.
The adaptive GAA achieves provable regret bounds.
Abstract
We consider the setting of prediction with expert advice; a learner makes predictions by aggregating those of a group of experts. Under this setting, and for the right choice of loss function and "mixing" algorithm, it is possible for the learner to achieve a constant regret regardless of the number of prediction rounds. For example, a constant regret can be achieved for \emph{mixable} losses using the \emph{aggregating algorithm}. The \emph{Generalized Aggregating Algorithm} (GAA) is a name for a family of algorithms parameterized by convex functions on simplices (entropies), which reduce to the aggregating algorithm when using the \emph{Shannon entropy} . For a given entropy , losses for which a constant regret is possible using the \textsc{GAA} are called -mixable. Which losses are -mixable was previously left as an open question. We fully…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Sparse and Compressive Sensing Techniques
