Adaptive Hedge

Tim van Erven; Peter Gr\"unwald; Wouter M. Koolen; Steven de Rooij

arXiv:1110.6416·stat.ML·March 4, 2015

Adaptive Hedge

Tim van Erven, Peter Gr\"unwald, Wouter M. Koolen, Steven de Rooij

PDF

Open Access

TL;DR

This paper introduces an adaptive version of the Hedge algorithm that dynamically adjusts the learning rate based on problem difficulty, achieving optimal worst-case performance and significantly lower regret on easier instances.

Contribution

It presents a novel adaptive learning rate scheme for Hedge, improving performance on easy problems while maintaining worst-case guarantees.

Findings

01

Achieves constant regret in probabilistic settings with a better action.

02

Outperforms existing methods in simulations on easy instances.

03

Maintains optimal worst-case performance.

Abstract

Most methods for decision-theoretic online learning are based on the Hedge algorithm, which takes a parameter called the learning rate. In most previous analyses the learning rate was carefully tuned to obtain optimal worst-case performance, leading to suboptimal performance on easy instances, for example when there exists an action that is significantly better than all others. We propose a new way of setting the learning rate, which adapts to the difficulty of the learning problem: in the worst case our procedure still guarantees optimal performance, but on easy instances it achieves much smaller regret. In particular, our adaptive method achieves constant regret in a probabilistic setting, when there exists an action that on average obtains strictly smaller loss than all other actions. We also provide a simulation study comparing our approach to existing methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Machine Learning and Algorithms