MetaGrad: Multiple Learning Rates in Online Learning
Tim van Erven, Wouter M. Koolen

TL;DR
MetaGrad is an adaptive online learning algorithm that automatically adjusts multiple learning rates to efficiently handle a wide variety of convex functions, including non-curved and stochastic cases, without manual tuning.
Contribution
It introduces MetaGrad, a novel method that adapts to diverse convex functions by considering multiple learning rates weighted by empirical performance, extending beyond prior adaptive algorithms.
Findings
Achieves logarithmic regret on unregularized hinge loss.
Adapts to exp-concave and strongly convex functions.
Handles stochastic and non-stochastic functions without curvature.
Abstract
In online convex optimization it is well known that certain subclasses of objective functions are much easier than arbitrary convex functions. We are interested in designing adaptive methods that can automatically get fast rates in as many such subclasses as possible, without any manual tuning. Previous adaptive methods are able to interpolate between strongly convex and general convex functions. We present a new method, MetaGrad, that adapts to a much broader class of functions, including exp-concave and strongly convex functions, but also various types of stochastic and non-stochastic functions without any curvature. For instance, MetaGrad can achieve logarithmic regret on the unregularized hinge loss, even though it has no curvature, if the data come from a favourable probability distribution. MetaGrad's main feature is that it simultaneously considers multiple learning rates. Unlike…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques
