MetaGrad: Multiple Learning Rates in Online Learning

Tim van Erven; Wouter M. Koolen

arXiv:1604.08740 · cs.LG · August 31, 2021 · 20 cites

TL;DR

MetaGrad is an adaptive online learning algorithm that considers multiple learning rates simultaneously. It obtains fast rates on a wide range of convex functions, including stochastic and non-stochastic functions without curvature, with no manual tuning.

Contribution

The paper introduces MetaGrad, a method that adapts to a broad class of convex functions by considering multiple learning rates at once, weighted by their empirical performance. Previous adaptive methods interpolate only between strongly convex and general convex functions.

Findings

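1. Achieves logarithmic regret on the unregularized hinge loss, which has no curvature, when the data come from a favourable probability distribution.
2. Adapts to exp-concave and strongly convex functions.
3. Handles various types of stochastic and non-stochastic functions without any curvature.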
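
For readers unfamiliar with the term, the regret in these findings is presumably the standard notion from online convex optimization. The notation below is assumed for illustration and does not come from this page: w_t is the learner's prediction in round t, f_t is the loss function revealed in that round, and U is the feasible domain.

```latex
R_T \;=\; \sum_{t=1}^{T} f_t(w_t) \;-\; \min_{u \in \mathcal{U}} \sum_{t=1}^{T} f_t(u)
```

Logarithmic regret means R_T = O(log T). For arbitrary convex losses with bounded domain and gradients, the best achievable worst-case rate is O(sqrt(T)).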

Abstract

In online convex optimization it is well known that certain subclasses of objective functions are much easier than arbitrary convex functions. We are interested in designing adaptive methods that can automatically get fast rates in as many such subclasses as possible, without any manual tuning. Previous adaptive methods are able to interpolate between strongly convex and general convex functions. We present a new method, MetaGrad, that adapts to a much broader class of functions, including exp-concave and strongly convex functions, but also various types of stochastic and non-stochastic functions without any curvature. For instance, MetaGrad can achieve logarithmic regret on the unregularized hinge loss, even though it has no curvature, if the data come from a favourable probability distribution. MetaGrad's main feature is that it simultaneously considers multiple learning rates. Unlike…
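
The idea of running several learning rates at once and weighting them by how well each has performed can be illustrated with a deliberately simplified sketch. This is not the MetaGrad algorithm, and the paper's actual construction is not reproduced here. As a stand-in, the code runs one projected online gradient descent learner per learning rate on a one-dimensional squared loss and combines them with plain exponential weights. The names `aggregate_learning_rates`, `clip`, and `ALPHA`, the choice of loss, and the grid of rates are all assumptions made for illustration.

```python
import math

# Assumption for this sketch: predictions and targets lie in [-1, 1], so
# |w - y| <= 2 and the squared loss (w - y)^2 is ALPHA-exp-concave.
ALPHA = 1.0 / 8.0


def clip(w, radius=1.0):
    """Project a scalar onto the interval [-radius, radius]."""
    return max(-radius, min(radius, w))


def aggregate_learning_rates(targets, etas):
    """Run one gradient-descent learner per learning rate and combine them.

    Each learner i uses its own step size etas[i]. The combined prediction
    is the average of the learners' iterates, weighted by
    exp(-ALPHA * cumulative loss), i.e. by empirical performance so far.
    Returns (loss of the combined predictions, per-learner cumulative losses).
    """
    iterates = [0.0] * len(etas)
    cum_loss = [0.0] * len(etas)
    combined_loss = 0.0
    for y in targets:
        # Exponential weights; subtracting the minimum avoids underflow.
        best = min(cum_loss)
        weights = [math.exp(-ALPHA * (loss - best)) for loss in cum_loss]
        total = sum(weights)
        w = sum(p * x for p, x in zip(weights, iterates)) / total
        combined_loss += (w - y) ** 2
        # Every learner sees the same target and takes its own gradient step.
        for i, eta in enumerate(etas):
            cum_loss[i] += (iterates[i] - y) ** 2
            grad = 2.0 * (iterates[i] - y)
            iterates[i] = clip(iterates[i] - eta * grad)
    return combined_loss, cum_loss


if __name__ == "__main__":
    etas = [2.0 ** -i for i in range(8)]  # hypothetical geometric grid
    targets = [0.5 + 0.3 * math.sin(t) for t in range(200)]
    combined, per_rate = aggregate_learning_rates(targets, etas)
    print("combined loss:", combined)
    print("best single learning rate:", min(per_rate))
```

Under the stated assumptions, the combined loss exceeds that of the best single learning rate in the grid by at most ln(K) / ALPHA, where K is the number of rates. This is the standard guarantee for exponential weights on exp-concave losses, and it shows why the grid does not need to be tuned by hand.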

Code & Models

Repositories

https://bitbucket.org/wmkoolen/metagrad (official)

Taxonomy

Topics: Advanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques