Scale-Free Online Learning

Francesco Orabona; D\'avid P\'al

arXiv:1601.01974·cs.LG·December 15, 2016

Scale-Free Online Learning

Francesco Orabona, D\'avid P\'al

PDF

TL;DR

This paper introduces scale-invariant online learning algorithms that adapt to loss vector norms without prior bounds, achieving optimal regret for both bounded and unbounded decision sets.

Contribution

It presents the first adaptive algorithms for unbounded decision sets in online linear optimization that are scale-invariant and achieve optimal regret.

Findings

01

Algorithms are scale-invariant and adapt to loss vector norms.

02

First adaptive algorithms with non-vacuous regret bounds for unbounded decision sets.

03

Lower bounds show limitations of Mirror Descent-based scale-free algorithms.

Abstract

We design and analyze algorithms for online linear optimization that have optimal regret and at the same time do not need to know any upper or lower bounds on the norm of the loss vectors. Our algorithms are instances of the Follow the Regularized Leader (FTRL) and Mirror Descent (MD) meta-algorithms. We achieve adaptiveness to the norms of the loss vectors by scale invariance, i.e., our algorithms make exactly the same decisions if the sequence of loss vectors is multiplied by any positive constant. The algorithm based on FTRL works for any decision set, bounded or unbounded. For unbounded decisions sets, this is the first adaptive algorithm for online linear optimization with a non-vacuous regret bound. In contrast, we show lower bounds on scale-free algorithms based on MD on unbounded domains.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.