A Survey of Algorithms and Analysis for Adaptive Online Learning

H. Brendan McMahan

arXiv:1403.3465·cs.LG·November 10, 2015·26 cites

A Survey of Algorithms and Analysis for Adaptive Online Learning

H. Brendan McMahan

PDF

Open Access

TL;DR

This survey unifies and extends the analysis of adaptive online learning algorithms like FTRL, Dual Averaging, and Mirror Descent, providing tight, data-dependent regret bounds and a framework for analyzing various algorithms under different conditions.

Contribution

It introduces a unified, tight analysis framework for adaptive online algorithms, establishing an exact equivalence between Mirror Descent and FTRL, and generalizing regret bounds for arbitrary norms and regularizers.

Findings

01

Unified analysis of FTRL, Dual Averaging, and Mirror Descent algorithms.

02

Proved exact equivalence between adaptive Mirror Descent and FTRL algorithms.

03

Derived tight, data-dependent regret bounds for various adaptive algorithms.

Abstract

We present tools for the analysis of Follow-The-Regularized-Leader (FTRL), Dual Averaging, and Mirror Descent algorithms when the regularizer (equivalently, prox-function or learning rate schedule) is chosen adaptively based on the data. Adaptivity can be used to prove regret bounds that hold on every round, and also allows for data-dependent regret bounds as in AdaGrad-style algorithms (e.g., Online Gradient Descent with adaptive per-coordinate learning rates). We present results from a large number of prior works in a unified manner, using a modular and tight analysis that isolates the key arguments in easily re-usable lemmas. This approach strengthens pre-viously known FTRL analysis techniques to produce bounds as tight as those achieved by potential functions or primal-dual analysis. Further, we prove a general and exact equivalence between an arbitrary adaptive Mirror Descent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Online Learning and Analytics · Reinforcement Learning in Robotics