The Computational Power of Optimization in Online Learning

Elad Hazan; Tomer Koren

arXiv:1504.02089·cs.LG·January 28, 2016·2 cites

The Computational Power of Optimization in Online Learning

Elad Hazan, Tomer Koren

PDF

Open Access

TL;DR

This paper introduces a novel online learning algorithm leveraging an optimization oracle that achieves vanishing regret with significantly reduced computation time, demonstrating a quadratic speedup over traditional methods.

Contribution

The paper presents a new online algorithm with $ ilde{O}( ext{sqrt}(N))$ runtime for expert advice prediction, establishing a lower bound and highlighting a quadratic speedup due to optimization oracles.

Findings

01

Achieves vanishing regret in $ ilde{O}( ext{sqrt}(N))$ time.

02

Proves a lower bound matching the upper bound, confirming optimality.

03

Shows exponential gap between optimization power in online vs. statistical learning.

Abstract

We consider the fundamental problem of prediction with expert advice where the experts are "optimizable": there is a black-box optimization oracle that can be used to compute, in constant time, the leading expert in retrospect at any point in time. In this setting, we give a novel online algorithm that attains vanishing regret with respect to $N$ experts in total $O (N)$ computation time. We also give a lower bound showing that this running time cannot be improved (up to log factors) in the oracle model, thereby exhibiting a quadratic speedup as compared to the standard, oracle-free setting where the required time for vanishing regret is $Θ (N)$ . These results demonstrate an exponential gap between the power of optimization in online learning and its power in statistical learning: in the latter, an optimization oracle---i.e., an efficient empirical risk…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Reinforcement Learning in Robotics