No-Regret Algorithms for Unconstrained Online Convex Optimization

Matthew Streeter; H. Brendan McMahan

arXiv:1211.2260·cs.LG·November 13, 2012·32 cites

No-Regret Algorithms for Unconstrained Online Convex Optimization

Matthew Streeter, H. Brendan McMahan

PDF

Open Access

TL;DR

This paper introduces new no-regret algorithms for unconstrained online convex optimization that achieve near-optimal regret bounds without prior knowledge of the comparator, including constant regret for the zero comparator.

Contribution

The authors develop algorithms that attain near-optimal regret in unconstrained online convex optimization without needing prior knowledge of the comparator point.

Findings

01

Achieve near-optimal regret bounds in unconstrained settings

02

Regret with respect to x^* = 0 is constant

03

Prove lower bounds showing near-optimality of their guarantees

Abstract

Some of the most compelling applications of online convex optimization, including online prediction and classification, are unconstrained: the natural feasible set is R^n. Existing algorithms fail to achieve sub-linear regret in this setting unless constraints on the comparator point x^* are known in advance. We present algorithms that, without such prior knowledge, offer near-optimal regret bounds with respect to any choice of x^*. In particular, regret with respect to x^* = 0 is constant. We then prove lower bounds showing that our guarantees are near-optimal in this setting.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Advanced Wireless Network Optimization