Artificial Constraints and Lipschitz Hints for Unconstrained Online   Learning

Ashok Cutkosky

arXiv:1902.09013·stat.ML·February 26, 2019·1 cites

Artificial Constraints and Lipschitz Hints for Unconstrained Online Learning

Ashok Cutkosky

PDF

Open Access

TL;DR

This paper introduces algorithms for online convex optimization that achieve regret bounds without prior knowledge of the Lipschitz constant or comparison norm, improving over previous exponential penalties with polynomial bounds.

Contribution

The authors develop new algorithms with regret bounds that do not require prior knowledge of key parameters, and they show these bounds are nearly optimal with polynomial dependence.

Findings

01

Regret bounds without prior knowledge of G or ||u||

02

Polynomial penalty bounds in all parameters

03

Optimal adaptation to unknown ||u||

Abstract

We provide algorithms that guarantee regret $R_{T} (u) \leq \tilde{O} (G ∥ u ∥^{3} + G (∥ u ∥ + 1) T)$ or $R_{T} (u) \leq \tilde{O} (G ∥ u ∥^{3} T^{1/3} + G T^{1/3} + G ∥ u ∥ T)$ for online convex optimization with $G$ -Lipschitz losses for any comparison point $u$ without prior knowledge of either $G$ or $∥ u ∥$ . Previous algorithms dispense with the $O (∥ u ∥^{3})$ term at the expense of knowledge of one or both of these parameters, while a lower bound shows that some additional penalty term over $G ∥ u ∥ T$ is necessary. Previous penalties were exponential while our bounds are polynomial in all quantities. Further, given a known bound $∥ u ∥ \leq D$ , our same techniques allow us to design algorithms that adapt optimally to the unknown value of $∥ u ∥$ without requiring knowledge of $G$ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems