Artificial Constraints and Lipschitz Hints for Unconstrained Online Learning
Ashok Cutkosky

TL;DR
This paper introduces algorithms for online convex optimization that achieve regret bounds without prior knowledge of the Lipschitz constant or comparison norm, improving over previous exponential penalties with polynomial bounds.
Contribution
The authors develop new algorithms with regret bounds that do not require prior knowledge of key parameters, and they show these bounds are nearly optimal with polynomial dependence.
Findings
Regret bounds without prior knowledge of G or ||u||
Polynomial penalty bounds in all parameters
Optimal adaptation to unknown ||u||
Abstract
We provide algorithms that guarantee regret or for online convex optimization with -Lipschitz losses for any comparison point without prior knowledge of either or . Previous algorithms dispense with the term at the expense of knowledge of one or both of these parameters, while a lower bound shows that some additional penalty term over is necessary. Previous penalties were exponential while our bounds are polynomial in all quantities. Further, given a known bound , our same techniques allow us to design algorithms that adapt optimally to the unknown value of without requiring knowledge of .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems
