Fully Unconstrained Online Learning
Ashok Cutkosky, Zakaria Mhammedi

TL;DR
This paper introduces an online learning algorithm that achieves near-optimal regret bounds without prior knowledge of key parameters, effectively handling all scenarios where sublinear regret is possible.
Contribution
It presents a fully unconstrained online learning algorithm that attains optimal regret bounds without knowing the Lipschitz constant or comparison norm beforehand.
Findings
Achieves regret close to the best known bounds without prior parameter knowledge.
Matches the optimal regret bounds in all sublinear regret scenarios.
Handles arbitrary comparison points with no prior constraints.
Abstract
We provide an online learning algorithm that obtains regret on -Lipschitz convex losses for any comparison point without knowing either or . Importantly, this matches the optimal bound available with such knowledge (up to logarithmic factors), unless either or is so large that even is roughly linear in . Thus, it matches the optimal bound in all cases in which one can achieve sublinear regret, which arguably most "interesting" scenarios.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics · Teaching and Learning Programming
