Fully Unconstrained Online Learning

Ashok Cutkosky; Zakaria Mhammedi

arXiv:2405.20540·cs.LG·June 3, 2024

Fully Unconstrained Online Learning

Ashok Cutkosky, Zakaria Mhammedi

PDF

Open Access

TL;DR

This paper introduces an online learning algorithm that achieves near-optimal regret bounds without prior knowledge of key parameters, effectively handling all scenarios where sublinear regret is possible.

Contribution

It presents a fully unconstrained online learning algorithm that attains optimal regret bounds without knowing the Lipschitz constant or comparison norm beforehand.

Findings

01

Achieves regret close to the best known bounds without prior parameter knowledge.

02

Matches the optimal regret bounds in all sublinear regret scenarios.

03

Handles arbitrary comparison points with no prior constraints.

Abstract

We provide an online learning algorithm that obtains regret $G ∥ w_{⋆} ∥ T lo g (∥ w_{⋆} ∥ G T) + ∥ w_{⋆} ∥^{2} + G^{2}$ on $G$ -Lipschitz convex losses for any comparison point $w_{⋆}$ without knowing either $G$ or $∥ w_{⋆} ∥$ . Importantly, this matches the optimal bound $G ∥ w_{⋆} ∥ T$ available with such knowledge (up to logarithmic factors), unless either $∥ w_{⋆} ∥$ or $G$ is so large that even $G ∥ w_{⋆} ∥ T$ is roughly linear in $T$ . Thus, it matches the optimal bound in all cases in which one can achieve sublinear regret, which arguably most "interesting" scenarios.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics · Teaching and Learning Programming