Parameter-free Mirror Descent

Andrew Jacobsen; Ashok Cutkosky

arXiv:2203.00444·cs.LG·February 12, 2024

Parameter-free Mirror Descent

Andrew Jacobsen, Ashok Cutkosky

PDF

Open Access

TL;DR

This paper introduces a new parameter-free mirror descent framework for online optimization in unbounded domains, achieving optimal dynamic regret and surpassing traditional strategies like Follow-the-Regularized-Leader.

Contribution

It presents the first unconstrained online linear optimization algorithm with optimal dynamic regret and develops new parameter-free implicit and scale-free algorithms.

Findings

01

Achieves optimal dynamic regret in unbounded domains.

02

Shows traditional strategies like Follow-the-Regularized-Leader are insufficient.

03

Provides simplified, improved unconstrained scale-free algorithms.

Abstract

We develop a modified online mirror descent framework that is suitable for building adaptive and parameter-free algorithms in unbounded domains. We leverage this technique to develop the first unconstrained online linear optimization algorithm achieving an optimal dynamic regret bound, and we further demonstrate that natural strategies based on Follow-the-Regularized-Leader are unable to achieve similar results. We also apply our mirror descent framework to build new parameter-free implicit updates, as well as a simplified and improved unconstrained scale-free algorithm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Optimization and Search Problems