Dual Averaging on Compactly-Supported Distributions And Application to No-Regret Learning on a Continuum
Walid Krichene

TL;DR
This paper develops a dual averaging method for online learning over a continuum of decisions, providing regret bounds and demonstrating sublinear regret on certain non-convex sets.
Contribution
It introduces a dual averaging algorithm with $ ext{omega}$-potentials for continuum decision spaces and proves regret bounds under weaker conditions than convexity.
Findings
Achieves sublinear regret on uniformly fat sets.
Provides regret bounds for dual averaging on $L^2(S)$.
Extends online convex optimization to continuum decision spaces.
Abstract
We consider an online learning problem on a continuum. A decision maker is given a compact feasible set , and is faced with the following sequential problem: at iteration~, the decision maker chooses a distribution , then a loss function is revealed, and the decision maker incurs expected loss . We view the problem as an online convex optimization problem on the space of Lebesgue-continnuous distributions on . We prove a general regret bound for the Dual Averaging method on , then prove that dual averaging with -potentials (a class of strongly convex regularizers) achieves sublinear regret when is uniformly fat (a condition weaker than convexity).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms
