Dual Averaging on Compactly-Supported Distributions And Application to   No-Regret Learning on a Continuum

Walid Krichene

arXiv:1504.07720·cs.LG·April 30, 2015·2 cites

Dual Averaging on Compactly-Supported Distributions And Application to No-Regret Learning on a Continuum

Walid Krichene

PDF

Open Access

TL;DR

This paper develops a dual averaging method for online learning over a continuum of decisions, providing regret bounds and demonstrating sublinear regret on certain non-convex sets.

Contribution

It introduces a dual averaging algorithm with $ ext{omega}$-potentials for continuum decision spaces and proves regret bounds under weaker conditions than convexity.

Findings

01

Achieves sublinear regret on uniformly fat sets.

02

Provides regret bounds for dual averaging on $L^2(S)$.

03

Extends online convex optimization to continuum decision spaces.

Abstract

We consider an online learning problem on a continuum. A decision maker is given a compact feasible set $S$ , and is faced with the following sequential problem: at iteration~ $t$ , the decision maker chooses a distribution $x^{(t)} \in Δ (S)$ , then a loss function $ℓ^{(t)} : S \to R_{+}$ is revealed, and the decision maker incurs expected loss $⟨ ℓ^{(t)}, x^{(t)} ⟩ = E_{s \sim x^{(t)}} ℓ^{(t)} (s)$ . We view the problem as an online convex optimization problem on the space $Δ (S)$ of Lebesgue-continnuous distributions on $S$ . We prove a general regret bound for the Dual Averaging method on $L^{2} (S)$ , then prove that dual averaging with $ω$ -potentials (a class of strongly convex regularizers) achieves sublinear regret when $S$ is uniformly fat (a condition weaker than convexity).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms