Online Linear Optimization via Smoothing

Jacob Abernethy; Chansoo Lee; Abhinav Sinha; Ambuj Tewari

arXiv:1405.6076·cs.LG·May 26, 2014·29 cites

Online Linear Optimization via Smoothing

Jacob Abernethy, Chansoo Lee, Abhinav Sinha, Ambuj Tewari

PDF

Open Access

TL;DR

This paper introduces a new analysis framework for online linear optimization algorithms, connecting regularization and perturbation techniques through smoothing operations to improve regret bounds.

Contribution

It establishes an equivalence between Follow the Regularized Leader and Follow the Perturbed Leader, leading to improved regret bounds in online optimization.

Findings

01

Unified analysis framework for FTRL and FPL algorithms

02

Improved regret bounds for Follow the Perturbed Leader

03

Insight into smoothing operations in online optimization

Abstract

We present a new optimization-theoretic approach to analyzing Follow-the-Leader style algorithms, particularly in the setting where perturbations are used as a tool for regularization. We show that adding a strongly convex penalty function to the decision rule and adding stochastic perturbations to data correspond to deterministic and stochastic smoothing operations, respectively. We establish an equivalence between "Follow the Regularized Leader" and "Follow the Perturbed Leader" up to the smoothness properties. This intuition leads to a new generic analysis framework that recovers and improves the previous known regret bounds of the class of algorithms commonly known as Follow the Perturbed Leader.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Optimization and Search Problems