Optimistic Online Convex Optimization in Dynamic Environments

Qing-xin Meng; Jian-wei Liu

arXiv:2203.14520·cs.LG·March 29, 2022

Optimistic Online Convex Optimization in Dynamic Environments

Qing-xin Meng, Jian-wei Liu

PDF

Open Access

TL;DR

This paper develops environment-adaptive algorithms for optimistic online convex optimization in dynamic settings, improving regret bounds by replacing traditional components with optimistic variants and extending the doubling trick.

Contribution

It introduces ONES-OGP, an environment-adaptive algorithm for optimistic online convex optimization, replacing non-adaptive components and extending the doubling trick for better regret bounds.

Findings

01

Achieves environment-adaptive regret bounds.

02

Replaces GP and NES with optimistic variants.

03

Extends the doubling trick to an adaptive version.

Abstract

In this paper, we study the optimistic online convex optimization problem in dynamic environments. Existing works have shown that Ader enjoys an $O ((1 + P_{T}) T)$ dynamic regret upper bound, where $T$ is the number of rounds, and $P_{T}$ is the path length of the reference strategy sequence. However, Ader is not environment-adaptive. Based on the fact that optimism provides a framework for implementing environment-adaptive, we replace Greedy Projection (GP) and Normalized Exponentiated Subgradient (NES) in Ader with Optimistic-GP and Optimistic-NES respectively, and name the corresponding algorithm ONES-OGP. We also extend the doubling trick to the adaptive trick, and introduce three characteristic terms naturally arise from optimism, namely $M_{T}$ , $M_{T}$ and $V_{T} + 1_{L^{2} ρ (ρ + 2 P_{T}) ⩽ ϱ^{2} V_{T}} D_{T}$ , to replace the dependence of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Machine Learning and Algorithms