Optimistic Online Convex Optimization in Dynamic Environments
Qing-xin Meng, Jian-wei Liu

TL;DR
This paper develops environment-adaptive algorithms for optimistic online convex optimization in dynamic settings, improving regret bounds by replacing traditional components with optimistic variants and extending the doubling trick.
Contribution
It introduces ONES-OGP, an environment-adaptive algorithm for optimistic online convex optimization, replacing non-adaptive components and extending the doubling trick for better regret bounds.
Findings
Achieves environment-adaptive regret bounds.
Replaces GP and NES with optimistic variants.
Extends the doubling trick to an adaptive version.
Abstract
In this paper, we study the optimistic online convex optimization problem in dynamic environments. Existing works have shown that Ader enjoys an dynamic regret upper bound, where is the number of rounds, and is the path length of the reference strategy sequence. However, Ader is not environment-adaptive. Based on the fact that optimism provides a framework for implementing environment-adaptive, we replace Greedy Projection (GP) and Normalized Exponentiated Subgradient (NES) in Ader with Optimistic-GP and Optimistic-NES respectively, and name the corresponding algorithm ONES-OGP. We also extend the doubling trick to the adaptive trick, and introduce three characteristic terms naturally arise from optimism, namely , and , to replace the dependence of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Machine Learning and Algorithms
