Online Convex Optimization with Memory and Limited Predictions
Zhengmiao Wang, Zhi-Wei Liu, Ming Chi, Xiaoling Wang, Housheng Su, Lintao Ye

TL;DR
This paper introduces an online convex optimization algorithm for costs with memory that uses a limited window of future predictions; its dynamic regret decays exponentially with the window length, supported by theoretical guarantees and numerical validation.
Contribution
It presents a novel predictive algorithm for online convex optimization with memory and limited predictions whose dynamic regret decays exponentially with the prediction window length, and it introduces two general subroutines with optimal convergence rates.
Findings
Dynamic regret decays exponentially with prediction window length.
Achieves $\sqrt{TV_T}$-dynamic regret with bandit feedback.
Attains linear convergence rate for convex optimization.
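For reference, the dynamic regret named in these findings is commonly written as below. This is a sketch of the standard definition, not a formula quoted from the paper: the memory length $m$, the decisions $x_t$, the hindsight-optimal sequence $x^\ast_t$, and the path-length form of $V_T$ are all assumptions made here for illustration.

```latex
\[
\begin{aligned}
\mathrm{Reg}_T
  &= \sum_{t=1}^{T} f_t(x_{t-m}, \dots, x_t)
   - \sum_{t=1}^{T} f_t(x^\ast_{t-m}, \dots, x^\ast_t), \\
(x^\ast_1, \dots, x^\ast_T)
  &\in \arg\min_{y_1, \dots, y_T} \sum_{t=1}^{T} f_t(y_{t-m}, \dots, y_t), \\
V_T
  &= \sum_{t=2}^{T} \lVert x^\ast_t - x^\ast_{t-1} \rVert .
\end{aligned}
\]
```

Under this reading, a bound of order $\sqrt{TV_T}$ is sublinear in $T$ whenever $V_T$ grows sublinearly in $T$.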
Abstract
This paper addresses an online convex optimization problem where the cost function at each step depends on a history of past decisions (i.e., memory), and the decision maker has access to limited predictions of future cost values within a finite window. The goal is to design an algorithm that minimizes the dynamic regret against the optimal sequence of decisions in hindsight. To this end, we propose a novel predictive algorithm and establish strong theoretical guarantees for its performance. We show that the algorithm's dynamic regret decays exponentially with the length of the prediction window. Our algorithm comprises two general subroutines of independent interest. The first subroutine solves online convex optimization with memory and bandit feedback, achieving a $\sqrt{TV_T}$-dynamic regret, where $V_T$ measures the variation of the optimal decision sequence. The second is a…
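To make "dynamic regret against the optimal sequence of decisions in hindsight" concrete, here is a minimal Python sketch on a toy problem. It does not implement the paper's algorithm. The scalar cost model (a tracking term plus a switching penalty on the previous decision), the greedy baseline, and every function name are hypothetical choices made only for illustration.

```python
# Toy model (an assumption for illustration, not the paper's setting):
#   f_t(x_t, x_{t-1}) = (x_t - theta_t)^2 + c * (x_t - x_{t-1})^2
# A tracking cost plus a switching cost, so each cost has one step of memory.


def total_cost(xs, thetas, c, x0=0.0):
    """Cumulative cost of the decision sequence xs, starting from x0."""
    cost, prev = 0.0, x0
    for x, theta in zip(xs, thetas):
        cost += (x - theta) ** 2 + c * (x - prev) ** 2
        prev = x
    return cost


def greedy_decisions(thetas, c, x0=0.0):
    """Minimize each f_t on its own, ignoring its effect on later costs."""
    xs, prev = [], x0
    for theta in thetas:
        prev = (theta + c * prev) / (1.0 + c)  # closed-form argmin of f_t
        xs.append(prev)
    return xs


def hindsight_optimum(thetas, c, x0=0.0):
    """Minimize the total cost over the whole (non-empty) sequence.

    The first-order conditions form a tridiagonal linear system:
      -c*x_{t-1} + d_t*x_t - c*x_{t+1} = theta_t,
    with d_t = 1 + 2c for t < T and d_T = 1 + c. It is solved here by
    forward elimination and back substitution (Thomas algorithm).
    """
    n = len(thetas)
    diag = [1.0 + 2.0 * c] * n
    diag[-1] = 1.0 + c
    rhs = list(thetas)
    rhs[0] += c * x0  # the known initial decision moves to the right-hand side
    for t in range(1, n):
        w = c / diag[t - 1]
        diag[t] -= w * c
        rhs[t] += w * rhs[t - 1]
    xs = [0.0] * n
    xs[-1] = rhs[-1] / diag[-1]
    for t in range(n - 2, -1, -1):
        xs[t] = (rhs[t] + c * xs[t + 1]) / diag[t]
    return xs


def dynamic_regret(xs, thetas, c, x0=0.0):
    """Cost of xs minus the cost of the best sequence in hindsight."""
    best = hindsight_optimum(thetas, c, x0)
    return total_cost(xs, thetas, c, x0) - total_cost(best, thetas, c, x0)


thetas = [1.0, -1.0, 2.0, 0.5]
print(dynamic_regret(greedy_decisions(thetas, 0.5), thetas, 0.5))
```

The greedy baseline sees only the current cost, so it cannot anticipate how today's decision raises tomorrow's switching penalty. In this toy setting, that gap is the kind of loss a window of future predictions is meant to reduce.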
Taxonomy
Topics: Advanced Bandit Algorithms Research · Optimization and Search Problems · Advanced Wireless Network Optimization
