Lazy OCO: Online Convex Optimization on a Switching Budget
Uri Sherman, Tomer Koren

TL;DR
This paper introduces efficient algorithms for online convex optimization with limited decision switches, achieving regret bounds that improve with fewer switches, and provides matching lower bounds for some cases.
Contribution
It fills the gap in oblivious adversary settings by providing computationally efficient algorithms with regret bounds depending on the number of switches allowed.
Findings
Regret bound of O(T/S) for general convex losses
Regret bound of ~O(T/S^2) for strongly convex losses
Algorithms with logarithmic switches and regret overhead
Abstract
We study a variant of online convex optimization where the player is permitted to switch decisions at most times in expectation throughout rounds. Similar problems have been addressed in prior work for the discrete decision set setting, and more recently in the continuous setting but only with an adaptive adversary. In this work, we aim to fill the gap and present computationally efficient algorithms in the more prevalent oblivious setting, establishing a regret bound of for general convex losses and for strongly convex losses. In addition, for stochastic i.i.d.~losses, we present a simple algorithm that performs switches with only a multiplicative factor overhead in its regret in both the general and strongly convex settings. Finally, we complement our algorithms with lower bounds that match our upper bounds in some of the cases…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Stochastic Gradient Optimization Techniques
