Lazy OCO: Online Convex Optimization on a Switching Budget

Uri Sherman; Tomer Koren

arXiv:2102.03803·cs.LG·September 19, 2023·1 cites

Lazy OCO: Online Convex Optimization on a Switching Budget

Uri Sherman, Tomer Koren

PDF

Open Access

TL;DR

This paper introduces efficient algorithms for online convex optimization with limited decision switches, achieving regret bounds that improve with fewer switches, and provides matching lower bounds for some cases.

Contribution

It fills the gap in oblivious adversary settings by providing computationally efficient algorithms with regret bounds depending on the number of switches allowed.

Findings

01

Regret bound of O(T/S) for general convex losses

02

Regret bound of ~O(T/S^2) for strongly convex losses

03

Algorithms with logarithmic switches and regret overhead

Abstract

We study a variant of online convex optimization where the player is permitted to switch decisions at most $S$ times in expectation throughout $T$ rounds. Similar problems have been addressed in prior work for the discrete decision set setting, and more recently in the continuous setting but only with an adaptive adversary. In this work, we aim to fill the gap and present computationally efficient algorithms in the more prevalent oblivious setting, establishing a regret bound of $O (T / S)$ for general convex losses and $O (T / S^{2})$ for strongly convex losses. In addition, for stochastic i.i.d.~losses, we present a simple algorithm that performs $lo g T$ switches with only a multiplicative $lo g T$ factor overhead in its regret in both the general and strongly convex settings. Finally, we complement our algorithms with lower bounds that match our upper bounds in some of the cases…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Stochastic Gradient Optimization Techniques