Online Optimization with Feedback Delay and Nonlinear Switching Cost

Weici Pan; Guanya Shi; Yiheng Lin; Adam Wierman

arXiv:2111.00095·cs.LG·November 2, 2021

Online Optimization with Feedback Delay and Nonlinear Switching Cost

Weici Pan, Guanya Shi, Yiheng Lin, Adam Wierman

PDF

Open Access

TL;DR

This paper introduces a novel online optimization algorithm, iROBD, that effectively handles delayed feedback and nonlinear multi-step switching costs, achieving a constant competitive ratio independent of problem dimension.

Contribution

The paper proposes the iROBD algorithm with a constant, dimension-free competitive ratio for online optimization with feedback delay and nonlinear switching costs, supported by tight lower bounds.

Findings

01

iROBD achieves an $O(L^{2k})$ competitive ratio.

02

Lower bounds show the necessity of the Lipschitz condition and tightness of dependencies.

03

Reductions connect the problem to online control with delay and nonlinear dynamics.

Abstract

We study a variant of online optimization in which the learner receives $k$ -round $delayed feedback$ about hitting cost and there is a multi-step nonlinear switching cost, i.e., costs depend on multiple previous actions in a nonlinear manner. Our main result shows that a novel Iterative Regularized Online Balanced Descent (iROBD) algorithm has a constant, dimension-free competitive ratio that is $O (L^{2 k})$ , where $L$ is the Lipschitz constant of the switching cost. Additionally, we provide lower bounds that illustrate the Lipschitz condition is required and the dependencies on $k$ and $L$ are tight. Finally, via reductions, we show that this setting is closely related to online control problems with delay, nonlinear dynamics, and adversarial disturbances, where iROBD directly offers constant-competitive online policies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Machine Learning and Algorithms