Rate-Optimal Online Convex Optimization in Adaptive Linear Control
Asaf Cassel (1), Alon Cohen (2, 3), Tomer Koren (1, 3) ((1), School of Computer Science, Tel Aviv University, (2) School of Electrical, Engineering, Tel Aviv University, (3) Google Research, Tel Aviv)

TL;DR
This paper introduces a computationally-efficient online control algorithm for unknown linear systems that achieves optimal regret rates in adversarial environments without requiring strong cost convexity assumptions.
Contribution
It presents the first efficient algorithm attaining optimal -regret in adaptive linear control without strong convexity assumptions on costs.
Findings
Achieves -regret in adversarial linear control.
Develops non-convex confidence bounds for online costs.
Introduces a novel regret minimization technique leveraging non-convex structure.
Abstract
We consider the problem of controlling an unknown linear dynamical system under adversarially changing convex costs and full feedback of both the state and cost function. We present the first computationally-efficient algorithm that attains an optimal -regret rate compared to the best stabilizing linear controller in hindsight, while avoiding stringent assumptions on the costs such as strong convexity. Our approach is based on a careful design of non-convex lower confidence bounds for the online costs, and uses a novel technique for computationally-efficient regret minimization of these bounds that leverages their particular non-convex structure.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Cognitive Radio Networks and Spectrum Sensing · Adaptive Dynamic Programming Control
