Rate-Optimal Online Convex Optimization in Adaptive Linear Control

Asaf Cassel (1); Alon Cohen (2; 3); Tomer Koren (1; 3) ((1); School of Computer Science; Tel Aviv University; (2) School of Electrical; Engineering; Tel Aviv University; (3) Google Research; Tel Aviv)

arXiv:2206.01426·cs.LG·June 6, 2022

Rate-Optimal Online Convex Optimization in Adaptive Linear Control

Asaf Cassel (1), Alon Cohen (2, 3), Tomer Koren (1, 3) ((1), School of Computer Science, Tel Aviv University, (2) School of Electrical, Engineering, Tel Aviv University, (3) Google Research, Tel Aviv)

PDF

Open Access 1 Video

TL;DR

This paper introduces a computationally-efficient online control algorithm for unknown linear systems that achieves optimal regret rates in adversarial environments without requiring strong cost convexity assumptions.

Contribution

It presents the first efficient algorithm attaining optimal -regret in adaptive linear control without strong convexity assumptions on costs.

Findings

01

Achieves -regret in adversarial linear control.

02

Develops non-convex confidence bounds for online costs.

03

Introduces a novel regret minimization technique leveraging non-convex structure.

Abstract

We consider the problem of controlling an unknown linear dynamical system under adversarially changing convex costs and full feedback of both the state and cost function. We present the first computationally-efficient algorithm that attains an optimal $T$ -regret rate compared to the best stabilizing linear controller in hindsight, while avoiding stringent assumptions on the costs such as strong convexity. Our approach is based on a careful design of non-convex lower confidence bounds for the online costs, and uses a novel technique for computationally-efficient regret minimization of these bounds that leverages their particular non-convex structure.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Rate-Optimal Online Convex Optimization in Adaptive Linear Control· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Cognitive Radio Networks and Spectrum Sensing · Adaptive Dynamic Programming Control