Online Control with Adversarial Disturbances

Naman Agarwal; Brian Bullins; Elad Hazan; Sham M. Kakade; Karan Singh

arXiv:1902.08721·cs.LG·February 26, 2019·61 cites

Online Control with Adversarial Disturbances

Naman Agarwal, Brian Bullins, Elad Hazan, Sham M. Kakade, Karan Singh

PDF

Open Access

TL;DR

This paper introduces an efficient online control algorithm for linear systems with adversarial disturbances, achieving near-optimal regret bounds and extending previous models to include adversarial noise and convex costs.

Contribution

It presents a novel algorithm that handles adversarial disturbances in linear control with general convex costs, providing tight regret guarantees.

Findings

01

Achieves nearly tight regret bounds for adversarial disturbances.

02

Extends control models to include general convex costs.

03

Provides an efficient algorithm for online control under adversarial noise.

Abstract

We study the control of a linear dynamical system with adversarial disturbances (as opposed to statistical noise). The objective we consider is one of regret: we desire an online control procedure that can do nearly as well as that of a procedure that has full knowledge of the disturbances in hindsight. Our main result is an efficient algorithm that provides nearly tight regret bounds for this problem. From a technical standpoint, this work generalizes upon previous work in two main aspects: our model allows for adversarial noise in the dynamics, and allows for general convex costs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Advanced Control Systems Optimization · Model Reduction and Neural Networks