Online Control with Adversarial Disturbances
Naman Agarwal, Brian Bullins, Elad Hazan, Sham M. Kakade, Karan Singh

TL;DR
This paper introduces an efficient online control algorithm for linear systems with adversarial disturbances, achieving near-optimal regret bounds and extending previous models to include adversarial noise and convex costs.
Contribution
It presents a novel algorithm that handles adversarial disturbances in linear control with general convex costs, providing tight regret guarantees.
Findings
Achieves nearly tight regret bounds for adversarial disturbances.
Extends control models to include general convex costs.
Provides an efficient algorithm for online control under adversarial noise.
Abstract
We study the control of a linear dynamical system with adversarial disturbances (as opposed to statistical noise). The objective we consider is one of regret: we desire an online control procedure that can do nearly as well as that of a procedure that has full knowledge of the disturbances in hindsight. Our main result is an efficient algorithm that provides nearly tight regret bounds for this problem. From a technical standpoint, this work generalizes upon previous work in two main aspects: our model allows for adversarial noise in the dynamics, and allows for general convex costs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Advanced Control Systems Optimization · Model Reduction and Neural Networks
