Online Control of Unknown Time-Varying Dynamical Systems
Edgar Minasyan, Paula Gradu, Max Simchowitz, Elad Hazan

TL;DR
This paper investigates the challenges of online control for unknown, time-varying linear systems, establishing theoretical hardness results and proposing algorithms with sublinear regret bounds, including adaptive regret, for certain policy classes.
Contribution
It demonstrates the increased difficulty of controlling time-varying systems, proves lower bounds on regret, and provides an efficient algorithm with sublinear and adaptive regret guarantees.
Findings
Sublinear regret is impossible unless system variability is low.
Offline planning for state feedback is NP-hard.
An efficient algorithm achieves sublinear adaptive regret for disturbance response policies.
Abstract
We study online control of time-varying linear systems with unknown dynamics in the nonstochastic control model. At a high level, we demonstrate that this setting is \emph{qualitatively harder} than that of either unknown time-invariant or known time-varying dynamics, and complement our negative results with algorithmic upper bounds in regimes where sublinear regret is possible. More specifically, we study regret bounds with respect to common classes of policies: Disturbance Action (SLS), Disturbance Response (Youla), and linear feedback policies. While these three classes are essentially equivalent for LTI systems, we demonstrate that these equivalences break down for time-varying systems. We prove a lower bound that no algorithm can obtain sublinear regret with respect to the first two classes unless a certain measure of system variability also scales sublinearly in the horizon.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIterative Learning Control Systems · Extremum Seeking Control Systems
