Regret Lower Bounds for Learning Linear Quadratic Gaussian Systems

Ingvar Ziemann; Henrik Sandberg

arXiv:2201.01680·cs.LG·June 13, 2024·6 cites

Regret Lower Bounds for Learning Linear Quadratic Gaussian Systems

Ingvar Ziemann, Henrik Sandberg

PDF

Open Access

TL;DR

This paper derives fundamental regret lower bounds for learning control in linear Gaussian systems, revealing how system properties influence the difficulty of control and learning simultaneously.

Contribution

It introduces new regret lower bounds that incorporate control-theoretic parameters, extending to partially observed systems and improving understanding of system difficulty.

Findings

01

Regret scales as (\u221a{T}) with time horizon T.

02

Hard-to-control systems are also hard to learn to control.

03

Results extend to partially observed systems with poor observability.

Abstract

TWe establish regret lower bounds for adaptively controlling an unknown linear Gaussian system with quadratic costs. We combine ideas from experiment design, estimation theory and a perturbation bound of certain information matrices to derive regret lower bounds exhibiting scaling on the order of magnitude $T$ in the time horizon $T$ . Our bounds accurately capture the role of control-theoretic parameters and we are able to show that systems that are hard to control are also hard to learn to control; when instantiated to state feedback systems we recover the dimensional dependency of earlier work but with improved scaling with system-theoretic constants such as system costs and Gramians. Furthermore, we extend our results to a class of partially observed systems and demonstrate that systems with poor observability structure also are hard to learn to control.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Advanced Bandit Algorithms Research · Machine Learning and Algorithms