Towards a Dimension-Free Understanding of Adaptive Linear Control

Juan C. Perdomo; Max Simchowitz; Alekh Agarwal; Peter Bartlett

arXiv:2103.10620·math.OC·July 16, 2021

Towards a Dimension-Free Understanding of Adaptive Linear Control

Juan C. Perdomo, Max Simchowitz, Alekh Agarwal, Peter Bartlett

PDF

Open Access

TL;DR

This paper establishes a dimension-free framework for adaptive linear control in high or infinite dimensions, providing regret bounds that depend on problem complexity rather than ambient dimension.

Contribution

It introduces the first regret bounds for infinite-dimensional LQR that replace ambient dimension dependence with natural complexity measures.

Findings

01

Regret bounds applicable to infinite-dimensional systems.

02

Dependence on problem complexity instead of ambient dimension.

03

Bounds recover near optimal dependence in finite-dimensional cases.

Abstract

We study the problem of adaptive control of the linear quadratic regulator for systems in very high, or even infinite dimension. We demonstrate that while sublinear regret requires finite dimensional inputs, the ambient state dimension of the system need not be bounded in order to perform online control. We provide the first regret bounds for LQR which hold for infinite dimensional systems, replacing dependence on ambient dimension with more natural notions of problem complexity. Our guarantees arise from a novel perturbation bound for certainty equivalence which scales with the prediction error in estimating the system parameters, without requiring consistent parameter recovery in more stringent measures like the operator norm. When specialized to finite dimensional settings, our bounds recover near optimal dimension and time horizon dependence.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Model Reduction and Neural Networks · Adaptive Dynamic Programming Control