Learning the Globally Optimal Distributed LQ Regulator

Luca Furieri; Yang Zheng; Maryam Kamgarpour

arXiv:1912.08774·eess.SY·July 14, 2021·30 cites

Learning the Globally Optimal Distributed LQ Regulator

Luca Furieri, Yang Zheng, Maryam Kamgarpour

PDF

Open Access

TL;DR

This paper develops a model-free learning approach with sample complexity bounds for globally optimal distributed output-feedback LQ control under subspace constraints, addressing a key challenge in distributed control.

Contribution

It introduces the first sample-complexity bounds for learning globally optimal distributed output-feedback LQ controllers, leveraging recent zeroth-order optimization results.

Findings

01

Established sample complexity bounds for distributed LQ control.

02

Proved that Quadratically Invariant problems have the necessary gradient dominance property.

03

First to provide theoretical guarantees for learning globally optimal distributed control policies.

Abstract

We study model-free learning methods for the output-feedback Linear Quadratic (LQ) control problem in finite-horizon subject to subspace constraints on the control policy. Subspace constraints naturally arise in the field of distributed control and present a significant challenge in the sense that standard model-based optimization and learning leads to intractable numerical programs in general. Building upon recent results in zeroth-order optimization, we establish model-free sample-complexity bounds for the class of distributed LQ problems where a local gradient dominance constant exists on any sublevel set of the cost function. %which admit a local gradient dominance constant valid on the sublevel set of the cost function. We prove that a fundamental class of distributed control problems - commonly referred to as Quadratically Invariant (QI) problems - as well as others possess this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Adaptive Dynamic Programming Control · Advanced Bandit Algorithms Research