Sample Complexity of Data-Driven Stochastic LQR with Multiplicative Uncertainty

Peter Coppens; Panagiotis Patrinos

arXiv:2005.12167·eess.SY·March 5, 2021


TL;DR

This paper analyzes how the sample size affects the performance of data-driven stochastic LQR controllers with multiplicative noise, providing bounds on suboptimality that decrease as more data is collected.

Contribution

It establishes theoretical bounds on the suboptimality of covariance estimation-based stochastic LQR, extending to unknown means and distributionally robust settings.

Findings

01 The suboptimality is bounded by a term that decreases with 1/N, where N is the number of samples.

02 The methodology generalizes to the unknown-mean case and to the distributionally robust case.

03 The bounds are derived mostly from matrix function perturbation analysis.

Abstract

This paper studies the sample complexity of the stochastic Linear Quadratic Regulator when applied to systems with multiplicative noise. We assume that the covariance of the noise is unknown and estimate it using the sample covariance, which results in suboptimal behaviour. The main contribution of this paper is to bound the suboptimality of the methodology and prove that it decreases with 1/N, where N denotes the number of samples. The methodology easily generalizes to the case where the mean is unknown and to the distributionally robust case studied in a previous work of the authors. The analysis is mostly based on results from matrix function perturbation analysis.
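The covariance estimation step mentioned in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function name `sample_covariance`, the choice of normalization (N for a known mean, N - 1 for an estimated mean), and the Gaussian test distribution are all assumptions made for illustration. The sketch only shows the estimator and how its error shrinks with more samples; it does not compute the controller or the suboptimality bound from the paper.

```python
import numpy as np


def sample_covariance(samples, mean=None):
    """Sample covariance of noise samples stored row-wise (hypothetical helper).

    If `mean` is given (known-mean case), center with it and normalize by N.
    Otherwise (unknown-mean case), center with the sample mean and
    normalize by N - 1. Both normalizations are assumptions of this sketch.
    """
    w = np.asarray(samples, dtype=float)
    n = w.shape[0]
    if mean is not None:
        centered = w - np.asarray(mean, dtype=float)
        return centered.T @ centered / n
    centered = w - w.mean(axis=0)
    return centered.T @ centered / (n - 1)


def demo():
    # Assumed ground-truth covariance, used only to measure estimation error.
    rng = np.random.default_rng(0)
    true_cov = np.array([[1.0, 0.3], [0.3, 0.5]])
    for n in (100, 1_000, 10_000):
        w = rng.multivariate_normal(np.zeros(2), true_cov, size=n)
        est = sample_covariance(w, mean=np.zeros(2))
        # Squared Frobenius error of the estimate for this sample size.
        err = np.linalg.norm(est - true_cov, "fro") ** 2
        print(f"N = {n:6d}  squared error = {err:.2e}")


if __name__ == "__main__":
    demo()
```

Running `demo()` prints the squared estimation error for increasing N. For distributions with finite fourth moments, this error is expected to shrink roughly in proportion to 1/N on average, which is the kind of dependence on sample size that the paper's suboptimality bound is stated in terms of.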
