Structured Output Feedback Control for Linear Quadratic Regulator Using   Policy Gradient Method

Shokichi Takakura; Kazuhiro Sato

arXiv:2203.00289·math.OC·March 21, 2023·IEEE Trans. Autom. Control.

Structured Output Feedback Control for Linear Quadratic Regulator Using Policy Gradient Method

Shokichi Takakura, Kazuhiro Sato

PDF

Open Access

TL;DR

This paper develops a model-free policy gradient method with variance reduction for structured output feedback control in LQR problems, demonstrating global convergence and efficiency in numerical experiments.

Contribution

It introduces a novel policy gradient algorithm with variance reduction for structured output feedback LQR, ensuring global convergence in a model-free setting.

Findings

01

The algorithm converges globally to ε-stationary points.

02

Variance reduction significantly improves gradient estimation.

03

Numerical results show efficient problem-solving in practice.

Abstract

We consider the static output feedback control for Linear Quadratic Regulator problems with structured constraints under the assumption that system parameters are unknown. To solve the problem in the model free setting, we propose the policy gradient algorithm based on the gradient projection method and show its global convergence to $ε$ -stationary points. In addition, we introduce a variance reduction technique and show both theoretically and numerically that it significantly reduces the variance in the gradient estimation. We also show in the numerical experiments that the model free approach efficiently solves the problem.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Stability and Control of Uncertain Systems · Adaptive Dynamic Programming Control