Structured Output Feedback Control for Linear Quadratic Regulator Using Policy Gradient Method
Shokichi Takakura, Kazuhiro Sato

TL;DR
This paper develops a model-free policy gradient method with variance reduction for structured output feedback control in LQR problems, demonstrating global convergence and efficiency in numerical experiments.
Contribution
It introduces a novel policy gradient algorithm with variance reduction for structured output feedback LQR, ensuring global convergence in a model-free setting.
Findings
The algorithm converges globally to ε-stationary points.
Variance reduction significantly improves gradient estimation.
Numerical results show efficient problem-solving in practice.
Abstract
We consider the static output feedback control for Linear Quadratic Regulator problems with structured constraints under the assumption that system parameters are unknown. To solve the problem in the model free setting, we propose the policy gradient algorithm based on the gradient projection method and show its global convergence to -stationary points. In addition, we introduce a variance reduction technique and show both theoretically and numerically that it significantly reduces the variance in the gradient estimation. We also show in the numerical experiments that the model free approach efficiently solves the problem.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Stability and Control of Uncertain Systems · Adaptive Dynamic Programming Control
