On the Optimization Landscape of Observer-based Dynamic Linear Quadratic Control
Jingliang Duan, Jie Li, Yinsong Ma, Liye Tang, Guofa Li, Liping Zhang, Shengbo Eben Li, Lin Zhao

TL;DR
This paper analyzes the optimization landscape of observer-based dynamic output-feedback control in LQR problems, revealing conditions for stationarity and deriving equations that characterize optimality.
Contribution
It provides a detailed analysis of the stationary points in observer-based LQR control, including new Sylvester equations that characterize optimality conditions.
Findings
Standard LQR controller and observer do not always form a stationary point.
Derived Sylvester equations characterize the stationary points.
Analysis offers insights for developing policy gradient methods.
Abstract
Understanding the optimization landscape of linear quadratic regulation (LQR) problems is fundamental to the design of efficient reinforcement learning solutions. Recent work has made significant progress in characterizing the landscape of static output-feedback control and linear quadratic Gaussian (LQG) control. For LQG, much of the analysis leverages the separation principle, which allows the controller and estimator to be designed independently. However, this simplification breaks down when the gradients with respect to the estimator and controller parameters are inherently coupled, leading to a more intricate analysis. This paper investigates the optimization landscape of observer-based dynamic output-feedback control of LQR problems. We derive the optimal observer-controller pair in settings where transient quadratic performance cannot be neglected. Our analysis reveals that, in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
