On the Optimization Landscape of Observer-based Dynamic Linear Quadratic Control

Jingliang Duan; Jie Li; Yinsong Ma; Liye Tang; Guofa Li; Liping Zhang; Shengbo Eben Li; Lin Zhao

arXiv:2604.10635·eess.SY·April 14, 2026

On the Optimization Landscape of Observer-based Dynamic Linear Quadratic Control

Jingliang Duan, Jie Li, Yinsong Ma, Liye Tang, Guofa Li, Liping Zhang, Shengbo Eben Li, Lin Zhao

PDF

TL;DR

This paper analyzes the optimization landscape of observer-based dynamic output-feedback control in LQR problems, revealing conditions for stationarity and deriving equations that characterize optimality.

Contribution

It provides a detailed analysis of the stationary points in observer-based LQR control, including new Sylvester equations that characterize optimality conditions.

Findings

01

Standard LQR controller and observer do not always form a stationary point.

02

Derived Sylvester equations characterize the stationary points.

03

Analysis offers insights for developing policy gradient methods.

Abstract

Understanding the optimization landscape of linear quadratic regulation (LQR) problems is fundamental to the design of efficient reinforcement learning solutions. Recent work has made significant progress in characterizing the landscape of static output-feedback control and linear quadratic Gaussian (LQG) control. For LQG, much of the analysis leverages the separation principle, which allows the controller and estimator to be designed independently. However, this simplification breaks down when the gradients with respect to the estimator and controller parameters are inherently coupled, leading to a more intricate analysis. This paper investigates the optimization landscape of observer-based dynamic output-feedback control of LQR problems. We derive the optimal observer-controller pair in settings where transient quadratic performance cannot be neglected. Our analysis reveals that, in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.