Data-Enabled Policy and Value Iteration for Continuous-Time Linear Quadratic Output Feedback Control

Jun Xie; Yuan-Hua Ni; Yiqin Yang; Bo Xu

arXiv:2603.14386·eess.SY·March 17, 2026

Data-Enabled Policy and Value Iteration for Continuous-Time Linear Quadratic Output Feedback Control

Jun Xie, Yuan-Hua Ni, Yiqin Yang, Bo Xu

PDF

Open Access

TL;DR

This paper introduces data-driven policy and value iteration algorithms for continuous-time LQ control with output feedback, eliminating the need for system knowledge and improving stability and efficiency.

Contribution

It develops a novel substitute state construction method using QR decomposition, enabling model-free policy iteration and value iteration for continuous-time LQ control.

Findings

01

Algorithms avoid system order knowledge and derivative calculations.

02

They demonstrate higher numerical stability and computational efficiency.

03

The methods work effectively in both single-output and multi-output systems.

Abstract

This paper proposes efficient policy iteration and value iteration algorithms for the continuous-time linear quadratic regulator problem with unmeasurable states and unknown system dynamics, from the perspective of direct data-driven control. Specifically, by re-examining the data characteristics of input-output filtered vectors and introducing QR decomposition, an improved substitute state construction method is presented that further eliminates redundant information, ensures a full row rank data matrix, and enables a complete parameterized representation of the feedback controller. Furthermore, the original problem is transformed into an equivalent linear quadratic regulator problem defined on the substitute state with a known input matrix, verifying the stabilizability and detectability of the transformed system. Consequently, model-free policy iteration and value iteration…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Stability and Control of Uncertain Systems · Model Reduction and Neural Networks