Data-Enabled Policy and Value Iteration for Continuous-Time Linear Quadratic Output Feedback Control
Jun Xie, Yuan-Hua Ni, Yiqin Yang, Bo Xu

TL;DR
This paper introduces data-driven policy and value iteration algorithms for continuous-time LQ control with output feedback, eliminating the need for system knowledge and improving stability and efficiency.
Contribution
It develops a novel substitute state construction method using QR decomposition, enabling model-free policy iteration and value iteration for continuous-time LQ control.
Findings
Algorithms avoid system order knowledge and derivative calculations.
They demonstrate higher numerical stability and computational efficiency.
The methods work effectively in both single-output and multi-output systems.
Abstract
This paper proposes efficient policy iteration and value iteration algorithms for the continuous-time linear quadratic regulator problem with unmeasurable states and unknown system dynamics, from the perspective of direct data-driven control. Specifically, by re-examining the data characteristics of input-output filtered vectors and introducing QR decomposition, an improved substitute state construction method is presented that further eliminates redundant information, ensures a full row rank data matrix, and enables a complete parameterized representation of the feedback controller. Furthermore, the original problem is transformed into an equivalent linear quadratic regulator problem defined on the substitute state with a known input matrix, verifying the stabilizability and detectability of the transformed system. Consequently, model-free policy iteration and value iteration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Stability and Control of Uncertain Systems · Model Reduction and Neural Networks
