Direct Data-Driven Discounted Infinite Horizon Linear Quadratic Regulator with Robustness Guarantees
Ramin Esmzad, Hamidreza Modares

TL;DR
This paper introduces a one-shot data-driven LQR control method with robustness guarantees for stochastic systems, avoiding iterative data-hungry algorithms and ensuring stability and performance despite noise.
Contribution
It presents a novel adaptive dynamic programming approach that directly learns control gains and value functions from data while guaranteeing robustness and stability.
Findings
Outperforms existing methods in robustness and performance.
Ensures stability without regularization or tuning.
Validated on active car suspension system.
Abstract
This paper presents a one-shot learning approach with performance and robustness guarantees for the linear quadratic regulator (LQR) control of stochastic linear systems. Even though data-based LQR control has been widely considered, existing results suffer either from data hungriness due to the inherently iterative nature of the optimization formulation (e.g., value learning or policy gradient reinforcement learning algorithms) or from a lack of robustness guarantees in one-shot non-iterative algorithms. To avoid data hungriness while ensuing robustness guarantees, an adaptive dynamic programming formalization of the LQR is presented that relies on solving a Bellman inequality. The control gain and the value function are directly learned by using a control-oriented approach that characterizes the closed-loop system using data and a decision variable from which the control is obtained.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Control Systems and Identification · Fault Detection and Control Systems
