A Data-Ensemble-Based Approach for Sample-Efficient LQ Control of Linear Time-Varying Systems

Sahel Vahedi Noori; Maryam Babazadeh

arXiv:2506.23716·eess.SY·September 30, 2025

A Data-Ensemble-Based Approach for Sample-Efficient LQ Control of Linear Time-Varying Systems

Sahel Vahedi Noori, Maryam Babazadeh

PDF

TL;DR

This paper introduces a novel, data-driven, and sample-efficient control framework for finite-horizon linear quadratic control of linear time-varying systems, leveraging duality and convex optimization to improve over existing methods.

Contribution

It develops a non-iterative SDP algorithm that computes optimal feedback gains directly from data without system identification, applicable to both LTV and LTI systems.

Findings

01

Achieves superior optimality and sample efficiency in simulations.

02

Provides global optimality guarantees for unknown LTV systems.

03

Improves over existing Q-learning methods for finite-horizon LTI control.

Abstract

This paper presents a sample-efficient, data-driven control framework for finite-horizon linear quadratic (LQ) control of linear time-varying (LTV) systems. In contrast to the time-invariant case, the time-varying LQ problem involves a differential Riccati equation (DRE) with time-dependent parameters and terminal boundary constraints. We formulate the LQ problem as a nonconvex optimization problem and conduct a rigorous analysis of its dual structure. By exploiting the inherent convexity of the dual problem and analyzing the KKT conditions, we derive an explicit relationship between the optimal dual solution and the parameters of the associated Q-function in time-varying case. This theoretical insight supports the development of a novel, sample-efficient, non-iterative semidefinite programming (SDP) algorithm that directly computes the optimal sequence of feedback gains from an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.