Model-free stochastic linear quadratic design by semidefinite   programming

Jing Guo; Xiushan Jiang; Weihai Zhang

arXiv:2412.17230·math.OC·December 24, 2024·J. Frankl. Inst.

Model-free stochastic linear quadratic design by semidefinite programming

Jing Guo, Xiushan Jiang, Weihai Zhang

PDF

Open Access

TL;DR

This paper introduces a novel model-free semidefinite programming approach for designing stochastic linear quadratic controllers, linking dual problem optimality with Q-functions to enhance reinforcement learning methods.

Contribution

It develops a new SDP-based algorithm for model-free SLQ control, providing theoretical insights and practical tools for reinforcement learning applications.

Findings

01

The proposed algorithm effectively derives optimal control gains.

02

The approach offers a new perspective on Q-learning and RL algorithms.

03

Simulation results validate the method's effectiveness.

Abstract

In this article, we study a model-free design approach for stochastic linear quadratic (SLQ) controllers. Based on the convexity of the SLQ dual problem and the Karush-Kuhn-Tucker (KKT) conditions, we find the relationship between the optimal point of the dual problem and the Q-function, which can be used to develop a novel model-free semidefinite programming (SDP) algorithm for deriving optimal control gain. This study provides a new optimization perspective for understanding Q-learning algorithms and lays a theoretical foundation for effective reinforcement learning (RL) algorithms. Finally, the effectiveness of the proposed model-free SDP algorithm is demonstrated by two case simulations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Multi-Objective Optimization Algorithms · Optimal Experimental Design Methods · Manufacturing Process and Optimization