Data-driven policy iteration algorithm for continuous-time stochastic   linear-quadratic optimal control problems

Heng Zhang; Na Li

arXiv:2209.14490·math.OC·September 30, 2022

Data-driven policy iteration algorithm for continuous-time stochastic linear-quadratic optimal control problems

Heng Zhang, Na Li

PDF

Open Access

TL;DR

This paper introduces a data-driven policy iteration algorithm for solving infinite-horizon stochastic linear-quadratic control problems without prior knowledge of system matrices, demonstrating effectiveness through simulation.

Contribution

It proposes a novel data-driven approach to solve stochastic LQ problems by iteratively approximating the Riccati equation without system model knowledge.

Findings

01

Algorithm successfully approximates the solution to the SARE.

02

Simulation confirms the algorithm's effectiveness and applicability.

03

Method enables model-free control in stochastic LQ problems.

Abstract

This paper studies a continuous-time stochastic linear-quadratic (SLQ) optimal control problem on infinite-horizon. A data-driven policy iteration algorithm is proposed to solve the SLQ problem. Without knowing three system coefficient matrices, this algorithm uses the collected data to iteratively approximate a solution of the corresponding stochastic algebraic Riccati equation (SARE). A simulation example is provided to illustrate the effectiveness and applicability of the algorithm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Energy Load and Power Forecasting · Stochastic processes and financial applications