Data-Driven Near-Optimal Control of Nonlinear Systems Over Finite Horizon

Vasanth Reddy; Hoda Eldardiry; Almuatazbellah Boker

arXiv:2306.05482 · math.OC · June 12, 2023 · 1 citation

Open Access

TL;DR

This paper introduces a reinforcement learning approach for near-optimal control of nonlinear systems with unknown dynamics over finite horizons. It decomposes the problem into two infinite-horizon sub-problems, which avoids solving the time-varying Hamilton-Jacobi-Bellman equation.

Contribution

It presents a novel decomposition method using singular perturbation theory combined with policy iteration for finite-horizon nonlinear control.

Findings

01

Closed-loop performance approaches the model-based optimum as the time horizon grows

02

Decomposition avoids the need to solve the time-varying HJB equation

03

Simulation results validate the effectiveness of the approach

Abstract

We examine the problem of two-point boundary optimal control of nonlinear systems over finite-horizon time periods with unknown model dynamics by employing reinforcement learning. We use techniques from singular perturbation theory to decompose the control problem over the finite horizon into two sub-problems, each solved over an infinite horizon. In the process, we avoid the need to solve the time-varying Hamilton-Jacobi-Bellman equation. Using a policy iteration method, which is made feasible as a result of this decomposition, it is now possible to learn the controller gains of both sub-problems. The overall control is then formed by piecing together the solutions to the two sub-problems. We show that the performance of the proposed closed-loop system approaches that of the model-based optimal performance as the time horizon gets long. Finally, we provide three simulation scenarios to…
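The policy iteration step mentioned in the abstract can be illustrated on a much simpler problem. The sketch below is not the paper's algorithm: it assumes a known, scalar, linear system dx/dt = a*x + b*u with cost integral of q*x^2 + r*u^2 over an infinite horizon, and runs a Kleinman-style policy iteration (evaluate the current gain, then improve it). The function names, the parameter values, and the initial gain `k0` are all hypothetical choices made for illustration. The paper itself treats nonlinear systems with unknown dynamics and learns the gains from data.

```python
import math


def policy_iteration(a, b, q, r, k0, iters=20):
    """Kleinman-style policy iteration for the scalar system dx/dt = a*x + b*u
    with cost integral of (q*x^2 + r*u^2) dt and feedback law u = -k*x.

    Illustrative assumption: the model (a, b) is known. The paper's setting
    is data-driven and nonlinear; this is only a linear, model-based analogue.
    """
    k = k0
    if a - b * k >= 0:
        raise ValueError("initial gain must stabilize the closed loop")
    p = None
    for _ in range(iters):
        # Policy evaluation: solve the scalar Lyapunov equation
        # 2*(a - b*k)*p + q + r*k^2 = 0 for the cost coefficient p.
        p = (q + r * k * k) / (-2.0 * (a - b * k))
        # Policy improvement: update the gain from the evaluated cost.
        k = b * p / r
    return p, k


def riccati_solution(a, b, q, r):
    """Stabilizing root of the scalar algebraic Riccati equation
    2*a*p - (b^2 / r)*p^2 + q = 0, used here only as a reference value."""
    return r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)


# Hypothetical example values: an unstable open loop (a > 0).
a, b, q, r = 1.0, 1.0, 1.0, 1.0
p, k = policy_iteration(a, b, q, r, k0=3.0)
p_star = riccati_solution(a, b, q, r)
```

In this toy setting the iterates converge to the Riccati solution `p_star`, and every intermediate gain keeps the closed loop stable provided the initial gain does. Solving an infinite-horizon problem this way involves only algebraic equations, which is the kind of simplification the decomposition in the abstract is meant to enable.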

Taxonomy

Topics: Adaptive Dynamic Programming Control · Advanced Control Systems Optimization · Mechanical Circulatory Support Devices