Adaptive dynamic programming for nonaffine nonlinear optimal control   problem with state constraints

Jingliang Duan; Zhengyu Liu; Shengbo Eben Li; Qi Sun; Zhenzhong Jia,; and Bo Cheng

arXiv:1911.11397·eess.SY·April 11, 2022

Adaptive dynamic programming for nonaffine nonlinear optimal control problem with state constraints

Jingliang Duan, Zhengyu Liu, Shengbo Eben Li, Qi Sun, Zhenzhong Jia,, and Bo Cheng

PDF

TL;DR

This paper introduces a novel constrained adaptive dynamic programming algorithm for nonlinear optimal control with state constraints, utilizing neural networks and a trust region approach for effective policy updates.

Contribution

It develops a constrained generalized policy iteration framework and an actor-critic method that directly handle state constraints in nonlinear control problems.

Findings

01

Successfully applied to vehicle path-tracking control.

02

Outperforms traditional methods in handling state constraints.

03

Ensures stable policy updates with trust region constraints.

Abstract

This paper presents a constrained adaptive dynamic programming (CADP) algorithm to solve general nonlinear nonaffine optimal control problems with known dynamics. Unlike previous ADP algorithms, it can directly deal with problems with state constraints. Firstly, a constrained generalized policy iteration (CGPI) framework is developed to handle state constraints by transforming the traditional policy improvement process into a constrained policy optimization problem. Next, we propose an actor-critic variant of CGPI, called CADP, in which both policy and value functions are approximated by multi-layer neural networks to directly map the system states to control inputs and value function, respectively. CADP linearizes the constrained optimization problem locally into a quadratically constrained linear programming problem, and then obtains the optimal update of the policy network by solving…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.