Constrained Optimal Tracking Control of Unknown Systems: A Multi-Step   Linear Programming Approach

Alexandros Tanzanakis; John Lygeros

arXiv:2012.04318·eess.SY·December 9, 2020

Constrained Optimal Tracking Control of Unknown Systems: A Multi-Step Linear Programming Approach

Alexandros Tanzanakis, John Lygeros

PDF

TL;DR

This paper introduces a novel multi-step Q-learning and linear programming method for optimal tracking control of unknown systems with input constraints, improving convergence and eliminating the need for initial stabilizing policies.

Contribution

It develops a two-step transformation and a multi-step VI algorithm that handles input constraints and general stage costs without system model knowledge.

Findings

01

Enhanced convergence speed of the control algorithm

02

Elimination of initial stabilizing policy requirement

03

Successful simulation validation of the approach

Abstract

We study the problem of optimal state-feedback tracking control for unknown discrete-time deterministic systems with input constraints. To handle input constraints, state-of-art methods utilize a certain nonquadratic stage cost function, which is sometimes limiting real systems. Furthermore, it is well known that Policy Iteration (PI) and Value Iteration (VI), two widely used algorithms in data-driven control, offer complementary strengths and weaknesses. In this work, a two-step transformation is employed, which converts the constrained-input optimal tracking problem to an unconstrained augmented optimal regulation problem, and allows the consideration of general stage cost functions. Then, a novel multi-step VI algorithm based on Q-learning and linear programming is derived. The proposed algorithm improves the convergence speed of VI, avoids the requirement for an initial stabilizing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.