Adaptive Optimal Control for Reference Tracking Independent of   Exo-System Dynamics

Florian K\"opf; Johannes Westermann; Michael Flad; S\"oren Hohmann

arXiv:1906.05085·eess.SY·December 2, 2019

Adaptive Optimal Control for Reference Tracking Independent of Exo-System Dynamics

Florian K\"opf, Johannes Westermann, Michael Flad, S\"oren Hohmann

PDF

TL;DR

This paper introduces an adaptive optimal control method that enables reinforcement learning-based tracking of arbitrary reference trajectories, overcoming limitations of existing methods tied to fixed exo-system dynamics.

Contribution

The paper presents a novel Q-function structure that incorporates reference trajectories directly, allowing for adaptive optimal control independent of exo-system dynamics.

Findings

01

Effective tracking of non-exo-system generated trajectories demonstrated

02

Convergent reinforcement learning algorithm developed for optimal control

03

Real-world examples validate the method's applicability

Abstract

Model-free control based on the idea of Reinforcement Learning is a promising approach that has recently gained extensive attention. However, Reinforcement-Learning-based control methods solely focus on the regulation problem or learn to track a reference that is generated by a time-invariant exo-system. In the latter case, controllers are only able to track the time-invariant reference dynamics which they have been trained on and need to be re-trained each time the reference dynamics change. Consequently, these methods fail in a number of applications which obviously rely on a trajectory not being generated by an exo-system. One prominent example is autonomous driving. This paper provides for the first time an adaptive optimal control method capable to track reference trajectories not being generated by a time-invariant exo-system. The main innovation is a novel Q-function that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.