Model-based reinforcement learning for infinite-horizon approximate optimal tracking

Rushikesh Kamalapurkar; Lindsey Andrews; Patrick Walters; Warren E. Dixon

arXiv:1506.00685·cs.SY·July 25, 2017

TL;DR

This paper introduces a model-based reinforcement learning approach for infinite-horizon optimal tracking in nonlinear systems, using concurrent learning to relax excitation conditions and Lyapunov analysis to ensure stability and convergence.

Contribution

It presents a novel online adaptive method combining reinforcement learning with concurrent learning for nonlinear control systems, enabling approximate optimal tracking without persistent excitation.

Findings

01

Effective tracking of desired trajectories demonstrated in simulations

02

Convergence to a neighborhood of the optimal policy established

03

Relaxed excitation conditions compared to traditional methods

Abstract

This paper provides an approximate online adaptive solution to the infinite-horizon optimal tracking problem for control-affine continuous-time nonlinear systems with unknown drift dynamics. Model-based reinforcement learning is used to relax the persistence of excitation condition. Model-based reinforcement learning is implemented using a concurrent learning-based system identifier to simulate experience by evaluating the Bellman error over unexplored areas of the state space. Tracking of the desired trajectory and convergence of the developed policy to a neighborhood of the optimal policy are established via Lyapunov-based stability analysis. Simulation results demonstrate the effectiveness of the developed technique.
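
The idea of simulating experience by evaluating the Bellman error at unvisited states can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes a scalar linear regulation problem rather than the tracking formulation, a quadratic value estimate V_hat(x) = w * x**2, and an already-identified drift estimate f_hat(x) = theta_hat * x. The names bellman_error, sample_points, theta_hat, and w_opt are hypothetical and chosen only for illustration.

```python
import numpy as np

def bellman_error(x, w, theta_hat, b=1.0, q=1.0, r=1.0):
    """Bellman error at states x for V_hat(x) = w * x**2.

    The state derivative comes from the identified model
    x_dot_hat = theta_hat * x + b * u, not from measured data, so x may be
    any point of the state space, including points the system never visited.
    All parameters are illustrative assumptions.
    """
    x = np.asarray(x, dtype=float)
    grad_v = 2.0 * w * x                # gradient of the value estimate
    u = -0.5 / r * b * grad_v           # policy induced by the value estimate
    x_dot_hat = theta_hat * x + b * u   # model-predicted state derivative
    return grad_v * x_dot_hat + q * x**2 + r * u**2

# Hypothetical off-trajectory points at which experience is "simulated".
sample_points = np.linspace(-2.0, 2.0, 9)
theta_hat = -1.0  # assumed output of the system identifier

# For drift -x, b = q = r = 1, the scalar Riccati equation gives w = sqrt(2) - 1.
w_opt = np.sqrt(2.0) - 1.0
delta_opt = bellman_error(sample_points, w_opt, theta_hat)
delta_bad = bellman_error(sample_points, 1.0, theta_hat)

print(np.max(np.abs(delta_opt)))  # close to zero for the optimal weight
print(np.max(np.abs(delta_bad)))  # clearly nonzero for a poor weight
```

In a learning scheme of this kind, the extrapolated errors at the sample points would drive the weight update, which is why the richness of the sampled set, rather than of the system trajectory itself, matters for convergence.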
