Online Approximate Optimal Station Keeping of an Autonomous Underwater   Vehicle

Patrick Walters; Warren E. Dixon

arXiv:1310.0063·cs.SY·April 2, 2014·1 cites

Online Approximate Optimal Station Keeping of an Autonomous Underwater Vehicle

Patrick Walters, Warren E. Dixon

PDF

Open Access

TL;DR

This paper presents an online reinforcement learning approach for optimal station keeping of an autonomous underwater vehicle, ensuring bounded convergence without persistent excitation.

Contribution

It introduces a novel actor-critic framework that approximates the solution to a zero-sum game for vehicle control, guaranteeing convergence.

Findings

01

Guarantees UUB convergence of states and policies

02

Does not require persistence of excitation

03

Effective in real-time control scenarios

Abstract

Online approximation of an optimal station keeping strategy for a fully actuated six degrees-of-freedom autonomous underwater vehicle is considered. The developed controller is an approximation of the solution to a two player zero-sum game where the controller is the minimizing player and an external disturbance is the maximizing player. The solution is approximated using a reinforcement learning-based actor-critic framework. The result guarantees uniformly ultimately bounded (UUB) convergence of the states and UUB convergence of the approximated policies to the optimal polices without the requirement of persistence of excitation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Adaptive Control of Nonlinear Systems