Model-Reference Reinforcement Learning for Collision-Free Tracking   Control of Autonomous Surface Vehicles

Qingrui Zhang; Wei Pan; Vasso Reppa

arXiv:2008.07240·eess.SY·August 18, 2020·6 cites

Model-Reference Reinforcement Learning for Collision-Free Tracking Control of Autonomous Surface Vehicles

Qingrui Zhang, Wei Pan, Vasso Reppa

PDF

Open Access

TL;DR

This paper introduces a model-reference reinforcement learning algorithm that improves collision-free tracking control of uncertain autonomous surface vehicles by combining traditional control with reinforcement learning, ensuring stability and efficiency.

Contribution

A novel reinforcement learning-based control method that guarantees stability and enhances collision avoidance in autonomous surface vehicles with uncertainties.

Findings

01

Achieves stable collision-free tracking in uncertain environments

02

Improves sample efficiency over traditional deep RL methods

03

Demonstrates effectiveness on autonomous surface vehicle simulations

Abstract

This paper presents a novel model-reference reinforcement learning algorithm for the intelligent tracking control of uncertain autonomous surface vehicles with collision avoidance. The proposed control algorithm combines a conventional control method with reinforcement learning to enhance control accuracy and intelligence. In the proposed control design, a nominal system is considered for the design of a baseline tracking controller using a conventional control approach. The nominal system also defines the desired behaviour of uncertain autonomous surface vehicles in an obstacle-free environment. Thanks to reinforcement learning, the overall tracking controller is capable of compensating for model uncertainties and achieving collision avoidance at the same time in environments with obstacles. In comparison to traditional deep reinforcement learning methods, our proposed learning-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Adaptive Control of Nonlinear Systems