Model-Reference Reinforcement Learning for Collision-Free Tracking Control of Autonomous Surface Vehicles
Qingrui Zhang, Wei Pan, Vasso Reppa

TL;DR
This paper introduces a model-reference reinforcement learning algorithm that improves collision-free tracking control of uncertain autonomous surface vehicles by combining traditional control with reinforcement learning, ensuring stability and efficiency.
Contribution
A novel reinforcement learning-based control method that guarantees stability and enhances collision avoidance in autonomous surface vehicles with uncertainties.
Findings
Achieves stable collision-free tracking in uncertain environments
Improves sample efficiency over traditional deep RL methods
Demonstrates effectiveness on autonomous surface vehicle simulations
Abstract
This paper presents a novel model-reference reinforcement learning algorithm for the intelligent tracking control of uncertain autonomous surface vehicles with collision avoidance. The proposed control algorithm combines a conventional control method with reinforcement learning to enhance control accuracy and intelligence. In the proposed control design, a nominal system is considered for the design of a baseline tracking controller using a conventional control approach. The nominal system also defines the desired behaviour of uncertain autonomous surface vehicles in an obstacle-free environment. Thanks to reinforcement learning, the overall tracking controller is capable of compensating for model uncertainties and achieving collision avoidance at the same time in environments with obstacles. In comparison to traditional deep reinforcement learning methods, our proposed learning-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Adaptive Control of Nonlinear Systems
