Double Critic Deep Reinforcement Learning for Mapless 3D Navigation of   Unmanned Aerial Vehicles

Ricardo Bedin Grando; Junior Costa de Jesus; Victor Augusto Kich,; Alisson Henrique Kolling; Paulo Lilles Jorge Drews-Jr

arXiv:2112.13724·cs.RO·December 28, 2021

Double Critic Deep Reinforcement Learning for Mapless 3D Navigation of Unmanned Aerial Vehicles

Ricardo Bedin Grando, Junior Costa de Jesus, Victor Augusto Kich,, Alisson Henrique Kolling, Paulo Lilles Jorge Drews-Jr

PDF

Open Access 1 Repo

TL;DR

This paper introduces a deep reinforcement learning system using double critic models and RNNs for UAV mapless 3D navigation, outperforming previous methods with sparse range data.

Contribution

The paper proposes a novel deep RL framework with double critic models and RNNs for UAV navigation using sparse sensor data, improving over existing approaches.

Findings

01

Double critic models outperform DDPG and BUG2 algorithms.

02

RNN-based deep RL models outperform previous navigation structures.

03

Sparse range data is sufficient for effective UAV navigation.

Abstract

This paper presents a novel deep reinforcement learning-based system for 3D mapless navigation for Unmanned Aerial Vehicles (UAVs). Instead of using a image-based sensing approach, we propose a simple learning system that uses only a few sparse range data from a distance sensor to train a learning agent. We based our approaches on two state-of-art double critic Deep-RL models: Twin Delayed Deep Deterministic Policy Gradient (TD3) and Soft Actor-Critic (SAC). We show that our two approaches manage to outperform an approach based on the Deep Deterministic Policy Gradient (DDPG) technique and the BUG2 algorithm. Also, our new Deep-RL structure based on Recurrent Neural Networks (RNNs) outperforms the current structure used to perform mapless navigation of mobile robots. Overall, we conclude that Deep-RL approaches based on double critic with Recurrent Neural Networks (RNNs) are better…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ricardogrando/hydrone_deep_rl_jint
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Robotics and Sensor-Based Localization · Reinforcement Learning in Robotics