A Vision Based Deep Reinforcement Learning Algorithm for UAV Obstacle   Avoidance

Jeremy Roghair; Kyungtae Ko; Amir Ehsan Niaraki Asli; Ali Jannesari

arXiv:2103.06403·cs.AI·March 12, 2021

A Vision Based Deep Reinforcement Learning Algorithm for UAV Obstacle Avoidance

Jeremy Roghair, Kyungtae Ko, Amir Ehsan Niaraki Asli, Ali Jannesari

PDF

TL;DR

This paper introduces two novel exploration techniques for deep reinforcement learning to enhance UAV obstacle avoidance, demonstrating significant performance improvements in simulated environments.

Contribution

It proposes convergence-based and guidance-based exploration methods using a Domain Network, improving UAV obstacle avoidance in deep reinforcement learning.

Findings

01

Two-fold increase in average rewards over existing methods

02

Effective exploration in sparse reward environments

03

Validated in multiple 3-D simulation environments

Abstract

Integration of reinforcement learning with unmanned aerial vehicles (UAVs) to achieve autonomous flight has been an active research area in recent years. An important part focuses on obstacle detection and avoidance for UAVs navigating through an environment. Exploration in an unseen environment can be tackled with Deep Q-Network (DQN). However, value exploration with uniform sampling of actions may lead to redundant states, where often the environments inherently bear sparse rewards. To resolve this, we present two techniques for improving exploration for UAV obstacle avoidance. The first is a convergence-based approach that uses convergence error to iterate through unexplored actions and temporal threshold to balance exploration and exploitation. The second is a guidance-based approach using a Domain Network which uses a Gaussian mixture distribution to compare previously seen states…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.