Deep Reinforcement Learning for Adaptive Exploration of Unknown   Environments

Ashley Peake; Joe McCalmon; Yixin Zhang; Daniel Myers; Sarra; Alqahtani; Paul Pauca

arXiv:2105.01606·cs.LG·May 5, 2021

Deep Reinforcement Learning for Adaptive Exploration of Unknown Environments

Ashley Peake, Joe McCalmon, Yixin Zhang, Daniel Myers, Sarra, Alqahtani, Paul Pauca

PDF

1 Repo

TL;DR

This paper introduces an adaptive deep reinforcement learning method for UAVs that efficiently balances exploration and exploitation in unknown environments, enabling more effective search for areas of interest without prior maps.

Contribution

It develops a unified DRL-based approach with environment map segmentation and extended algorithms to improve autonomous exploration and target search in unknown settings.

Findings

01

Outperforms baselines in environment coverage

02

Navigates efficiently in random environments

03

Covers more areas of interest in fewer steps

Abstract

Performing autonomous exploration is essential for unmanned aerial vehicles (UAVs) operating in unknown environments. Often, these missions start with building a map for the environment via pure exploration and subsequently using (i.e. exploiting) the generated map for downstream navigation tasks. Accomplishing these navigation tasks in two separate steps is not always possible or even disadvantageous for UAVs deployed in outdoor and dynamically changing environments. Current exploration approaches either use a priori human-generated maps or use heuristics such as frontier-based exploration. Other approaches use learning but focus only on learning policies for specific tasks by either using sample inefficient random exploration or by making impractical assumptions about full map availability. In this paper, we develop an adaptive exploration approach to trade off between exploration and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

RL-WFU/Drone_field
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTanh Activation · Sigmoid Activation · A2C · Long Short-Term Memory