TL;DR
This paper introduces an adaptive deep reinforcement learning method for UAVs that efficiently balances exploration and exploitation in unknown environments, enabling more effective search for areas of interest without prior maps.
Contribution
It develops a unified DRL-based approach with environment map segmentation and extended algorithms to improve autonomous exploration and target search in unknown settings.
Findings
Outperforms baselines in environment coverage
Navigates efficiently in random environments
Covers more areas of interest in fewer steps
Abstract
Performing autonomous exploration is essential for unmanned aerial vehicles (UAVs) operating in unknown environments. Often, these missions start with building a map for the environment via pure exploration and subsequently using (i.e. exploiting) the generated map for downstream navigation tasks. Accomplishing these navigation tasks in two separate steps is not always possible or even disadvantageous for UAVs deployed in outdoor and dynamically changing environments. Current exploration approaches either use a priori human-generated maps or use heuristics such as frontier-based exploration. Other approaches use learning but focus only on learning policies for specific tasks by either using sample inefficient random exploration or by making impractical assumptions about full map availability. In this paper, we develop an adaptive exploration approach to trade off between exploration and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTanh Activation · Sigmoid Activation · A2C · Long Short-Term Memory
