A Critical Investigation of Deep Reinforcement Learning for Navigation
Vikas Dhiman, Shurjo Banerjee, Brent Griffin, Jeffrey M Siskind, Jason, J Corso

TL;DR
This paper investigates whether deep reinforcement learning algorithms can inherently explore, gather, and exploit map information during navigation, revealing limitations in transferability and optimality across different environments.
Contribution
The study introduces a systematic experimental framework to evaluate DRL's ability to handle map information in navigation tasks, highlighting current limitations and providing open-source tools for future research.
Findings
DRL algorithms successfully gather and exploit map information on trained maps.
They fail to transfer this ability to unseen maps.
Exploitation is suboptimal when goal locations are randomized.
Abstract
The navigation problem is classically approached in two steps: an exploration step, where map-information about the environment is gathered; and an exploitation step, where this information is used to navigate efficiently. Deep reinforcement learning (DRL) algorithms, alternatively, approach the problem of navigation in an end-to-end fashion. Inspired by the classical approach, we ask whether DRL algorithms are able to inherently explore, gather and exploit map-information over the course of navigation. We build upon Mirowski et al. [2017] work and introduce a systematic suite of experiments that vary three parameters: the agent's starting location, the agent's target location, and the maze structure. We choose evaluation metrics that explicitly measure the algorithm's ability to gather and exploit map-information. Our experiments show that when trained and tested on the same maps, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robotic Path Planning Algorithms · Optimization and Search Problems
