Deep Reinforcement Learning with Successor Features for Navigation   across Similar Environments

Jingwei Zhang; Jost Tobias Springenberg; Joschka Boedecker; Wolfram; Burgard

arXiv:1612.05533·cs.RO·July 25, 2017·33 cites

Deep Reinforcement Learning with Successor Features for Navigation across Similar Environments

Jingwei Zhang, Jost Tobias Springenberg, Joschka Boedecker, Wolfram, Burgard

PDF

Open Access

TL;DR

This paper introduces a successor feature-based deep reinforcement learning approach for robot navigation that enables quick adaptation to new environments without localization or mapping, demonstrated through simulated and real robot experiments.

Contribution

The paper presents a novel successor feature-based deep reinforcement learning algorithm that transfers knowledge across related navigation tasks, reducing learning time in changing environments.

Findings

01

Significantly reduces learning time after initial task completion.

02

Effective in both simulated and real robot navigation scenarios.

03

Outperforms classical planning-based methods in experiments.

Abstract

In this paper we consider the problem of robot navigation in simple maze-like environments where the robot has to rely on its onboard sensors to perform the navigation task. In particular, we are interested in solutions to this problem that do not require localization, mapping or planning. Additionally, we require that our solution can quickly adapt to new situations (e.g., changing navigation goals and environments). To meet these criteria we frame this problem as a sequence of related reinforcement learning tasks. We propose a successor feature based deep reinforcement learning algorithm that can learn to transfer knowledge from previously mastered navigation tasks to new problem instances. Our algorithm substantially decreases the required learning time after the first task instance has been solved, which makes it easily adaptable to changing environments. We validate our method in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robotic Path Planning Algorithms · Optimization and Search Problems