Self-supervised Deep Reinforcement Learning with Generalized Computation   Graphs for Robot Navigation

Gregory Kahn; Adam Villaflor; Bosen Ding; Pieter Abbeel; Sergey Levine

arXiv:1709.10489·cs.LG·May 21, 2018

Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation

Gregory Kahn, Adam Villaflor, Bosen Ding, Pieter Abbeel, Sergey Levine

PDF

2 Repos 1 Datasets

TL;DR

This paper introduces a generalized computation graph for robot navigation that combines model-free and model-based reinforcement learning, enabling sample-efficient, self-supervised learning from raw images in both simulated and real-world environments.

Contribution

The authors propose a unified computation graph framework that interpolates between model-free and model-based RL, improving sample efficiency and enabling autonomous learning for robot navigation.

Findings

01

Outperforms single-step and N-step double Q-learning in simulation.

02

Successfully learns to navigate a complex indoor environment with an RC car.

03

Achieves autonomous, self-supervised training within a few hours.

Abstract

Enabling robots to autonomously navigate complex environments is essential for real-world deployment. Prior methods approach this problem by having the robot maintain an internal map of the world, and then use a localization and planning method to navigate through the internal map. However, these approaches often include a variety of assumptions, are computationally intensive, and do not learn from failures. In contrast, learning-based methods improve as the robot acts in the environment, but are difficult to deploy in the real-world due to their high sample complexity. To address the need to learn complex policies with few samples, we propose a generalized computation graph that subsumes value-based model-free methods and model-based methods, with specific instantiations interpolating between model-free and model-based. We then instantiate this graph to form a navigation model that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

catglossop/CAST-dataset
dataset· 59 dl
59 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.