Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control

Frederik Ebert; Chelsea Finn; Sudeep Dasari; Annie Xie; Alex Lee; Sergey Levine

arXiv:1812.00568 · cs.RO · December 4, 2018 · 264 cites

TL;DR

This paper introduces a self-supervised, model-based deep reinforcement learning approach for vision-based robotic control that generalizes to new objects and tasks without human supervision.

Contribution

It presents a practical, self-supervised deep RL method that predicts future sensory inputs for robotic manipulation, enabling generalization to unseen objects and tasks.

Findings

1. Successfully generalizes to unseen rigid and deformable objects
2. Solves diverse user-defined manipulation tasks
3. Operates without human supervision during training

Abstract

Deep reinforcement learning (RL) algorithms can learn complex robotic skills from raw sensory inputs, but have yet to achieve the kind of broad generalization and applicability demonstrated by deep learning methods in supervised domains. We present a deep RL method that is practical for real-world robotics tasks, such as robotic manipulation, and generalizes effectively to never-before-seen tasks and objects. In these settings, ground truth reward signals are typically unavailable, and we therefore propose a self-supervised model-based approach, where a predictive model learns to directly predict the future from raw sensory readings, such as camera images. At test time, we explore three distinct goal specification methods: designated pixels, where a user specifies desired object manipulation tasks by selecting particular pixels in an image and corresponding goal positions, goal images,…
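
The designated-pixel idea in the abstract can be illustrated with a small sketch: score a predicted pixel trajectory by its distance to the user-chosen goal position, then pick the action sequence whose predicted trajectory scores best. Everything below is an assumption made for illustration. `toy_predict` is a hypothetical stand-in for the learned predictive model (here each action simply shifts the pixel), and random shooting is used as the simplest sampling-based planner, since the abstract as shown does not specify the optimizer.

```python
import math
import random


def designated_pixel_cost(pred_positions, goal):
    """Sum of Euclidean distances from each predicted pixel position to the goal."""
    return sum(math.dist(p, goal) for p in pred_positions)


def toy_predict(start, actions):
    """Hypothetical stand-in for a learned model: each action shifts the pixel."""
    positions = []
    x, y = start
    for dx, dy in actions:
        x, y = x + dx, y + dy
        positions.append((x, y))
    return positions


def plan_by_random_shooting(start, goal, horizon=5, num_samples=200, seed=0):
    """Sample random action sequences and keep the one with the lowest cost."""
    rng = random.Random(seed)
    best_actions, best_cost = None, float("inf")
    for _ in range(num_samples):
        actions = [(rng.uniform(-2, 2), rng.uniform(-2, 2)) for _ in range(horizon)]
        cost = designated_pixel_cost(toy_predict(start, actions), goal)
        if cost < best_cost:
            best_actions, best_cost = actions, cost
    return best_actions, best_cost


if __name__ == "__main__":
    # Move the designated pixel at (0, 0) toward the goal position (5, 5).
    actions, cost = plan_by_random_shooting(start=(0.0, 0.0), goal=(5.0, 5.0))
    print("first action:", actions[0], "cost:", round(cost, 2))
```

In a real system the toy model would be replaced by a video-prediction network operating on camera images, and only the first planned action would typically be executed before replanning; both of those points go beyond what this sketch implements.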

Code & Models

Repositories

SudeepDasari/visual_foresight

Datasets

Nirav-Madhani/gr00t-g1-palm-pose-augmented
dataset · 121 downloads

Taxonomy

Topics: Reinforcement Learning in Robotics · Robot Manipulation and Learning · Advanced Memory and Neural Computing