Simulator Predictive Control: Using Learned Task Representations and MPC   for Zero-Shot Generalization and Sequencing

Zhanpeng He; Ryan Julian; Eric Heiden; Hejia Zhang; Stefan Schaal,; Joseph J. Lim; Gaurav Sukhatme; Karol Hausman

arXiv:1810.02422·cs.RO·January 29, 2021

Simulator Predictive Control: Using Learned Task Representations and MPC for Zero-Shot Generalization and Sequencing

Zhanpeng He, Ryan Julian, Eric Heiden, Hejia Zhang, Stefan Schaal,, Joseph J. Lim, Gaurav Sukhatme, Karol Hausman

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method combining learned task representations with model-predictive control (MPC) to enable zero-shot generalization and sequencing of robotic skills, facilitating efficient real-world skill learning.

Contribution

It presents a novel approach that reuses simulation for foresight, learning a continuous skill embedding and a multi-skill policy conditioned on this embedding, transferable directly to real robots.

Findings

01

Successfully transferred multi-skill policy to a real Sawyer robot

02

Achieved zero-shot generalization to unseen tasks using MPC

03

Demonstrated effective motion tasks like drawing and block pushing

Abstract

Simulation-to-real transfer is an important strategy for making reinforcement learning practical with real robots. Successful sim-to-real transfer systems have difficulty producing policies which generalize across tasks, despite training for thousands of hours equivalent real robot time. To address this shortcoming, we present a novel approach to efficiently learning new robotic skills directly on a real robot, based on model-predictive control (MPC) and an algorithm for learning task representations. In short, we show how to reuse the simulation from the pre-training step of sim-to-real methods as a tool for foresight, allowing the sim-to-real policy adapt to unseen tasks. Rather than end-to-end learning policies for single tasks and attempting to transfer them, we first use simulation to simultaneously learn (1) a continuous parameterization (i.e. a skill embedding or latent) of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ryanjulian/embed2learn
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Advanced Control Systems Optimization