Simulator Predictive Control: Using Learned Task Representations and MPC for Zero-Shot Generalization and Sequencing
Zhanpeng He, Ryan Julian, Eric Heiden, Hejia Zhang, Stefan Schaal,, Joseph J. Lim, Gaurav Sukhatme, Karol Hausman

TL;DR
This paper introduces a method combining learned task representations with model-predictive control (MPC) to enable zero-shot generalization and sequencing of robotic skills, facilitating efficient real-world skill learning.
Contribution
It presents a novel approach that reuses simulation for foresight, learning a continuous skill embedding and a multi-skill policy conditioned on this embedding, transferable directly to real robots.
Findings
Successfully transferred multi-skill policy to a real Sawyer robot
Achieved zero-shot generalization to unseen tasks using MPC
Demonstrated effective motion tasks like drawing and block pushing
Abstract
Simulation-to-real transfer is an important strategy for making reinforcement learning practical with real robots. Successful sim-to-real transfer systems have difficulty producing policies which generalize across tasks, despite training for thousands of hours equivalent real robot time. To address this shortcoming, we present a novel approach to efficiently learning new robotic skills directly on a real robot, based on model-predictive control (MPC) and an algorithm for learning task representations. In short, we show how to reuse the simulation from the pre-training step of sim-to-real methods as a tool for foresight, allowing the sim-to-real policy adapt to unseen tasks. Rather than end-to-end learning policies for single tasks and attempting to transfer them, we first use simulation to simultaneously learn (1) a continuous parameterization (i.e. a skill embedding or latent) of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Advanced Control Systems Optimization
