Predicting Sim-to-Real Transfer with Probabilistic Dynamics Models

Lei M. Zhang; Matthias Plappert; Wojciech Zaremba

arXiv:2009.12864·cs.LG·September 29, 2020

Predicting Sim-to-Real Transfer with Probabilistic Dynamics Models

Lei M. Zhang, Matthias Plappert, Wojciech Zaremba

PDF

Open Access

TL;DR

This paper introduces a probabilistic dynamics model-based transfer metric that predicts the real-world performance of RL policies from simulation, reducing the need for costly real-world testing.

Contribution

It presents a novel transfer metric that correlates with real-world policy success and predicts the impact of training choices without extensive real-world trials.

Findings

01

Transfer metric correlates with real-world performance

02

Predicts effects of training setup variations

03

Reduces need for real-world rollouts

Abstract

We propose a method to predict the sim-to-real transfer performance of RL policies. Our transfer metric simplifies the selection of training setups (such as algorithm, hyperparameters, randomizations) and policies in simulation, without the need for extensive and time-consuming real-world rollouts. A probabilistic dynamics model is trained alongside the policy and evaluated on a fixed set of real-world trajectories to obtain the transfer metric. Experiments show that the transfer metric is highly correlated with policy performance in both simulated and real-world robotic environments for complex manipulation tasks. We further show that the transfer metric can predict the effect of training setups on policy transfer performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Machine Learning and Algorithms