Understanding Domain Randomization for Sim-to-real Transfer

Xiaoyu Chen; Jiachen Hu; Chi Jin; Lihong Li; Liwei Wang

arXiv:2110.03239·cs.LG·March 15, 2022·33 cites

Understanding Domain Randomization for Sim-to-real Transfer

Xiaoyu Chen, Jiachen Hu, Chi Jin, Lihong Li, Liwei Wang

PDF

Open Access 1 Video

TL;DR

This paper provides a theoretical framework for understanding why domain randomization effectively enables sim-to-real transfer in reinforcement learning, offering bounds on the transfer gap and emphasizing the role of memory in policies.

Contribution

It introduces a formal model of sim-to-real transfer with bounds on the transfer gap and demonstrates conditions for success without real-world data, highlighting the importance of history-dependent policies.

Findings

01

Bounds on the sim-to-real gap are established.

02

Transfer can succeed without real-world training samples.

03

Memory in policies enhances transfer success.

Abstract

Reinforcement learning encounters many challenges when applied directly in the real world. Sim-to-real transfer is widely used to transfer the knowledge learned from simulation to the real world. Domain randomization -- one of the most popular algorithms for sim-to-real transfer -- has been demonstrated to be effective in various tasks in robotics and autonomous driving. Despite its empirical successes, theoretical understanding on why this simple algorithm works is limited. In this paper, we propose a theoretical framework for sim-to-real transfers, in which the simulator is modeled as a set of MDPs with tunable parameters (corresponding to unknown physical parameters such as friction). We provide sharp bounds on the sim-to-real gap -- the difference between the value of policy returned by domain randomization and the value of an optimal policy for the real world. We prove that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Understanding Domain Randomization for Sim-to-real Transfer· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Robot Manipulation and Learning