Understanding Domain Randomization for Sim-to-real Transfer
Xiaoyu Chen, Jiachen Hu, Chi Jin, Lihong Li, Liwei Wang

TL;DR
This paper provides a theoretical framework for understanding why domain randomization effectively enables sim-to-real transfer in reinforcement learning, offering bounds on the transfer gap and emphasizing the role of memory in policies.
Contribution
It introduces a formal model of sim-to-real transfer with bounds on the transfer gap and demonstrates conditions for success without real-world data, highlighting the importance of history-dependent policies.
Findings
Bounds on the sim-to-real gap are established.
Transfer can succeed without real-world training samples.
Memory in policies enhances transfer success.
Abstract
Reinforcement learning encounters many challenges when applied directly in the real world. Sim-to-real transfer is widely used to transfer the knowledge learned from simulation to the real world. Domain randomization -- one of the most popular algorithms for sim-to-real transfer -- has been demonstrated to be effective in various tasks in robotics and autonomous driving. Despite its empirical successes, theoretical understanding on why this simple algorithm works is limited. In this paper, we propose a theoretical framework for sim-to-real transfers, in which the simulator is modeled as a set of MDPs with tunable parameters (corresponding to unknown physical parameters such as friction). We provide sharp bounds on the sim-to-real gap -- the difference between the value of policy returned by domain randomization and the value of an optimal policy for the real world. We prove that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Robot Manipulation and Learning
