How to pick the domain randomization parameters for sim-to-real transfer   of reinforcement learning policies?

Quan Vuong; Sharad Vikram; Hao Su; Sicun Gao; Henrik I. Christensen

arXiv:1903.11774·cs.LG·March 29, 2019·22 cites

How to pick the domain randomization parameters for sim-to-real transfer of reinforcement learning policies?

Quan Vuong, Sharad Vikram, Hao Su, Sicun Gao, Henrik I. Christensen

PDF

Open Access 1 Repo

TL;DR

This paper investigates how to optimally select domain randomization parameters in simulation to improve the transfer of reinforcement learning policies to real-world robotics, highlighting the importance of distribution choice.

Contribution

It demonstrates that the distribution parameters significantly impact real-world performance and proposes methods to optimize these parameters for better sim-to-real transfer.

Findings

01

Distribution choice greatly affects real-world policy performance.

02

Optimizing distribution parameters improves transfer success.

03

Hand-tuned distributions are less effective than optimized ones.

Abstract

Recently, reinforcement learning (RL) algorithms have demonstrated remarkable success in learning complicated behaviors from minimally processed input. However, most of this success is limited to simulation. While there are promising successes in applying RL algorithms directly on real systems, their performance on more complex systems remains bottle-necked by the relative data inefficiency of RL algorithms. Domain randomization is a promising direction of research that has demonstrated impressive results using RL algorithms to control real robots. At a high level, domain randomization works by training a policy on a distribution of environmental conditions in simulation. If the environments are diverse enough, then the policy trained on this distribution will plausibly generalize to the real world. A human-specified design choice in domain randomization is the form and parameters of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

quanvuong/domain_randomization
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Advanced Multi-Objective Optimization Algorithms