Generating Automatic Curricula via Self-Supervised Active Domain Randomization

Sharath Chandra Raparthy; Bhairav Mehta; Florian Golemo; Liam Paull

arXiv:2002.07911·cs.LG·October 28, 2020·5 cites

TL;DR

This paper introduces SS-ADR, a self-supervised method that jointly learns goal and environment curricula through domain randomization, significantly improving sim2real transfer in goal-directed reinforcement learning.

Contribution

It extends self-play to include environment variation, creating a coupled curriculum for goals and domain randomization, enhancing transfer robustness.

Findings

1. Achieves state-of-the-art results on sim2real transfer tasks.
2. Demonstrates the effectiveness of co-evolving environment and goal difficulty.
3. Builds a curriculum that adapts to the agent's current capabilities.

Abstract

Goal-directed Reinforcement Learning (RL) traditionally considers an agent interacting with an environment, prescribing a real-valued reward to an agent proportional to the completion of some goal. Goal-directed RL has seen large gains in sample efficiency, due to the ease of reusing or generating new experience by proposing goals. One approach, self-play, allows an agent to "play" against itself by alternately setting and accomplishing goals, creating a learned curriculum through which an agent can learn to accomplish progressively more difficult goals. However, self-play has been limited to goal curriculum learning, that is, learning progressively harder goals within a single environment. Recent work on robotic agents has shown that varying the environment during training, for example with domain randomization, leads to more robust transfer. As a result, we extend the self-play framework to…
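To make the two ingredients in the abstract concrete, the following is a minimal toy sketch, not the authors' implementation. It pairs a goal setter and a goal solver in an alternating loop and draws one environment parameter per episode by uniform domain randomization. Every name here (`sample_environment`, `setter_propose_goal`, `solver_attempt`, `run_self_play`, the `friction` parameter, and the multiplicative difficulty rule) is a hypothetical stand-in chosen for illustration. In particular, the hand-written difficulty rule and the uniform sampler replace the learned policies and learned environment distribution that the paper describes.

```python
import random


def sample_environment(rng, friction_range=(0.5, 1.5)):
    """Uniform domain randomization: draw one (hypothetical) parameter per episode."""
    low, high = friction_range
    return {"friction": rng.uniform(low, high)}


def setter_propose_goal(rng, difficulty):
    """Hypothetical goal setter: propose a 1-D target no farther than `difficulty`."""
    return rng.uniform(0.0, difficulty)


def solver_attempt(goal, env, max_steps=50, step_size=0.1):
    """Hypothetical goal solver: walk toward the goal; higher friction shrinks each step."""
    position = 0.0
    for step in range(1, max_steps + 1):
        position += step_size / env["friction"]
        if position >= goal:
            return True, step
    return False, max_steps


def run_self_play(episodes=200, seed=0):
    """Alternate goal setting and goal solving under randomized environments."""
    rng = random.Random(seed)
    difficulty = 0.5
    history = []
    for _ in range(episodes):
        env = sample_environment(rng)
        goal = setter_propose_goal(rng, difficulty)
        solved, steps = solver_attempt(goal, env)
        # Assumed curriculum rule: harder goals after a success, easier after a failure.
        difficulty = difficulty * 1.05 if solved else max(0.1, difficulty * 0.9)
        history.append((goal, env["friction"], solved, steps))
    return difficulty, history


if __name__ == "__main__":
    final_difficulty, history = run_self_play()
    solved_count = sum(1 for _, _, solved, _ in history if solved)
    print(f"final difficulty: {final_difficulty:.2f}, solved: {solved_count}/{len(history)}")
```

In this toy, goal difficulty rises until goals start to exceed what the solver can reach under the least favourable sampled friction, which is the sense in which a curriculum tracks the agent's current capability.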

Code & Models

Repositories

montrealrobotics/unsupervised-adr (PyTorch)

Taxonomy

Topics: Reinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Robot Manipulation and Learning