Generating Automatic Curricula via Self-Supervised Active Domain Randomization
Sharath Chandra Raparthy, Bhairav Mehta, Florian Golemo, Liam Paull

TL;DR
This paper introduces SS-ADR, a self-supervised method that jointly learns goal and environment curricula through domain randomization, significantly improving sim2real transfer in goal-directed reinforcement learning.
Contribution
It extends self-play to include environment variation, creating a coupled curriculum for goals and domain randomization, enhancing transfer robustness.
Findings
Achieves state-of-the-art results on sim2real transfer tasks.
Demonstrates the effectiveness of co-evolving environment and goal difficulty.
Builds a curriculum that adapts to the agent's current capabilities.
Abstract
Goal-directed Reinforcement Learning (RL) traditionally considers an agent interacting with an environment, prescribing a real-valued reward to an agent proportional to the completion of some goal. Goal-directed RL has seen large gains in sample efficiency, due to the ease of reusing or generating new experience by proposing goals. One approach,self-play, allows an agent to "play" against itself by alternatively setting and accomplishing goals, creating a learned curriculum through which an agent can learn to accomplish progressively more difficult goals. However, self-play has been limited to goal curriculum learning or learning progressively harder goals within a single environment. Recent work on robotic agents has shown that varying the environment during training, for example with domain randomization, leads to more robust transfer. As a result, we extend the self-play framework to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Robot Manipulation and Learning
