Diversity is All You Need: Learning Skills without a Reward Function

Benjamin Eysenbach; Abhishek Gupta; Julian Ibarz; Sergey Levine

arXiv:1802.06070·cs.AI·October 11, 2018·97 cites

Diversity is All You Need: Learning Skills without a Reward Function

Benjamin Eysenbach, Abhishek Gupta, Julian Ibarz, Sergey Levine

PDF

Open Access 4 Repos

TL;DR

This paper introduces DIAYN, a method for unsupervised skill learning that maximizes diversity without reward signals, enabling agents to develop useful behaviors and improve downstream task performance.

Contribution

The paper presents DIAYN, a novel unsupervised learning approach that discovers diverse skills through an information-theoretic objective, reducing reliance on reward functions in reinforcement learning.

Findings

01

Unsupervised skills include walking and jumping behaviors.

02

Pretrained skills improve data efficiency in downstream tasks.

03

Skills can be combined hierarchically to solve complex problems.

Abstract

Intelligent creatures can explore their environments and learn useful skills without supervision. In this paper, we propose DIAYN ('Diversity is All You Need'), a method for learning useful skills without a reward function. Our proposed method learns skills by maximizing an information theoretic objective using a maximum entropy policy. On a variety of simulated robotic tasks, we show that this simple objective results in the unsupervised emergence of diverse skills, such as walking and jumping. In a number of reinforcement learning benchmark environments, our method is able to learn a skill that solves the benchmark task despite never receiving the true task reward. We show how pretrained skills can provide a good parameter initialization for downstream tasks, and can be composed hierarchically to solve complex, sparse reward tasks. Our results suggest that unsupervised discovery of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Single-cell and spatial transcriptomics · Evolutionary Algorithms and Applications