ELSIM: End-to-end learning of reusable skills through intrinsic   motivation

Arthur Aubret; Laetitia Matignon; Salima Hassas

arXiv:2006.12903·cs.AI·June 24, 2020

ELSIM: End-to-end learning of reusable skills through intrinsic motivation

Arthur Aubret, Laetitia Matignon, Salima Hassas

PDF

Open Access

TL;DR

This paper introduces ELSIM, a hierarchical reinforcement learning framework that autonomously learns transferable skills through intrinsic motivation, enhancing exploration and performance in complex environments.

Contribution

ELSIM presents a novel end-to-end architecture combining mutual information and curriculum learning to develop an explorable skill tree for reinforcement learning agents.

Findings

01

Skills are transferable across tasks.

02

Improves exploration in sparse reward settings.

03

Scales to complex MuJoCo environments.

Abstract

Taking inspiration from developmental learning, we present a novel reinforcement learning architecture which hierarchically learns and represents self-generated skills in an end-to-end way. With this architecture, an agent focuses only on task-rewarded skills while keeping the learning process of skills bottom-up. This bottom-up approach allows to learn skills that 1- are transferable across tasks, 2- improves exploration when rewards are sparse. To do so, we combine a previously defined mutual information objective with a novel curriculum learning algorithm, creating an unlimited and explorable tree of skills. We test our agent on simple gridworld environments to understand and visualize how the agent distinguishes between its skills. Then we show that our approach can scale on more difficult MuJoCo environments in which our agent is able to build a representation of skills which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Intelligent Tutoring Systems and Adaptive Learning · Multimodal Machine Learning Applications