Learning Task Agnostic Skills with Data-driven Guidance

Even Klemsdal; Sverre Herland; Abdulmajid Murad

arXiv:2108.01869 · cs.AI · August 5, 2021

TL;DR

This paper introduces a framework that guides unsupervised skill discovery in reinforcement learning towards expert-visited states using a learned state projection, resulting in more useful behaviors without task-specific rewards.

Contribution

It presents a novel data-driven guidance method for skill discovery that focuses on expert-visited states to improve the usefulness of learned behaviors.

Findings

01. Guided skill discovery produces more relevant behaviors.
02. The method increases autonomy without relying on task-specific rewards.
03. The method is effective across various RL tasks.

Abstract

To increase autonomy in reinforcement learning, agents need to learn useful behaviours without reliance on manually designed reward functions. To that end, skill discovery methods have been used to learn the intrinsic options available to an agent using task-agnostic objectives. However, without the guidance of task-specific rewards, emergent behaviours are generally useless due to the under-constrained problem of skill discovery in complex and high-dimensional spaces. This paper proposes a framework for guiding the skill discovery towards the subset of expert-visited states using a learned state projection. We apply our method in various reinforcement learning (RL) tasks and show that such a projection results in more useful behaviours.
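As a rough illustration of the idea in the abstract, the sketch below restricts a skill-discovery reward to a projection fitted on expert-visited states. Everything here is an assumption made for illustration: the abstract does not say how the projection is learned or which task-agnostic objective is used, so a PCA projection stands in for the learned state projection, and a discriminator-style reward, log q(z | phi(s)) - log p(z) with a uniform skill prior, stands in for the objective. The names `fit_projection`, `project`, and `skill_reward` are hypothetical.

```python
import numpy as np

def fit_projection(expert_states, k):
    """Fit a k-dimensional linear projection to expert states.

    Assumption: PCA is used here as a simple stand-in for the paper's
    learned state projection, whose form the abstract does not specify.
    """
    mean = expert_states.mean(axis=0)
    # Rows of vt are principal directions, ordered by explained variance.
    _, _, vt = np.linalg.svd(expert_states - mean, full_matrices=False)
    return mean, vt[:k].T  # basis has shape (state_dim, k)

def project(states, mean, basis):
    """Map raw states into the expert-derived subspace."""
    return (states - mean) @ basis

def skill_reward(logits, skills):
    """Discriminator-style intrinsic reward: log q(z | phi(s)) - log p(z).

    Assumption: `logits` are the outputs of some skill discriminator that
    was fed projected states, and the skill prior p(z) is uniform.
    """
    n_skills = logits.shape[-1]
    # Numerically stable log-softmax over the skill dimension.
    shifted = logits - logits.max(axis=-1, keepdims=True)
    log_q = shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))
    return log_q[np.arange(len(skills)), skills] + np.log(n_skills)

# Toy expert data: 5-D states that only vary along the first two dimensions.
rng = np.random.default_rng(0)
expert_states = rng.normal(size=(200, 5)) * np.array([1.0, 1.0, 0.0, 0.0, 0.0])
mean, basis = fit_projection(expert_states, k=2)

# Two states that differ only in dimensions the expert never varied.
a = np.array([[0.5, -1.0, 0.0, 0.0, 0.0]])
b = np.array([[0.5, -1.0, 3.0, -2.0, 7.0]])
same = np.allclose(project(a, mean, basis), project(b, mean, basis))
```

In this toy setup the two states `a` and `b` map to the same projected point, so a discriminator that only sees projected states cannot reward skills for differing along dimensions the expert data never explored. That is one way to read "guiding the skill discovery towards the subset of expert-visited states", though the authors' actual mechanism may differ.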

Code & Models

Repositories

sherilan/cs285-project (Official)

Taxonomy

Topics: Reinforcement Learning in Robotics · Robot Manipulation and Learning · Evolutionary Algorithms and Applications