Meta Learning Shared Hierarchies

Kevin Frans; Jonathan Ho; Xi Chen; Pieter Abbeel; John Schulman

arXiv:1710.09767·cs.LG·October 27, 2017·117 cites

Meta Learning Shared Hierarchies

Kevin Frans, Jonathan Ho, Xi Chen, Pieter Abbeel, John Schulman

PDF

Open Access 3 Repos 1 Video

TL;DR

This paper introduces a meta-learning approach for hierarchical policy learning using shared primitives, enhancing sample efficiency and transferability across diverse tasks and robotic platforms.

Contribution

It proposes a concrete metric for hierarchy strength and an end-to-end algorithm that leverages off-the-shelf RL methods for learning shared primitives.

Findings

01

Discovered motor primitives for four-legged robots in maze environments.

02

Transferred primitives to long-timescale obstacle courses.

03

Enabled humanoid robots to walk and crawl with the same policy.

Abstract

We develop a metalearning approach for learning hierarchically structured policies, improving sample efficiency on unseen tasks through the use of shared primitives---policies that are executed for large numbers of timesteps. Specifically, a set of primitives are shared within a distribution of tasks, and are switched between by task-specific policies. We provide a concrete metric for measuring the strength of such hierarchies, leading to an optimization problem for quickly reaching high reward on unseen tasks. We then present an algorithm to solve this problem end-to-end through the use of any off-the-shelf reinforcement learning method, by repeatedly sampling new tasks and resetting task-specific policies. We successfully discover meaningful motor primitives for the directional movement of four-legged robots, solely by interacting with distributions of mazes. We also demonstrate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Meta Learning Shared Hierarchies | Two Minute Papers #210· youtube

Taxonomy

TopicsReinforcement Learning in Robotics · Robotic Locomotion and Control · Software Testing and Debugging Techniques