FeUdal Networks for Hierarchical Reinforcement Learning

Alexander Sasha Vezhnevets; Simon Osindero; Tom Schaul; Nicolas Heess,; Max Jaderberg; David Silver; Koray Kavukcuoglu

arXiv:1703.01161·cs.AI·March 7, 2017·252 cites

FeUdal Networks for Hierarchical Reinforcement Learning

Alexander Sasha Vezhnevets, Simon Osindero, Tom Schaul, Nicolas Heess,, Max Jaderberg, David Silver, Koray Kavukcuoglu

PDF

Open Access 1 Repo

TL;DR

FeUdal Networks (FuNs) introduce a hierarchical reinforcement learning architecture with a Manager and Worker, enabling long-term credit assignment and sub-policy emergence, leading to superior performance on complex tasks.

Contribution

The paper presents a novel hierarchical RL architecture inspired by feudal reinforcement learning, with decoupled modules for different temporal resolutions, improving long-term learning.

Findings

01

Outperforms baseline agents on long-term credit tasks

02

Facilitates emergence of sub-policies for different goals

03

Effective on Atari and DeepMind Lab environments

Abstract

We introduce FeUdal Networks (FuNs): a novel architecture for hierarchical reinforcement learning. Our approach is inspired by the feudal reinforcement learning proposal of Dayan and Hinton, and gains power and efficacy by decoupling end-to-end learning across multiple levels -- allowing it to utilise different resolutions of time. Our framework employs a Manager module and a Worker module. The Manager operates at a lower temporal resolution and sets abstract goals which are conveyed to and enacted by the Worker. The Worker generates primitive actions at every tick of the environment. The decoupled structure of FuN conveys several benefits -- in addition to facilitating very long timescale credit assignment it also encourages the emergence of sub-policies associated with different goals set by the Manager. These properties allow FuN to dramatically outperform a strong baseline agent on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

4rChon/NL-FuN
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics