Self-organization of action hierarchy and compositionality by   reinforcement learning with recurrent neural networks

Dongqi Han; Kenji Doya; Jun Tani

arXiv:1901.10113·cs.LG·November 27, 2019·5 cites

Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks

Dongqi Han, Kenji Doya, Jun Tani

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel stochastic RNN architecture for reinforcement learning that autonomously develops action hierarchies and compositionality, leading to improved learning efficiency and re-learning in complex tasks.

Contribution

The paper presents a new multiple-timescale stochastic RNN that self-organizes action hierarchies and compositionality, advancing understanding of neural mechanisms in RL.

Findings

01

The network can learn to abstract sub-goals autonomously.

02

Self-developed compositionality accelerates re-learning on new tasks.

03

Stochastic neural activities improve overall performance.

Abstract

Recurrent neural networks (RNNs) for reinforcement learning (RL) have shown distinct advantages, e.g., solving memory-dependent tasks and meta-learning. However, little effort has been spent on improving RNN architectures and on understanding the underlying neural mechanisms for performance gain. In this paper, we propose a novel, multiple-timescale, stochastic RNN for RL. Empirical results show that the network can autonomously learn to abstract sub-goals and can self-develop an action hierarchy using internal dynamics in a challenging continuous control task. Furthermore, we show that the self-developed compositionality of the network enhances faster re-learning when adapting to a new task that is a re-composition of previously learned sub-goals, than when starting from scratch. We also found that improved performance can be achieved when neural activities are subject to stochastic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

oist-cnru/ReMASTER
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Neural dynamics and brain function · EEG and Brain-Computer Interfaces