Context Meta-Reinforcement Learning via Neuromodulation

Eseoghene Ben-Iwhiwhu; Jeffery Dick; Nicholas A. Ketz; Praveen K.; Pilly; Andrea Soltoggio

arXiv:2111.00134·cs.NE·April 27, 2022

Context Meta-Reinforcement Learning via Neuromodulation

Eseoghene Ben-Iwhiwhu, Jeffery Dick, Nicholas A. Ketz, Praveen K., Pilly, Andrea Soltoggio

PDF

2 Repos

TL;DR

This paper introduces neuromodulation as a modular enhancement to policy networks in meta-reinforcement learning, significantly improving task adaptation and dynamic representations across various environments.

Contribution

It proposes a novel neuromodulation component for policy networks, enhancing dynamic representations and adaptability in meta-RL algorithms like CAVIA and PEARL.

Findings

01

Neuromodulation improves adaptation in complex control tasks.

02

Enhanced dynamic representations outperform baseline methods.

03

Applicable across multiple meta-RL algorithms and environments.

Abstract

Meta-reinforcement learning (meta-RL) algorithms enable agents to adapt quickly to tasks from few samples in dynamic environments. Such a feat is achieved through dynamic representations in an agent's policy network (obtained via reasoning about task context, model parameter updates, or both). However, obtaining rich dynamic representations for fast adaptation beyond simple benchmark problems is challenging due to the burden placed on the policy network to accommodate different policies. This paper addresses the challenge by introducing neuromodulation as a modular component to augment a standard policy network that regulates neuronal activities in order to produce efficient dynamic representations for task adaptation. The proposed extension to the policy network is evaluated across multiple discrete and continuous control environments of increasing complexity. To prove the generality…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.