TL;DR
This paper introduces neuromodulation as a modular enhancement to policy networks in meta-reinforcement learning, significantly improving task adaptation and dynamic representations across various environments.
Contribution
It proposes a novel neuromodulation component for policy networks, enhancing dynamic representations and adaptability in meta-RL algorithms like CAVIA and PEARL.
Findings
Neuromodulation improves adaptation in complex control tasks.
Enhanced dynamic representations outperform baseline methods.
Applicable across multiple meta-RL algorithms and environments.
Abstract
Meta-reinforcement learning (meta-RL) algorithms enable agents to adapt quickly to tasks from few samples in dynamic environments. Such a feat is achieved through dynamic representations in an agent's policy network (obtained via reasoning about task context, model parameter updates, or both). However, obtaining rich dynamic representations for fast adaptation beyond simple benchmark problems is challenging due to the burden placed on the policy network to accommodate different policies. This paper addresses the challenge by introducing neuromodulation as a modular component to augment a standard policy network that regulates neuronal activities in order to produce efficient dynamic representations for task adaptation. The proposed extension to the policy network is evaluated across multiple discrete and continuous control environments of increasing complexity. To prove the generality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
