Recurrent Hypernetworks are Surprisingly Strong in Meta-RL

Jacob Beck; Risto Vuorio; Zheng Xiong; Shimon Whiteson

arXiv:2309.14970·cs.LG·December 27, 2023

TL;DR

This paper shows empirically that a simple recurrent network, when combined with hypernetworks, is a strong meta-reinforcement learning baseline that can outperform more specialized methods.

Contribution

The study shows that combining hypernetworks with recurrent models yields surprisingly strong meta-RL performance, surpassing more complex specialized approaches.

Findings

01. Recurrent hypernetworks outperform existing meta-RL methods.

02. Hypernetworks are crucial for maximizing recurrent model performance.

03. Simple recurrent hypernetworks achieve state-of-the-art results.

Abstract

Deep reinforcement learning (RL) is notoriously impractical to deploy due to sample inefficiency. Meta-RL directly addresses this sample inefficiency by learning to perform few-shot learning when a distribution of related tasks is available for meta-training. While many specialized meta-RL methods have been proposed, recent work suggests that end-to-end learning in conjunction with an off-the-shelf sequential model, such as a recurrent network, is a surprisingly strong baseline. However, such claims have been controversial due to limited supporting evidence, particularly in the face of prior work establishing precisely the opposite. In this paper, we conduct an empirical investigation. While we likewise find that a recurrent network can achieve strong performance, we demonstrate that the use of hypernetworks is crucial to maximizing their potential. Surprisingly, when combined with…
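To make the idea of a recurrent hypernetwork concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the authors' implementation (the official repository uses PyTorch). It assumes a vanilla tanh RNN that summarizes the history of observations and rewards, and a linear hypernetwork that maps the hidden state to the weights and biases of a small linear policy head. The class name `RecurrentHyperPolicy`, the layer sizes, and the initialization scale are all hypothetical.

```python
import math
import random


def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def rand_matrix(rows, cols, rng, scale=0.1):
    """Small Gaussian-initialized matrix (hypothetical init, for illustration)."""
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]


class RecurrentHyperPolicy:
    """Toy model: history -> RNN hidden state -> hypernetwork -> policy weights."""

    def __init__(self, obs_dim, hidden_dim, n_actions, seed=0):
        rng = random.Random(seed)
        self.obs_dim = obs_dim
        self.hidden_dim = hidden_dim
        self.n_actions = n_actions
        # Vanilla RNN parameters; the input is the observation plus the previous reward.
        self.W_in = rand_matrix(hidden_dim, obs_dim + 1, rng)
        self.W_h = rand_matrix(hidden_dim, hidden_dim, rng)
        # Hypernetwork: a linear map from the hidden state to the flattened
        # weights and biases of a linear policy head.
        n_policy_params = n_actions * obs_dim + n_actions
        self.W_hyper = rand_matrix(n_policy_params, hidden_dim, rng)

    def initial_state(self):
        return [0.0] * self.hidden_dim

    def step(self, obs, prev_reward, h):
        # 1. Update the recurrent state from the newest transition.
        x = list(obs) + [prev_reward]
        pre = [a + b for a, b in zip(matvec(self.W_in, x), matvec(self.W_h, h))]
        h_new = [math.tanh(p) for p in pre]
        # 2. Generate the policy head's parameters from the hidden state.
        theta = matvec(self.W_hyper, h_new)
        split = self.n_actions * self.obs_dim
        W_pi = [theta[i * self.obs_dim:(i + 1) * self.obs_dim]
                for i in range(self.n_actions)]
        b_pi = theta[split:]
        # 3. Apply the generated policy to the current observation.
        logits = [l + b for l, b in zip(matvec(W_pi, obs), b_pi)]
        m = max(logits)
        exps = [math.exp(l - m) for l in logits]
        z = sum(exps)
        return [e / z for e in exps], h_new


policy = RecurrentHyperPolicy(obs_dim=3, hidden_dim=8, n_actions=2)
h = policy.initial_state()
probs, h = policy.step([0.5, -1.0, 0.2], prev_reward=0.0, h=h)
```

In this sketch the task-dependent information carried by the hidden state does not enter the policy as an extra input. It determines the policy's weights directly, so the same observation can yield different action probabilities after different histories. A trainable version would learn `W_in`, `W_h`, and `W_hyper` end to end with an RL objective, which this sketch omits.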

Code & Models

Repositories

jacooba/hyper (PyTorch, official)

Videos

Recurrent Hypernetworks are Surprisingly Strong in Meta-RL · SlidesLive

Taxonomy

Topics: Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics