Hypernetworks in Meta-Reinforcement Learning

Jacob Beck; Matthew Thomas Jackson; Risto Vuorio; Shimon Whiteson

arXiv:2210.11348·cs.LG·October 21, 2022·5 cites

Hypernetworks in Meta-Reinforcement Learning

Jacob Beck, Matthew Thomas Jackson, Risto Vuorio, Shimon Whiteson

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that proper hypernetwork initialization significantly improves meta-reinforcement learning performance, introducing a novel initialization scheme that enhances generalization across tasks in robotics benchmarks.

Contribution

The authors identify hypernetwork initialization as crucial in meta-RL and propose a new, effective initialization method that outperforms existing approaches.

Findings

01

Hypernetwork initialization critically affects meta-RL performance.

02

The proposed initialization scheme matches or exceeds state-of-the-art results.

03

Hypernetworks improve task generalization in robotics benchmarks.

Abstract

Training a reinforcement learning (RL) agent on a real-world robotics task remains generally impractical due to sample inefficiency. Multi-task RL and meta-RL aim to improve sample efficiency by generalizing over a distribution of related tasks. However, doing so is difficult in practice: In multi-task RL, state of the art methods often fail to outperform a degenerate solution that simply learns each task separately. Hypernetworks are a promising path forward since they replicate the separate policies of the degenerate solution while also allowing for generalization across tasks, and are applicable to meta-RL. However, evidence from supervised learning suggests hypernetwork performance is highly sensitive to the initialization. In this paper, we 1) show that hypernetwork initialization is also a critical factor in meta-RL, and that naive initializations yield poor performance; 2)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jacooba/hyper
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Mobile Crowdsensing and Crowdsourcing

MethodsHyperNetwork