Goal-Conditioned Generators of Deep Policies

Francesco Faccio; Vincent Herrmann; Aditya Ramesh; Louis Kirsch,; J\"urgen Schmidhuber

arXiv:2207.01570·cs.LG·July 5, 2022·1 cites

Goal-Conditioned Generators of Deep Policies

Francesco Faccio, Vincent Herrmann, Aditya Ramesh, Louis Kirsch,, J\"urgen Schmidhuber

PDF

Open Access 2 Repos

TL;DR

This paper introduces goal-conditioned neural network generators that produce deep policies as weight matrices, enabling flexible, goal-specific policy creation with competitive results in continuous control tasks.

Contribution

It presents a novel method for generating deep neural network policies conditioned on goals, combining hypernetworks and policy embeddings for scalable, goal-specific policy generation.

Findings

01

Single policy generator achieves multiple return levels.

02

Method scales to deep neural networks.

03

Competitive performance on continuous control tasks.

Abstract

Goal-conditioned Reinforcement Learning (RL) aims at learning optimal policies, given goals encoded in special command inputs. Here we study goal-conditioned neural nets (NNs) that learn to generate deep NN policies in form of context-specific weight matrices, similar to Fast Weight Programmers and other methods from the 1990s. Using context commands of the form "generate a policy that achieves a desired expected return," our NN generators combine powerful exploration of parameter space with generalization across commands to iteratively find better and better policies. A form of weight-sharing HyperNetworks and policy embeddings scales our method to generate deep NNs. Experiments show how a single learned policy generator can produce policies that achieve any return seen during training. Finally, we evaluate our algorithm on a set of continuous control tasks where it exhibits…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Reinforcement Learning in Robotics