Policy Manifold Search for Improving Diversity-based Neuroevolution

Nemanja Rakicevic; Antoine Cully; Petar Kormushev

arXiv:2012.08676·cs.LG·December 17, 2020

Policy Manifold Search for Improving Diversity-based Neuroevolution

Nemanja Rakicevic, Antoine Cully, Petar Kormushev

PDF

Open Access

TL;DR

This paper introduces a novel neuroevolution method that leverages learned low-dimensional manifolds to improve diversity and scalability in policy search for continuous control tasks.

Contribution

It proposes a new approach that combines quality-diversity search with learned latent representations to enhance policy diversity and search efficiency.

Findings

01

More efficient policy search process

02

Robustness in diverse policy collection

03

Improved scalability over existing methods

Abstract

Diversity-based approaches have recently gained popularity as an alternative paradigm to performance-based policy search. A popular approach from this family, Quality-Diversity (QD), maintains a collection of high-performing policies separated in the diversity-metric space, defined based on policies' rollout behaviours. When policies are parameterised as neural networks, i.e. Neuroevolution, QD tends to not scale well with parameter space dimensionality. Our hypothesis is that there exists a low-dimensional manifold embedded in the policy parameter space, containing a high density of diverse and feasible policies. We propose a novel approach to diversity-based policy search via Neuroevolution, that leverages learned latent representations of the policy parameters which capture the local structure of the data. Our approach iteratively collects policies according to the QD framework, in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM