Policy Manifold Search for Improving Diversity-based Neuroevolution
Nemanja Rakicevic, Antoine Cully, Petar Kormushev

TL;DR
This paper introduces a novel neuroevolution method that leverages learned low-dimensional manifolds to improve diversity and scalability in policy search for continuous control tasks.
Contribution
It proposes a new approach that combines quality-diversity search with learned latent representations to enhance policy diversity and search efficiency.
Findings
More efficient policy search process
Robustness in diverse policy collection
Improved scalability over existing methods
Abstract
Diversity-based approaches have recently gained popularity as an alternative paradigm to performance-based policy search. A popular approach from this family, Quality-Diversity (QD), maintains a collection of high-performing policies separated in the diversity-metric space, defined based on policies' rollout behaviours. When policies are parameterised as neural networks, i.e. Neuroevolution, QD tends to not scale well with parameter space dimensionality. Our hypothesis is that there exists a low-dimensional manifold embedded in the policy parameter space, containing a high density of diverse and feasible policies. We propose a novel approach to diversity-based policy search via Neuroevolution, that leverages learned latent representations of the policy parameters which capture the local structure of the data. Our approach iteratively collects policies according to the QD framework, in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
