Evolving Populations of Diverse RL Agents with MAP-Elites
Thomas Pierrot, Arthur Flajolet

TL;DR
This paper introduces a flexible framework for evolving diverse populations of RL agents using MAP-Elites, improving sample efficiency and robustness over previous methods that combined RL with Quality Diversity algorithms.
Contribution
The authors propose a novel framework that evolves entire RL agents, not just policies, allowing the use of any RL algorithm and reducing limitations of prior approaches.
Findings
Enhanced sample efficiency in high-dimensional problems
Robustness to hyperparameter sensitivity and stochasticity
Effective in robotics control tasks with deceptive rewards
Abstract
Quality Diversity (QD) has emerged as a powerful alternative optimization paradigm that aims at generating large and diverse collections of solutions, notably with its flagship algorithm MAP-ELITES (ME) which evolves solutions through mutations and crossovers. While very effective for some unstructured problems, early ME implementations relied exclusively on random search to evolve the population of solutions, rendering them notoriously sample-inefficient for high-dimensional problems, such as when evolving neural networks. Follow-up works considered exploiting gradient information to guide the search in order to address these shortcomings through techniques borrowed from either Black-Box Optimization (BBO) or Reinforcement Learning (RL). While mixing RL techniques with ME unlocked state-of-the-art performance for robotics control problems that require a good amount of exploration, it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMetaheuristic Optimization Algorithms Research · Scheduling and Optimization Algorithms · Advanced Multi-Objective Optimization Algorithms
MethodsRandom Search
