Evolving Populations of Diverse RL Agents with MAP-Elites

Thomas Pierrot; Arthur Flajolet

arXiv:2303.12803·cs.NE·June 16, 2023·1 cites

Evolving Populations of Diverse RL Agents with MAP-Elites

Thomas Pierrot, Arthur Flajolet

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a flexible framework for evolving diverse populations of RL agents using MAP-Elites, improving sample efficiency and robustness over previous methods that combined RL with Quality Diversity algorithms.

Contribution

The authors propose a novel framework that evolves entire RL agents, not just policies, allowing the use of any RL algorithm and reducing limitations of prior approaches.

Findings

01

Enhanced sample efficiency in high-dimensional problems

02

Robustness to hyperparameter sensitivity and stochasticity

03

Effective in robotics control tasks with deceptive rewards

Abstract

Quality Diversity (QD) has emerged as a powerful alternative optimization paradigm that aims at generating large and diverse collections of solutions, notably with its flagship algorithm MAP-ELITES (ME) which evolves solutions through mutations and crossovers. While very effective for some unstructured problems, early ME implementations relied exclusively on random search to evolve the population of solutions, rendering them notoriously sample-inefficient for high-dimensional problems, such as when evolving neural networks. Follow-up works considered exploiting gradient information to guide the search in order to address these shortcomings through techniques borrowed from either Black-Box Optimization (BBO) or Reinforcement Learning (RL). While mixing RL techniques with ME unlocked state-of-the-art performance for robotics control problems that require a good amount of exploration, it…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adaptive-intelligent-robotics/qdax
jaxOfficial

Videos

Evolving Populations of Diverse RL Agents with MAP-Elites· slideslive

Taxonomy

TopicsMetaheuristic Optimization Algorithms Research · Scheduling and Optimization Algorithms · Advanced Multi-Objective Optimization Algorithms

MethodsRandom Search