Effective Diversity in Population Based Reinforcement Learning
Jack Parker-Holder, Aldo Pacchiano, Krzysztof Choromanski and, Stephen Roberts

TL;DR
This paper introduces Diversity via Determinants (DvD), a novel method for enhancing behavioral diversity in population-based reinforcement learning by measuring the volume of the population in a behavioral space, improving exploration efficiency.
Contribution
The paper proposes a new approach to optimize entire populations simultaneously using volume in behavioral space, avoiding pairwise distances and domain-specific representations.
Findings
DvD improves exploration without harming reward optimization.
Both evolutionary and gradient-based DvD effectively increase behavioral diversity.
The method adapts diversity levels during training using online learning.
Abstract
Exploration is a key problem in reinforcement learning, since agents can only learn from data they acquire in the environment. With that in mind, maintaining a population of agents is an attractive method, as it allows data be collected with a diverse set of behaviors. This behavioral diversity is often boosted via multi-objective loss functions. However, those approaches typically leverage mean field updates based on pairwise distances, which makes them susceptible to cycling behaviors and increased redundancy. In addition, explicitly boosting diversity often has a detrimental impact on optimizing already fruitful behaviors for rewards. As such, the reward-diversity trade off typically relies on heuristics. Finally, such methods require behavioral representations, often handcrafted and domain specific. In this paper, we introduce an approach to optimize all members of a population…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Supply Chain and Inventory Management
