Collaborative Evolutionary Reinforcement Learning
Shauharda Khadka, Somdeb Majumdar, Tarek Nassar, Zach Dwiel, Evren, Tumer, Santiago Miret, Yinyin Liu, Kagan Tumer

TL;DR
This paper introduces CERL, a scalable framework combining multiple reinforcement learning policies and neuroevolution to enhance exploration, sample efficiency, and overall performance in continuous control tasks, outperforming individual learners.
Contribution
The paper presents CERL, a novel framework that integrates multiple learners with neuroevolution, improving exploration and sample efficiency in deep reinforcement learning.
Findings
CERL outperforms individual learners on continuous control benchmarks.
The emergent learner surpasses the capabilities of its composite learners.
CERL achieves significant success on the Mujoco Humanoid benchmark.
Abstract
Deep reinforcement learning algorithms have been successfully applied to a range of challenging control tasks. However, these methods typically struggle with achieving effective exploration and are extremely sensitive to the choice of hyperparameters. One reason is that most approaches use a noisy version of their operating policy to explore - thereby limiting the range of exploration. In this paper, we introduce Collaborative Evolutionary Reinforcement Learning (CERL), a scalable framework that comprises a portfolio of policies that simultaneously explore and exploit diverse regions of the solution space. A collection of learners - typically proven algorithms like TD3 - optimize over varying time-horizons leading to this diverse portfolio. All learners contribute to and use a shared replay buffer to achieve greater sample efficiency. Computational resources are dynamically distributed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Reinforcement Learning in Robotics · Advanced Multi-Objective Optimization Algorithms
MethodsExperience Replay · Dense Connections · *Communicated@Fast*How Do I Communicate to Expedia? · Target Policy Smoothing · Clipped Double Q-learning · Adam · Twin Delayed Deep Deterministic
