Collaborative Evolutionary Reinforcement Learning

Shauharda Khadka; Somdeb Majumdar; Tarek Nassar; Zach Dwiel; Evren; Tumer; Santiago Miret; Yinyin Liu; Kagan Tumer

arXiv:1905.00976·cs.LG·October 13, 2020·32 cites

Collaborative Evolutionary Reinforcement Learning

Shauharda Khadka, Somdeb Majumdar, Tarek Nassar, Zach Dwiel, Evren, Tumer, Santiago Miret, Yinyin Liu, Kagan Tumer

PDF

Open Access 1 Repo

TL;DR

This paper introduces CERL, a scalable framework combining multiple reinforcement learning policies and neuroevolution to enhance exploration, sample efficiency, and overall performance in continuous control tasks, outperforming individual learners.

Contribution

The paper presents CERL, a novel framework that integrates multiple learners with neuroevolution, improving exploration and sample efficiency in deep reinforcement learning.

Findings

01

CERL outperforms individual learners on continuous control benchmarks.

02

The emergent learner surpasses the capabilities of its composite learners.

03

CERL achieves significant success on the Mujoco Humanoid benchmark.

Abstract

Deep reinforcement learning algorithms have been successfully applied to a range of challenging control tasks. However, these methods typically struggle with achieving effective exploration and are extremely sensitive to the choice of hyperparameters. One reason is that most approaches use a noisy version of their operating policy to explore - thereby limiting the range of exploration. In this paper, we introduce Collaborative Evolutionary Reinforcement Learning (CERL), a scalable framework that comprises a portfolio of policies that simultaneously explore and exploit diverse regions of the solution space. A collection of learners - typically proven algorithms like TD3 - optimize over varying time-horizons leading to this diverse portfolio. All learners contribute to and use a shared replay buffer to achieve greater sample efficiency. Computational resources are dynamically distributed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

intelai/cerl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolutionary Algorithms and Applications · Reinforcement Learning in Robotics · Advanced Multi-Objective Optimization Algorithms

MethodsExperience Replay · Dense Connections · *Communicated@Fast*How Do I Communicate to Expedia? · Target Policy Smoothing · Clipped Double Q-learning · Adam · Twin Delayed Deep Deterministic