Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative   for Training Deep Neural Networks for Reinforcement Learning

Felipe Petroski Such; Vashisht Madhavan; Edoardo Conti; Joel Lehman,; Kenneth O. Stanley; Jeff Clune

arXiv:1712.06567·cs.NE·April 24, 2018·557 cites

Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning

Felipe Petroski Such, Vashisht Madhavan, Edoardo Conti, Joel Lehman,, Kenneth O. Stanley, Jeff Clune

PDF

Open Access 5 Repos 1 Models

TL;DR

This paper demonstrates that a simple, gradient-free genetic algorithm can effectively train deep neural networks for reinforcement learning tasks, achieving competitive results at unprecedented scales and speed, challenging the dominance of gradient-based methods.

Contribution

It introduces a large-scale, gradient-free genetic algorithm for training deep neural networks, showing competitive performance and faster training times than traditional gradient-based algorithms.

Findings

01

Genetic algorithms can evolve DNNs with over four million parameters.

02

Deep GA outperforms or matches state-of-the-art RL algorithms on Atari and humanoid tasks.

03

Combining DNNs with novelty search solves high-dimensional problems where other methods fail.

Abstract

Deep artificial neural networks (DNNs) are typically trained via gradient-based learning algorithms, namely backpropagation. Evolution strategies (ES) can rival backprop-based algorithms such as Q-learning and policy gradients on challenging deep reinforcement learning (RL) problems. However, ES can be considered a gradient-based algorithm because it performs stochastic gradient descent via an operation similar to a finite-difference approximation of the gradient. That raises the question of whether non-gradient-based evolutionary algorithms can work at DNN scales. Here we demonstrate they can: we evolve the weights of a DNN with a simple, gradient-free, population-based genetic algorithm (GA) and it performs well on hard deep RL problems, including Atari and humanoid locomotion. The Deep GA successfully evolves networks with over four million free parameters, the largest neural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

🤗
AIRRC/ELN-Llama-1B-base
model· 12 dl· ♡ 3
12 dl♡ 3

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Modular Robots and Swarm Intelligence

MethodsEntropy Regularization · Convolution · Softmax · A3C · Dense Connections · Deep Q-Network · Q-Learning