Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning
Felipe Petroski Such, Vashisht Madhavan, Edoardo Conti, Joel Lehman,, Kenneth O. Stanley, Jeff Clune

TL;DR
This paper demonstrates that a simple, gradient-free genetic algorithm can effectively train deep neural networks for reinforcement learning tasks, achieving competitive results at unprecedented scales and speed, challenging the dominance of gradient-based methods.
Contribution
It introduces a large-scale, gradient-free genetic algorithm for training deep neural networks, showing competitive performance and faster training times than traditional gradient-based algorithms.
Findings
Genetic algorithms can evolve DNNs with over four million parameters.
Deep GA outperforms or matches state-of-the-art RL algorithms on Atari and humanoid tasks.
Combining DNNs with novelty search solves high-dimensional problems where other methods fail.
Abstract
Deep artificial neural networks (DNNs) are typically trained via gradient-based learning algorithms, namely backpropagation. Evolution strategies (ES) can rival backprop-based algorithms such as Q-learning and policy gradients on challenging deep reinforcement learning (RL) problems. However, ES can be considered a gradient-based algorithm because it performs stochastic gradient descent via an operation similar to a finite-difference approximation of the gradient. That raises the question of whether non-gradient-based evolutionary algorithms can work at DNN scales. Here we demonstrate they can: we evolve the weights of a DNN with a simple, gradient-free, population-based genetic algorithm (GA) and it performs well on hard deep RL problems, including Atari and humanoid locomotion. The Deep GA successfully evolves networks with over four million free parameters, the largest neural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Modular Robots and Swarm Intelligence
MethodsEntropy Regularization · Convolution · Softmax · A3C · Dense Connections · Deep Q-Network · Q-Learning
