Safe Mutations for Deep and Recurrent Neural Networks through Output   Gradients

Joel Lehman; Jay Chen; Jeff Clune; Kenneth O. Stanley

arXiv:1712.06563·cs.NE·May 3, 2018

Safe Mutations for Deep and Recurrent Neural Networks through Output Gradients

Joel Lehman, Jay Chen, Jeff Clune, Kenneth O. Stanley

PDF

1 Repo

TL;DR

This paper introduces safe mutation operators for neuroevolution that use output gradients to make controlled weight changes, enabling effective evolution of deep and recurrent neural networks without environmental interactions.

Contribution

It proposes gradient-based safe mutation operators that improve neuroevolution's ability to evolve large, deep, and recurrent neural networks.

Findings

01

Safe mutation operators significantly improve solution discovery in high-dimensional networks.

02

Gradient-based scaling of mutations enhances the robustness of neuroevolution.

03

Method enables evolution of networks processing raw pixel data.

Abstract

While neuroevolution (evolving neural networks) has a successful track record across a variety of domains from reinforcement learning to artificial life, it is rarely applied to large, deep neural networks. A central reason is that while random mutation generally works in low dimensions, a random perturbation of thousands or millions of weights is likely to break existing functionality, providing no learning signal even if some individual weight changes were beneficial. This paper proposes a solution by introducing a family of safe mutation (SM) operators that aim within the mutation operator itself to find a degree of change that does not alter network behavior too much, but still facilitates exploration. Importantly, these SM operators do not require any additional interactions with the environment. The most effective SM variant capitalizes on the intriguing opportunity to scale the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uber-common/safemutations
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.