TL;DR
This paper introduces safe mutation operators for neuroevolution that use output gradients to make controlled weight changes, enabling effective evolution of deep and recurrent neural networks without environmental interactions.
Contribution
It proposes gradient-based safe mutation operators that improve neuroevolution's ability to evolve large, deep, and recurrent neural networks.
Findings
Safe mutation operators significantly improve solution discovery in high-dimensional networks.
Gradient-based scaling of mutations enhances the robustness of neuroevolution.
Method enables evolution of networks processing raw pixel data.
Abstract
While neuroevolution (evolving neural networks) has a successful track record across a variety of domains from reinforcement learning to artificial life, it is rarely applied to large, deep neural networks. A central reason is that while random mutation generally works in low dimensions, a random perturbation of thousands or millions of weights is likely to break existing functionality, providing no learning signal even if some individual weight changes were beneficial. This paper proposes a solution by introducing a family of safe mutation (SM) operators that aim within the mutation operator itself to find a degree of change that does not alter network behavior too much, but still facilitates exploration. Importantly, these SM operators do not require any additional interactions with the environment. The most effective SM variant capitalizes on the intriguing opportunity to scale the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
