Efficacy of Modern Neuro-Evolutionary Strategies for Continuous Control   Optimization

Paolo Pagliuca; Nicola Milano; and Stefano Nolfi

arXiv:1912.05239·cs.NE·June 2, 2020

Efficacy of Modern Neuro-Evolutionary Strategies for Continuous Control Optimization

Paolo Pagliuca, Nicola Milano, and Stefano Nolfi

PDF

1 Repo

TL;DR

This paper evaluates modern neuro-evolutionary strategies for continuous control, showing their effectiveness, scalability, and robustness, while highlighting differences in reward function optimization between reinforcement learning and evolutionary methods.

Contribution

It provides a comprehensive comparison of neuro-evolutionary algorithms, demonstrating the superior or equal performance of OpenAI-ES and revealing biases in reward function optimization.

Findings

01

Neuro-evolutionary methods are effective and scalable.

02

OpenAI-ES outperforms or matches other algorithms.

03

Reward functions differ in effectiveness between RL and evolutionary strategies.

Abstract

We analyze the efficacy of modern neuro-evolutionary strategies for continuous control optimization. Overall, the results collected on a wide variety of qualitatively different benchmark problems indicate that these methods are generally effective and scale well with respect to the number of parameters and the complexity of the problem. Moreover, they are relatively robust with respect to the setting of hyper-parameters. The comparison of the most promising methods indicates that the OpenAI-ES algorithm outperforms or equals the other algorithms on all considered problems. Moreover, we demonstrate how the reward functions optimized for reinforcement learning methods are not necessarily effective for evolutionary strategies and vice versa. This finding can lead to reconsideration of the relative efficacy of the two classes of algorithm since it implies that the comparisons performed to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

PaoloP84/EfficacyModernES
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.