Back to Basics: Benchmarking Canonical Evolution Strategies for Playing   Atari

Patryk Chrabaszcz; Ilya Loshchilov; Frank Hutter

arXiv:1802.08842·cs.NE·February 27, 2018

Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari

Patryk Chrabaszcz, Ilya Loshchilov, Frank Hutter

PDF

1 Repo 1 Video

TL;DR

This paper shows that a simple canonical Evolution Strategies algorithm can match or outperform specialized methods in Atari game benchmarks, highlighting the potential for further improvements by integrating ES and RL techniques.

Contribution

Demonstrates that basic canonical ES can achieve competitive performance on Atari benchmarks, challenging the notion that complex ES variants are necessary.

Findings

01

Basic ES matches or exceeds specialized ES performance.

02

ES algorithms exploit environment differently than RL.

03

Potential for combining ES and RL for improved results.

Abstract

Evolution Strategies (ES) have recently been demonstrated to be a viable alternative to reinforcement learning (RL) algorithms on a set of challenging deep RL problems, including Atari games and MuJoCo humanoid locomotion benchmarks. While the ES algorithms in that work belonged to the specialized class of natural evolution strategies (which resemble approximate gradient RL algorithms, such as REINFORCE), we demonstrate that even a very basic canonical ES algorithm can achieve the same or even better performance. This success of a basic ES algorithm suggests that the state-of-the-art can be advanced further by integrating the many advances made in the field of ES in the last decades. We also demonstrate qualitatively that ES algorithms have very different performance characteristics than traditional RL algorithms: on some games, they learn to exploit the environment and perform much…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

PatrykChrabaszcz/Canonical_ES_Atari
tfOfficial

Videos

This Evolving AI Finds Bugs in Games | Two Minute Papers #250· youtube