Assessing Evolutionary Terrain Generation Methods for Curriculum   Reinforcement Learning

David Howard; Josh Kannemeyer; Davide Dolcetti; Humphrey Munn and; Nicole Robinson

arXiv:2203.15172·cs.NE·March 30, 2022

Assessing Evolutionary Terrain Generation Methods for Curriculum Reinforcement Learning

David Howard, Josh Kannemeyer, Davide Dolcetti, Humphrey Munn and, Nicole Robinson

PDF

Open Access

TL;DR

This paper compares different terrain generation methods, including noise functions and indirect encodings like CPPN and GAN, to evaluate their impact on curriculum reinforcement learning for humanoid robots.

Contribution

It provides a systematic comparison of terrain generators and introduces feature descriptors for terrain assessment in curriculum learning.

Findings

01

Different generators exhibit distinct effects on learning performance.

02

Feature descriptors can effectively characterize terrain meshes for curriculum design.

03

Results guide the choice of terrain generators in reinforcement learning applications.

Abstract

Curriculum learning allows complex tasks to be mastered via incremental progression over `stepping stone' goals towards a final desired behaviour. Typical implementations learn locomotion policies for challenging environments through gradual complexification of a terrain mesh generated through a parameterised noise function. To date, researchers have predominantly generated terrains from a limited range of noise functions, and the effect of the generator on the learning process is underrepresented in the literature. We compare popular noise-based terrain generators to two indirect encodings, CPPN and GAN. To allow direct comparison between both direct and indirect representations, we assess the impact of a range of representation-agnostic MAP-Elites feature descriptors that compute metrics directly from the generated terrain meshes. Next, performance and coverage are assessed when…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolutionary Algorithms and Applications · Reinforcement Learning in Robotics · Robotic Locomotion and Control

MethodsEntropy Regularization · Proximal Policy Optimization