Assessing Evolutionary Terrain Generation Methods for Curriculum Reinforcement Learning
David Howard, Josh Kannemeyer, Davide Dolcetti, Humphrey Munn and, Nicole Robinson

TL;DR
This paper compares different terrain generation methods, including noise functions and indirect encodings like CPPN and GAN, to evaluate their impact on curriculum reinforcement learning for humanoid robots.
Contribution
It provides a systematic comparison of terrain generators and introduces feature descriptors for terrain assessment in curriculum learning.
Findings
Different generators exhibit distinct effects on learning performance.
Feature descriptors can effectively characterize terrain meshes for curriculum design.
Results guide the choice of terrain generators in reinforcement learning applications.
Abstract
Curriculum learning allows complex tasks to be mastered via incremental progression over `stepping stone' goals towards a final desired behaviour. Typical implementations learn locomotion policies for challenging environments through gradual complexification of a terrain mesh generated through a parameterised noise function. To date, researchers have predominantly generated terrains from a limited range of noise functions, and the effect of the generator on the learning process is underrepresented in the literature. We compare popular noise-based terrain generators to two indirect encodings, CPPN and GAN. To allow direct comparison between both direct and indirect representations, we assess the impact of a range of representation-agnostic MAP-Elites feature descriptors that compute metrics directly from the generated terrain meshes. Next, performance and coverage are assessed when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Reinforcement Learning in Robotics · Robotic Locomotion and Control
MethodsEntropy Regularization · Proximal Policy Optimization
