PCGRL+: Scaling, Control and Generalization in Reinforcement Learning   Level Generators

Sam Earle; Zehua Jiang; Julian Togelius

arXiv:2408.12525·cs.LG·August 23, 2024

PCGRL+: Scaling, Control and Generalization in Reinforcement Learning Level Generators

Sam Earle, Zehua Jiang, Julian Togelius

PDF

Open Access

TL;DR

This paper enhances PCGRL by implementing GPU-accelerated environments in Jax, enabling faster training, larger scale level generation, and improved generalization, thus making PCGRL more practical for complex game design tasks.

Contribution

It introduces a GPU-based Jax implementation of PCGRL environments, allowing scalable, faster training and new control mechanisms for level generation.

Findings

01

Faster training speeds due to GPU parallelization.

02

Models trained for over 1 billion timesteps show robust behavior.

03

Partial observation strategies improve generalization to larger levels.

Abstract

Procedural Content Generation via Reinforcement Learning (PCGRL) has been introduced as a means by which controllable designer agents can be trained based only on a set of computable metrics acting as a proxy for the level's quality and key characteristics. While PCGRL offers a unique set of affordances for game designers, it is constrained by the compute-intensive process of training RL agents, and has so far been limited to generating relatively small levels. To address this issue of scale, we implement several PCGRL environments in Jax so that all aspects of learning and simulation happen in parallel on the GPU, resulting in faster environment simulation; removing the CPU-GPU transfer of information bottleneck during RL training; and ultimately resulting in significantly improved training speed. We replicate several key results from prior works in this new framework, letting models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsSparse Evolutionary Training