Adversarial Reinforcement Learning for Procedural Content Generation
Linus Gissl\'en, Andy Eakins, Camilo Gordillo, Joakim Bergdahl, Konrad, Tollmar

TL;DR
This paper introduces ARLPCG, an adversarial reinforcement learning framework that procedurally generates diverse, challenging, yet solvable environments for training agents, with controllable environment features via auxiliary inputs.
Contribution
ARLPCG is the first to combine adversarial RL with auxiliary inputs for environment generation, improving agent generalization and environment controllability.
Findings
ARLPCG achieves higher solve ratios in 3D game environments.
Auxiliary inputs enable controllable environment generation.
Generated environments are both challenging and solvable.
Abstract
We present a new approach ARLPCG: Adversarial Reinforcement Learning for Procedural Content Generation, which procedurally generates and tests previously unseen environments with an auxiliary input as a control variable. Training RL agents over novel environments is a notoriously difficult task. One popular approach is to procedurally generate different environments to increase the generalizability of the trained agents. ARLPCG instead deploys an adversarial model with one PCG RL agent (called Generator) and one solving RL agent (called Solver). The Generator receives a reward signal based on the Solver's performance, which encourages the environment design to be challenging but not impossible. To further drive diversity and control of the environment generation, we propose using auxiliary inputs for the Generator. The benefit is two-fold: Firstly, the Solver achieves better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Reinforcement Learning in Robotics
