Illuminating Generalization in Deep Reinforcement Learning through   Procedural Level Generation

Niels Justesen; Ruben Rodriguez Torrado; Philip Bontrager; Ahmed; Khalifa; Julian Togelius; Sebastian Risi

arXiv:1806.10729·cs.LG·November 30, 2018·119 cites

Illuminating Generalization in Deep Reinforcement Learning through Procedural Level Generation

Niels Justesen, Ruben Rodriguez Torrado, Philip Bontrager, Ahmed, Khalifa, Julian Togelius, Sebastian Risi

PDF

Open Access 1 Repo

TL;DR

This paper investigates how procedural level generation during training enhances the generalization ability of deep reinforcement learning agents across different levels, including human-designed ones, by manipulating difficulty and analyzing generator distributions.

Contribution

It demonstrates that procedural level generation can improve generalization in deep RL and introduces analysis methods for generator diversity and similarity to human levels.

Findings

01

Procedural generation enables generalization within the same distribution.

02

Manipulating level difficulty improves data efficiency.

03

Generator design influences generalization to human-designed levels.

Abstract

Deep reinforcement learning (RL) has shown impressive results in a variety of domains, learning directly from high-dimensional sensory streams. However, when neural networks are trained in a fixed environment, such as a single level in a video game, they will usually overfit and fail to generalize to new levels. When RL models overfit, even slight modifications to the environment can result in poor agent performance. This paper explores how procedurally generated levels during training can increase generality. We show that for some games procedural level generation enables generalization to new levels within the same distribution. Additionally, it is possible to achieve better performance with less data by manipulating the difficulty of the levels in response to the performance of the agent. The generality of the learned behaviors is also evaluated on a set of human-designed levels. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

njustesen/a2c_gvgai
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Robot Manipulation and Learning