Inverse reinforcement learning for video games

Aaron Tucker; Adam Gleave; Stuart Russell

arXiv:1810.10593·cs.LG·October 26, 2018·22 cites

Inverse reinforcement learning for video games

Aaron Tucker, Adam Gleave, Stuart Russell

PDF

Open Access 1 Repo

TL;DR

This paper extends inverse reinforcement learning to high-dimensional video games by developing a CNN-based adversarial IRL method with a novel state embedding, enabling learning from demonstrations in complex environments.

Contribution

The paper introduces a CNN-AIRL framework with a new autoencoder for state representation, improving IRL application to high-dimensional video game environments.

Findings

01

Achieved high performance on the Catcher game.

02

Partially succeeded on the Enduro Atari game.

03

Enhanced sample efficiency with learned state embeddings.

Abstract

Deep reinforcement learning achieves superhuman performance in a range of video game environments, but requires that a designer manually specify a reward function. It is often easier to provide demonstrations of a target behavior than to design a reward function describing that behavior. Inverse reinforcement learning (IRL) algorithms can infer a reward from demonstrations in low-dimensional continuous control environments, but there has been little work on applying IRL to high-dimensional video games. In our CNN-AIRL baseline, we modify the state-of-the-art adversarial IRL (AIRL) algorithm to use CNNs for the generator and discriminator. To stabilize training, we normalize the reward and increase the size of the discriminator training dataset. We additionally learn a low-dimensional state representation using a novel autoencoder architecture tuned for video game environments. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HumanCompatibleAI/atari-irl
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Human Pose and Action Recognition