Unsupervised State Representation Learning in Atari

Ankesh Anand; Evan Racah; Sherjil Ozair; Yoshua Bengio; Marc-Alexandre; C\^ot\'e; R Devon Hjelm

arXiv:1906.08226·cs.LG·November 9, 2020·27 cites

Unsupervised State Representation Learning in Atari

Ankesh Anand, Evan Racah, Sherjil Ozair, Yoshua Bengio, Marc-Alexandre, C\^ot\'e, R Devon Hjelm

PDF

Open Access 5 Repos

TL;DR

This paper presents a novel unsupervised method for learning state representations in Atari games by maximizing mutual information, along with a new benchmark for evaluating how well these representations capture true game state variables.

Contribution

Introduces a mutual information-based approach for unsupervised state representation learning and a new Atari benchmark for evaluating the quality of learned representations.

Findings

01

Our method effectively captures ground truth state variables.

02

The proposed benchmark provides a new standard for evaluation.

03

Compared to other methods, our approach shows competitive performance.

Abstract

State representation learning, or the ability to capture latent generative factors of an environment, is crucial for building intelligent agents that can perform a wide variety of tasks. Learning such representations without supervision from rewards is a challenging open problem. We introduce a method that learns state representations by maximizing mutual information across spatially and temporally distinct features of a neural encoder of the observations. We also introduce a new benchmark based on Atari 2600 games where we evaluate representations based on how well they capture the ground truth state variables. We believe this new framework for evaluating representation learning models will be crucial for future representation learning research. Finally, we compare our technique with other state-of-the-art generative and contrastive representation learning methods. The code associated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Generative Adversarial Networks and Image Synthesis · Artificial Intelligence in Games