State of the Art Control of Atari Games Using Shallow Reinforcement   Learning

Yitao Liang; Marlos C. Machado; Erik Talvitie; Michael Bowling

arXiv:1512.01563·cs.LG·April 25, 2016·34 cites

State of the Art Control of Atari Games Using Shallow Reinforcement Learning

Yitao Liang, Marlos C. Machado, Erik Talvitie, Michael Bowling

PDF

Open Access 1 Repo

TL;DR

This paper analyzes the principles behind Deep Q-Networks' success in Atari games, proposing a simple, effective representation that rivals DQN and offers a benchmark for future research.

Contribution

It introduces a linear representation capturing key features of DQN, reducing the need for game-specific learning and providing a reproducible benchmark.

Findings

01

The linear representation achieves performance comparable to DQN.

02

It offers insights into DQN's strengths and weaknesses.

03

Provides a generic, practical feature set for the ALE.

Abstract

The recently introduced Deep Q-Networks (DQN) algorithm has gained attention as one of the first successful combinations of deep neural networks and reinforcement learning. Its promise was demonstrated in the Arcade Learning Environment (ALE), a challenging framework composed of dozens of Atari 2600 games used to evaluate general competency in AI. It achieved dramatically better results than earlier approaches, showing that its ability to learn good representations is quite robust and general. This paper attempts to understand the principles that underlie DQN's impressive performance and to better contextualize its success. We systematically evaluate the importance of key representational biases encoded by DQN's network by proposing simple linear representations that make use of these concepts. Incorporating these characteristics, we obtain a computationally practical feature set that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mcmachado/b-pro
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Advanced Bandit Algorithms Research

MethodsQ-Learning · Dense Connections · Convolution · Deep Q-Network