Playing Atari with Deep Reinforcement Learning

Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Alex Graves; Ioannis; Antonoglou; Daan Wierstra; Martin Riedmiller

arXiv:1312.5602·cs.LG·December 20, 2013·5.1k cites

Playing Atari with Deep Reinforcement Learning

Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis, Antonoglou, Daan Wierstra, Martin Riedmiller

PDF

Open Access 5 Repos 2 Models 3 Videos

TL;DR

This paper introduces a deep convolutional neural network trained with reinforcement learning to play Atari games directly from raw pixel input, achieving superhuman performance on several titles.

Contribution

It is the first to successfully learn control policies directly from high-dimensional sensory input using deep reinforcement learning without task-specific adjustments.

Findings

01

Outperforms previous methods on six Atari games

02

Surpasses human expert performance on three games

03

Demonstrates effectiveness of deep learning in reinforcement learning

Abstract

We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no adjustment of the architecture or learning algorithm. We find that it outperforms all previous approaches on six of the games and surpasses a human expert on three of them.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

What is Q-Learning (back to basics)· youtube

[Classic] Playing Atari with Deep Reinforcement Learning (Paper Explained)· youtube

DQN - Playing Atari with Deep Reinforcement Learning | RL Paper Explained· youtube

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Advanced Bandit Algorithms Research

MethodsEpsilon Greedy Exploration · Q-Learning · Experience Replay · Dense Connections · Convolution · Deep Q-Network