Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis, Antonoglou, Daan Wierstra, Martin Riedmiller

TL;DR
This paper introduces a deep convolutional neural network trained with reinforcement learning to play Atari games directly from raw pixel input, achieving superhuman performance on several titles.
Contribution
It is the first to successfully learn control policies directly from high-dimensional sensory input using deep reinforcement learning without task-specific adjustments.
Findings
Outperforms previous methods on six Atari games
Surpasses human expert performance on three games
Demonstrates effectiveness of deep learning in reinforcement learning
Abstract
We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no adjustment of the architecture or learning algorithm. We find that it outperforms all previous approaches on six of the games and surpasses a human expert on three of them.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
What is Q-Learning (back to basics)· youtube
[Classic] Playing Atari with Deep Reinforcement Learning (Paper Explained)· youtube
DQN - Playing Atari with Deep Reinforcement Learning | RL Paper Explained· youtube
Taxonomy
TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Advanced Bandit Algorithms Research
MethodsEpsilon Greedy Exploration · Q-Learning · Experience Replay · Dense Connections · Convolution · Deep Q-Network
