A Deep Learning Approach for Joint Video Frame and Reward Prediction in   Atari Games

Felix Leibfried; Nate Kushman; Katja Hofmann

arXiv:1611.07078·cs.AI·August 18, 2017·23 cites

A Deep Learning Approach for Joint Video Frame and Reward Prediction in Atari Games

Felix Leibfried, Nate Kushman, Katja Hofmann

PDF

Open Access

TL;DR

This paper extends a deep neural network for Atari video frame prediction to jointly predict rewards, enabling more data-efficient model-based reinforcement learning in high-dimensional visual environments.

Contribution

It introduces a joint learning framework for simultaneous video frame and reward prediction in Atari games using deep neural networks.

Findings

01

Accurate reward prediction up to 200 frames in Atari games

02

Joint optimization improves prediction accuracy

03

Demonstrates potential for model-based RL in complex environments

Abstract

Reinforcement learning is concerned with identifying reward-maximizing behaviour policies in environments that are initially unknown. State-of-the-art reinforcement learning approaches, such as deep Q-networks, are model-free and learn to act effectively across a wide range of environments such as Atari games, but require huge amounts of data. Model-based techniques are more data-efficient, but need to acquire explicit knowledge about the environment. In this paper, we take a step towards using model-based techniques in environments with a high-dimensional visual state space by demonstrating that it is possible to learn system dynamics and the reward structure jointly. Our contribution is to extend a recently developed deep neural network for video frame prediction in Atari games to enable reward prediction as well. To this end, we phrase a joint optimization problem for minimizing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Artificial Intelligence in Games