A Deep Learning Approach for Joint Video Frame and Reward Prediction in Atari Games
Felix Leibfried, Nate Kushman, Katja Hofmann

TL;DR
This paper extends a deep neural network for Atari video frame prediction to jointly predict rewards, enabling more data-efficient model-based reinforcement learning in high-dimensional visual environments.
Contribution
It introduces a joint learning framework for simultaneous video frame and reward prediction in Atari games using deep neural networks.
Findings
Accurate reward prediction up to 200 frames in Atari games
Joint optimization improves prediction accuracy
Demonstrates potential for model-based RL in complex environments
Abstract
Reinforcement learning is concerned with identifying reward-maximizing behaviour policies in environments that are initially unknown. State-of-the-art reinforcement learning approaches, such as deep Q-networks, are model-free and learn to act effectively across a wide range of environments such as Atari games, but require huge amounts of data. Model-based techniques are more data-efficient, but need to acquire explicit knowledge about the environment. In this paper, we take a step towards using model-based techniques in environments with a high-dimensional visual state space by demonstrating that it is possible to learn system dynamics and the reward structure jointly. Our contribution is to extend a recently developed deep neural network for video frame prediction in Atari games to enable reward prediction as well. To this end, we phrase a joint optimization problem for minimizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Artificial Intelligence in Games
