Deep Reinforcement Learning From Raw Pixels in Doom

Danijar Hafner

arXiv:1610.02164·cs.LG·October 10, 2016·2 cites

Deep Reinforcement Learning From Raw Pixels in Doom

Danijar Hafner

PDF

Open Access

TL;DR

This paper explores applying deep reinforcement learning to a complex 3D Doom environment, analyzing challenges, training algorithms, and providing insights into learned behaviors despite limited success.

Contribution

It introduces a new Doom task for RL agents and evaluates DQN and LSTM-A3C algorithms, highlighting challenges and initial behaviors in this complex setting.

Findings

01

Both algorithms learned sensible policies

02

High scores were not achieved with current training

03

Insights into learned behaviors were provided

Abstract

Using current reinforcement learning methods, it has recently become possible to learn to play unknown 3D games from raw pixels. In this work, we study the challenges that arise in such complex environments, and summarize current methods to approach these. We choose a task within the Doom game, that has not been approached yet. The goal for the agent is to fight enemies in a 3D world consisting of five rooms. We train the DQN and LSTM-A3C algorithms on this task. Results show that both algorithms learn sensible policies, but fail to achieve high scores given the amount of training. We provide insights into the learned behavior, which can serve as a valuable starting point for further research in the Doom domain.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Computability, Logic, AI Algorithms · Advanced Bandit Algorithms Research

MethodsQ-Learning · Dense Connections · Convolution · Deep Q-Network