ViZDoom: DRQN with Prioritized Experience Replay, Double-Q Learning, &   Snapshot Ensembling

Christopher Schulze; Marcus Schulze

arXiv:1801.01000·cs.AI·January 4, 2018

ViZDoom: DRQN with Prioritized Experience Replay, Double-Q Learning, & Snapshot Ensembling

Christopher Schulze, Marcus Schulze

PDF

Open Access

TL;DR

This paper evaluates advanced reinforcement learning techniques like double-Q learning, prioritized experience replay, and snapshot ensembling within the ViZDoom environment, demonstrating improved performance over built-in AI and analyzing the effects of these methods.

Contribution

It introduces the application of double-Q learning, prioritized experience replay, and snapshot ensembling to ViZDoom, highlighting their effects and benefits in a complex first-person shooter environment.

Findings

01

Double-Q learning stabilizes training.

02

Prioritized experience replay accelerates early learning.

03

Snapshot ensembling enhances agent performance.

Abstract

ViZDoom is a robust, first-person shooter reinforcement learning environment, characterized by a significant degree of latent state information. In this paper, double-Q learning and prioritized experience replay methods are tested under a certain ViZDoom combat scenario using a competitive deep recurrent Q-network (DRQN) architecture. In addition, an ensembling technique known as snapshot ensembling is employed using a specific annealed learning rate to observe differences in ensembling efficacy under these two methods. Annealed learning rates are important in general to the training of deep neural network models, as they shake up the status-quo and counter a model's tending towards local optima. While both variants show performance exceeding those of built-in AI agents of the game, the known stabilizing effects of double-Q learning are illustrated, and priority experience replay is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsPrioritized Experience Replay · Experience Replay