Implementing the Deep Q-Network

Melrose Roderick; James MacGlashan; Stefanie Tellex

arXiv:1711.07478·cs.LG·November 22, 2017·57 cites

Implementing the Deep Q-Network

Melrose Roderick, James MacGlashan, Stefanie Tellex

PDF

Open Access 1 Repo

TL;DR

This paper reproduces and analyzes the implementation of the Deep Q-Network, highlighting key details for replication, discussing performance improvements, and providing a versatile implementation for various domains.

Contribution

It offers detailed insights into replicating DQN results, clarifies previously under-documented implementation details, and presents a flexible, performance-optimized implementation.

Findings

01

Reproduced DQN results with high fidelity

02

Identified critical implementation details for replication

03

Enhanced computational performance of DQN implementations

Abstract

The Deep Q-Network proposed by Mnih et al. [2015] has become a benchmark and building point for much deep reinforcement learning research. However, replicating results for complex systems is often challenging since original scientific publications are not always able to describe in detail every important parameter setting and software engineering solution. In this paper, we present results from our work reproducing the results of the DQN paper. We highlight key areas in the implementation that were not covered in great detail in the original paper to make it easier for researchers to replicate these results, including termination conditions and gradient descent algorithms. Finally, we discuss methods for improving the computational performance and provide our own implementation that is designed to work with a range of domains, and not just the original Arcade Learning Environment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xgfelicia/Reinforcement-Learning
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBig Data and Business Intelligence · Cognitive Science and Mapping · Bayesian Modeling and Causal Inference

MethodsQ-Learning · Dense Connections · Convolution · Deep Q-Network