Generalization and Regularization in DQN

Jesse Farebrother; Marlos C. Machado; Michael Bowling

arXiv:1810.00123·cs.LG·January 22, 2020·100 cites

Generalization and Regularization in DQN

Jesse Farebrother, Marlos C. Machado, Michael Bowling

PDF

Open Access 1 Repo

TL;DR

This paper evaluates the generalization of DQN in Atari games, revealing overspecialization issues and demonstrating that regularization techniques like dropout and L2 can enhance feature generality and transferability.

Contribution

It introduces a protocol for assessing RL generalization and systematically studies how regularization improves DQN's ability to learn transferable, general features.

Findings

01

DQN tends to overspecialize to training environments.

02

Regularization methods improve DQN's generalization capabilities.

03

Regularized DQN learns features that transfer better to similar tasks.

Abstract

Deep reinforcement learning algorithms have shown an impressive ability to learn complex control policies in high-dimensional tasks. However, despite the ever-increasing performance on popular benchmarks, policies learned by deep reinforcement learning algorithms can struggle to generalize when evaluated in remarkably similar environments. In this paper we propose a protocol to evaluate generalization in reinforcement learning through different modes of Atari 2600 games. With that protocol we assess the generalization capabilities of DQN, one of the most traditional deep reinforcement learning algorithms, and we provide evidence suggesting that DQN overspecializes to the training environment. We then comprehensively evaluate the impact of dropout and $ℓ_{2}$ regularization, as well as the impact of reusing learned representations to improve the generalization capabilities of DQN.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jessefarebro/dqn-ale
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Adaptive Dynamic Programming Control

MethodsQ-Learning · Dense Connections · Convolution · Dropout · Deep Q-Network