Investigating Recurrence and Eligibility Traces in Deep Q-Networks

Jean Harb; Doina Precup

arXiv:1704.05495·cs.AI·April 20, 2017·6 cites

Investigating Recurrence and Eligibility Traces in Deep Q-Networks

Jean Harb, Doina Precup

PDF

Open Access

TL;DR

This paper explores how eligibility traces combined with recurrent neural networks can improve training efficiency in reinforcement learning for Atari games, emphasizing the role of optimization methods.

Contribution

It demonstrates the benefits of integrating eligibility traces with recurrent networks in Atari, highlighting their impact on training speed and performance.

Findings

01

Eligibility traces speed up training in some Atari games.

02

Recurrent networks combined with eligibility traces improve learning efficiency.

03

Optimization methods significantly influence training outcomes.

Abstract

Eligibility traces in reinforcement learning are used as a bias-variance trade-off and can often speed up training time by propagating knowledge back over time-steps in a single update. We investigate the use of eligibility traces in combination with recurrent networks in the Atari domain. We illustrate the benefits of both recurrent nets and eligibility traces in some Atari games, and highlight also the importance of the optimization used in the training.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Evolutionary Algorithms and Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings