Faster Deep Q-learning using Neural Episodic Control

Daichi Nishio; Satoshi Yamane

arXiv:1801.01968·cs.LG·June 5, 2018·1 cites

Faster Deep Q-learning using Neural Episodic Control

Daichi Nishio, Satoshi Yamane

PDF

Open Access

TL;DR

This paper introduces NEC2DQN, a hybrid approach combining Neural Episodic Control with Deep Q-Networks to accelerate learning speed in deep reinforcement learning, demonstrated on Pong.

Contribution

It presents NEC2DQN, a novel method that enhances sample efficiency and learning speed by leveraging NEC at the start of training.

Findings

01

NEC2DQN learns faster than Double DQN in Pong.

02

NEC2DQN outperforms N-step DQN in early learning stages.

03

The approach improves sample efficiency in deep reinforcement learning.

Abstract

The research on deep reinforcement learning which estimates Q-value by deep learning has been attracted the interest of researchers recently. In deep reinforcement learning, it is important to efficiently learn the experiences that an agent has collected by exploring environment. We propose NEC2DQN that improves learning speed of a poor sample efficiency algorithm such as DQN by using good one such as NEC at the beginning of learning. We show it is able to learn faster than Double DQN or N-step DQN in the experiments of Pong.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Neural Networks and Applications · Data Stream Mining Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Experience Replay · Double Q-learning · Q-Learning · Double DQN · Dense Connections · Convolution · Deep Q-Network