Faster Deep Q-learning using Neural Episodic Control
Daichi Nishio, Satoshi Yamane

TL;DR
This paper introduces NEC2DQN, a hybrid approach combining Neural Episodic Control with Deep Q-Networks to accelerate learning speed in deep reinforcement learning, demonstrated on Pong.
Contribution
It presents NEC2DQN, a novel method that enhances sample efficiency and learning speed by leveraging NEC at the start of training.
Findings
NEC2DQN learns faster than Double DQN in Pong.
NEC2DQN outperforms N-step DQN in early learning stages.
The approach improves sample efficiency in deep reinforcement learning.
Abstract
The research on deep reinforcement learning which estimates Q-value by deep learning has been attracted the interest of researchers recently. In deep reinforcement learning, it is important to efficiently learn the experiences that an agent has collected by exploring environment. We propose NEC2DQN that improves learning speed of a poor sample efficiency algorithm such as DQN by using good one such as NEC at the beginning of learning. We show it is able to learn faster than Double DQN or N-step DQN in the experiments of Pong.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Neural Networks and Applications · Data Stream Mining Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Experience Replay · Double Q-learning · Q-Learning · Double DQN · Dense Connections · Convolution · Deep Q-Network
