An adaptive synchronization approach for weights of deep reinforcement   learning

S. Amirreza Badran; Mansoor Rezghi

arXiv:2008.06973·cs.LG·August 18, 2020

An adaptive synchronization approach for weights of deep reinforcement learning

S. Amirreza Badran, Mansoor Rezghi

PDF

Open Access

TL;DR

This paper introduces an adaptive synchronization method for deep reinforcement learning networks, improving upon fixed-step approaches by considering agent behavior, leading to better learning performance in DQN variants.

Contribution

It proposes a novel adaptive weight synchronization technique based on agent behavior, enhancing DQN and Rainbow methods for more effective learning.

Findings

01

Improved performance on benchmark games.

02

Enhanced stability of learning process.

03

Better sample quality in replay memory.

Abstract

Deep Q-Networks (DQN) is one of the most well-known methods of deep reinforcement learning, which uses deep learning to approximate the action-value function. Solving numerous Deep reinforcement learning challenges such as moving targets problem and the correlation between samples are the main advantages of this model. Although there have been various extensions of DQN in recent years, they all use a similar method to DQN to overcome the problem of moving targets. Despite the advantages mentioned, synchronizing the network weight in a fixed step size, independent of the agent's behavior, may in some cases cause the loss of some properly learned networks. These lost networks may lead to states with more rewards, hence better samples stored in the replay memory for future training. In this paper, we address this problem from the DQN family and provide an adaptive approach for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Neural dynamics and brain function

MethodsQ-Learning · Convolution · Dense Connections · Deep Q-Network