An adaptive synchronization approach for weights of deep reinforcement learning
S. Amirreza Badran, Mansoor Rezghi

TL;DR
This paper introduces an adaptive synchronization method for deep reinforcement learning networks, improving upon fixed-step approaches by considering agent behavior, leading to better learning performance in DQN variants.
Contribution
It proposes a novel adaptive weight synchronization technique based on agent behavior, enhancing DQN and Rainbow methods for more effective learning.
Findings
Improved performance on benchmark games.
Enhanced stability of learning process.
Better sample quality in replay memory.
Abstract
Deep Q-Networks (DQN) is one of the most well-known methods of deep reinforcement learning, which uses deep learning to approximate the action-value function. Solving numerous Deep reinforcement learning challenges such as moving targets problem and the correlation between samples are the main advantages of this model. Although there have been various extensions of DQN in recent years, they all use a similar method to DQN to overcome the problem of moving targets. Despite the advantages mentioned, synchronizing the network weight in a fixed step size, independent of the agent's behavior, may in some cases cause the loss of some properly learned networks. These lost networks may lead to states with more rewards, hence better samples stored in the replay memory for future training. In this paper, we address this problem from the DQN family and provide an adaptive approach for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Neural dynamics and brain function
MethodsQ-Learning · Convolution · Dense Connections · Deep Q-Network
