Improving the Diversity of Bootstrapped DQN by Replacing Priors With Noise
Li Meng, Morten Goodwin, Anis Yazidi, Paal Engelstad

TL;DR
This paper enhances Bootstrapped Deep Q-Learning by replacing priors with Gaussian noise, leading to increased diversity and significantly improved performance on Atari benchmarks.
Contribution
It introduces a novel approach of substituting priors with Gaussian noise to boost diversity and performance in Bootstrapped DQN.
Findings
Higher evaluation scores on Atari games
Increased diversity improves learning performance
Noise replacement outperforms prior-based methods
Abstract
Q-learning is one of the most well-known Reinforcement Learning algorithms. There have been tremendous efforts to develop this algorithm using neural networks. Bootstrapped Deep Q-Learning Network is amongst them. It utilizes multiple neural network heads to introduce diversity into Q-learning. Diversity can sometimes be viewed as the amount of reasonable moves an agent can take at a given state, analogous to the definition of the exploration ratio in RL. Thus, the performance of Bootstrapped Deep Q-Learning Network is deeply connected with the level of diversity within the algorithm. In the original research, it was pointed out that a random prior could improve the performance of the model. In this article, we further explore the possibility of replacing priors with noise and sample the noise from a Gaussian distribution to introduce more diversity into this algorithm. We conduct our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsQ-Learning
