Improving the Diversity of Bootstrapped DQN by Replacing Priors With Noise

Li Meng; Morten Goodwin; Anis Yazidi; Paal Engelstad

arXiv:2203.01004·cs.LG·June 25, 2024

TL;DR

This paper enhances Bootstrapped Deep Q-Learning by replacing priors with Gaussian noise, leading to increased diversity and significantly improved performance on Atari benchmarks.

Contribution

It introduces a novel approach of substituting priors with Gaussian noise to boost diversity and performance in Bootstrapped DQN.

Findings

1. Higher evaluation scores on Atari games
2. Increased diversity improves learning performance
3. Noise replacement outperforms prior-based methods

Abstract

Q-learning is one of the most well-known Reinforcement Learning algorithms. There have been tremendous efforts to develop this algorithm using neural networks. Bootstrapped Deep Q-Learning Network is amongst them. It utilizes multiple neural network heads to introduce diversity into Q-learning. Diversity can sometimes be viewed as the number of reasonable moves an agent can take at a given state, analogous to the definition of the exploration ratio in RL. Thus, the performance of Bootstrapped Deep Q-Learning Network is deeply connected with the level of diversity within the algorithm. In the original research, it was pointed out that a random prior could improve the performance of the model. In this article, we further explore the possibility of replacing priors with noise and sample the noise from a Gaussian distribution to introduce more diversity into this algorithm. We conduct our…
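
To make the idea in the abstract concrete, the following is a minimal sketch of a multi-head Q-value estimate in which each head's output is perturbed by Gaussian noise. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not say where the noise is injected or how it is scaled, so the sketch assumes it is added directly to each head's Q-values. The linear heads and the names `make_heads`, `head_q_values`, `noisy_q_values`, `greedy_actions`, and `noise_std` are all hypothetical.

```python
import random


def make_heads(rng, n_heads, n_features, n_actions):
    """Build one small random linear Q-head per ensemble member (toy stand-in for a network head)."""
    return [
        [[rng.gauss(0.0, 0.1) for _ in range(n_actions)] for _ in range(n_features)]
        for _ in range(n_heads)
    ]


def head_q_values(head, state):
    """Q(s, a) for one linear head: sum over features f of state[f] * head[f][a]."""
    n_actions = len(head[0])
    return [
        sum(state[f] * head[f][a] for f in range(len(state)))
        for a in range(n_actions)
    ]


def noisy_q_values(heads, state, rng, noise_std):
    """Assumption: independent Gaussian noise is added to every head's Q-values."""
    return [
        [q + rng.gauss(0.0, noise_std) for q in head_q_values(head, state)]
        for head in heads
    ]


def greedy_actions(q_per_head):
    """Greedy action index for each head; more distinct actions means more diversity."""
    return [max(range(len(q)), key=q.__getitem__) for q in q_per_head]


if __name__ == "__main__":
    rng = random.Random(0)
    heads = make_heads(rng, n_heads=5, n_features=4, n_actions=3)
    state = [1.0, 0.5, -0.5, 2.0]
    q = noisy_q_values(heads, state, rng, noise_std=0.5)
    print("greedy action per head:", greedy_actions(q))
```

With `noise_std=0.0` the sketch reduces to a plain ensemble of heads, so comparing the set of greedy actions at different noise levels gives a rough feel for the diversity notion described in the abstract.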

Taxonomy

Methods: Q-Learning