Exploring Variational Deep Q Networks

A. H. Bell-Thomas

arXiv:2008.01641·cs.LG·August 5, 2020

TL;DR

This paper analyzes and refines Variational Deep Q Networks, introducing a Double Variational Deep Q Network to enhance stability and robustness in exploration for complex environments, with empirical evaluation and discussion.

Contribution

It provides a detailed analysis and a refined implementation of the Variational Deep Q Network, and introduces the Double Variational Deep Q Network for improved stability in inference-based reinforcement learning.

Findings

01. Double Variational Deep Q Network improves stability
02. Refined implementation enhances exploration efficiency
03. Evaluation shows competitive performance

Abstract

This study provides both analysis and a refined, research-ready implementation of Tang and Kucukelbir's Variational Deep Q Network, a novel approach to maximising the efficiency of exploration in complex learning environments using Variational Bayesian Inference. Alongside reference implementations of both Traditional and Double Deep Q Networks, a small novel contribution is presented: the Double Variational Deep Q Network, which incorporates improvements to increase the stability and robustness of inference-based learning. Finally, the effectiveness of these approaches is evaluated and discussed in the wider context of Bayesian Deep Learning.

Code & Models

Repositories

HarriBellThomas/VDQN (official)

Taxonomy

Topics: Bayesian Modeling and Causal Inference · Domain Adaptation and Few-Shot Learning · Gaussian Processes and Bayesian Inference