GAN Q-learning

Thang Doan; Bogdan Mazoure; Clare Lyle

arXiv:1805.04874·stat.ML·July 24, 2018·1 cites

GAN Q-learning

Thang Doan, Bogdan Mazoure, Clare Lyle

PDF

Open Access 1 Repo

TL;DR

This paper introduces GAN Q-learning, a novel distributional reinforcement learning method utilizing GANs, demonstrating its effectiveness in simple environments and OpenAI Gym, offering a flexible deep learning-based alternative.

Contribution

The paper proposes GAN Q-learning, integrating GANs into distributional RL, and analyzes its performance, providing a new approach for complex MDPs with deep learning.

Findings

01

Effective in simple tabular environments

02

Performs well in OpenAI Gym tasks

03

Offers a flexible deep learning alternative

Abstract

Distributional reinforcement learning (distributional RL) has seen empirical success in complex Markov Decision Processes (MDPs) in the setting of nonlinear function approximation. However, there are many different ways in which one can leverage the distributional approach to reinforcement learning. In this paper, we propose GAN Q-learning, a novel distributional RL method based on generative adversarial networks (GANs) and analyze its performance in simple tabular environments, as well as OpenAI Gym. We empirically show that our algorithm leverages the flexibility and blackbox approach of deep learning models while providing a viable alternative to traditional methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

daggertye/GAN-Q-Learning
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications

MethodsConvolution · Dogecoin Customer Service Number +1-833-534-1729