Distributional Reinforcement Learning with Unconstrained Monotonic   Neural Networks

Thibaut Th\'eate; Antoine Wehenkel; Adrien Bolland; Gilles Louppe and; Damien Ernst

arXiv:2106.03228·cs.LG·March 20, 2023

Distributional Reinforcement Learning with Unconstrained Monotonic Neural Networks

Thibaut Th\'eate, Antoine Wehenkel, Adrien Bolland, Gilles Louppe and, Damien Ernst

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel distributional RL algorithm using unconstrained monotonic neural networks that can learn multiple distribution representations and compares different probability metrics.

Contribution

It presents the first distributional RL method supporting learning of PDF, CDF, and QF representations using UMNNs, decoupling distribution representation from the metric.

Findings

01

UMDQN supports learning three distribution representations.

02

Comparison of KL, Cramer, and Wasserstein metrics reveals their strengths and weaknesses.

03

Identifies limitations of the Wasserstein distance in distributional RL.

Abstract

The distributional reinforcement learning (RL) approach advocates for representing the complete probability distribution of the random return instead of only modelling its expectation. A distributional RL algorithm may be characterised by two main components, namely the representation of the distribution together with its parameterisation and the probability metric defining the loss. The present research work considers the unconstrained monotonic neural network (UMNN) architecture, a universal approximator of continuous monotonic functions which is particularly well suited for modelling different representations of a distribution. This property enables the efficient decoupling of the effect of the function approximator class from that of the probability metric. The research paper firstly introduces a methodology for learning different representations of the random return distribution…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ThibautTheate/Unconstrained-Monotonic-Deep-Q-Network-algorithm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImbalanced Data Classification Techniques