Distributional Reinforcement Learning with Unconstrained Monotonic Neural Networks
Thibaut Th\'eate, Antoine Wehenkel, Adrien Bolland, Gilles Louppe and, Damien Ernst

TL;DR
This paper introduces a novel distributional RL algorithm using unconstrained monotonic neural networks that can learn multiple distribution representations and compares different probability metrics.
Contribution
It presents the first distributional RL method supporting learning of PDF, CDF, and QF representations using UMNNs, decoupling distribution representation from the metric.
Findings
UMDQN supports learning three distribution representations.
Comparison of KL, Cramer, and Wasserstein metrics reveals their strengths and weaknesses.
Identifies limitations of the Wasserstein distance in distributional RL.
Abstract
The distributional reinforcement learning (RL) approach advocates for representing the complete probability distribution of the random return instead of only modelling its expectation. A distributional RL algorithm may be characterised by two main components, namely the representation of the distribution together with its parameterisation and the probability metric defining the loss. The present research work considers the unconstrained monotonic neural network (UMNN) architecture, a universal approximator of continuous monotonic functions which is particularly well suited for modelling different representations of a distribution. This property enables the efficient decoupling of the effect of the function approximator class from that of the probability metric. The research paper firstly introduces a methodology for learning different representations of the random return distribution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques
