A Comparative Analysis of Expected and Distributional Reinforcement   Learning

Clare Lyle; Pablo Samuel Castro; Marc G. Bellemare

arXiv:1901.11084·cs.LG·February 22, 2019·21 cites

A Comparative Analysis of Expected and Distributional Reinforcement Learning

Clare Lyle, Pablo Samuel Castro, Marc G. Bellemare

PDF

Open Access

TL;DR

This paper investigates why distributional reinforcement learning often outperforms expected RL by analyzing theoretical differences in various settings and empirically testing their effects in control tasks.

Contribution

It provides the first theoretical analysis of distributional RL's behavior in different approximation settings and compares it empirically to expected RL in non-linear control tasks.

Findings

01

Distributional RL behaves like expected RL in many tabular and linear cases.

02

In some cases, distributional RL can harm performance if behaviors differ.

03

Empirical results suggest specific scenarios where distributional RL offers advantages.

Abstract

Since their introduction a year ago, distributional approaches to reinforcement learning (distributional RL) have produced strong results relative to the standard approach which models expected values (expected RL). However, aside from convergence guarantees, there have been few theoretical results investigating the reasons behind the improvements distributional RL provides. In this paper we begin the investigation into this fundamental question by analyzing the differences in the tabular, linear approximation, and non-linear approximation settings. We prove that in many realizations of the tabular and linear approximation settings, distributional RL behaves exactly the same as expected RL. In cases where the two methods behave differently, distributional RL can in fact hurt performance when it does not induce identical behaviour. We then continue with an empirical analysis comparing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Advanced Multi-Objective Optimization Algorithms