A Survey of Temporal Credit Assignment in Deep Reinforcement Learning

Eduardo Pignatelli; Johan Ferret; Matthieu Geist; Thomas Mesnard; Hado; van Hasselt; Olivier Pietquin; Laura Toni

arXiv:2312.01072·cs.LG·July 8, 2024·1 cites

A Survey of Temporal Credit Assignment in Deep Reinforcement Learning

Eduardo Pignatelli, Johan Ferret, Matthieu Geist, Thomas Mesnard, Hado, van Hasselt, Olivier Pietquin, Laura Toni

PDF

Open Access

TL;DR

This survey reviews the state of the art in Temporal Credit Assignment in deep reinforcement learning, proposing a unifying formalism, analyzing challenges, and discussing evaluation protocols to advance understanding and research in the field.

Contribution

It introduces a unifying formalism for credit, compares algorithms, and provides a comprehensive overview of challenges and evaluation methods in Temporal Credit Assignment for deep RL.

Findings

01

Proposes a formalism for credit that enables algorithm comparison.

02

Analyzes challenges like delayed effects and lack of influence.

03

Suggests evaluation protocols and future research directions.

Abstract

The Credit Assignment Problem (CAP) refers to the longstanding challenge of Reinforcement Learning (RL) agents to associate actions with their long-term consequences. Solving the CAP is a crucial step towards the successful deployment of RL in the real world since most decision problems provide feedback that is noisy, delayed, and with little or no information about the causes. These conditions make it hard to distinguish serendipitous outcomes from those caused by informed decision-making. However, the mathematical nature of credit and the CAP remains poorly understood and defined. In this survey, we review the state of the art of Temporal Credit Assignment (CA) in deep RL. We propose a unifying formalism for credit that enables equitable comparisons of state-of-the-art algorithms and improves our understanding of the trade-offs between the various methods. We cast the CAP as the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics