Towards Causal Credit Assignment

Mátyás Schubert

arXiv:2212.11636·cs.LG·May 18, 2023

TL;DR

This thesis empirically investigates Hindsight Credit Assignment (HCA) in Reinforcement Learning, identifies its main benefits and the points where it can be improved, and proposes a variant based on causal structure that improves efficiency and performance on several tasks.

Contribution

It empirically analyzes Hindsight Credit Assignment, introduces a causal-structure-based modification, and shows its improved efficiency and effectiveness in RL tasks.

Findings

01 Hindsight Credit Assignment has notable benefits in RL.

02 A causal-structure-based variant improves efficiency.

03 The modified method outperforms the baseline on multiple tasks.

Abstract

Adequately assigning credit to actions for future outcomes based on their contributions is a long-standing open challenge in Reinforcement Learning. The assumptions of the most commonly used credit assignment method are disadvantageous in tasks where the effects of decisions are not immediately evident. Furthermore, this method can only evaluate actions that have been selected by the agent, making it highly inefficient. Still, no alternative methods have been widely adopted in the field. Hindsight Credit Assignment is a promising but still unexplored candidate, which aims to solve the problems of both long-term and counterfactual credit assignment. In this thesis, we empirically investigate Hindsight Credit Assignment to identify its main benefits and key points to improve. Then, we apply it to factored state representations, and in particular to state representations based on the…

Code & Models

Repositories

matyasch/factored-hca (PyTorch, official)

Taxonomy

Topics: Reinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI)