Towards Causal Credit Assignment
M\'aty\'as Schubert

TL;DR
This paper investigates Hindsight Credit Assignment in Reinforcement Learning, demonstrating its benefits and proposing a causal-structure-based variant that improves efficiency and performance on various tasks.
Contribution
It empirically analyzes Hindsight Credit Assignment, introduces a causal-structure-based modification, and shows its improved efficiency and effectiveness in RL tasks.
Findings
Hindsight Credit Assignment has notable benefits in RL.
A causal-structure-based variant improves efficiency.
The modified method outperforms baseline on multiple tasks.
Abstract
Adequately assigning credit to actions for future outcomes based on their contributions is a long-standing open challenge in Reinforcement Learning. The assumptions of the most commonly used credit assignment method are disadvantageous in tasks where the effects of decisions are not immediately evident. Furthermore, this method can only evaluate actions that have been selected by the agent, making it highly inefficient. Still, no alternative methods have been widely adopted in the field. Hindsight Credit Assignment is a promising, but still unexplored candidate, which aims to solve the problems of both long-term and counterfactual credit assignment. In this thesis, we empirically investigate Hindsight Credit Assignment to identify its main benefits, and key points to improve. Then, we apply it to factored state representations, and in particular to state representations based on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI)
