Reinforcement Learning of Causal Variables Using Mediation Analysis
Tue Herlau, Rasmus Larsen

TL;DR
This paper introduces a reinforcement learning method that constructs causal graphs using mediation analysis, enabling agents to learn environment causality and improve policy performance without strict state restrictions.
Contribution
It presents a novel approach combining causal graph learning with reinforcement learning through mediation analysis and generalized Bellman's equations.
Findings
Learned plausible causal graphs in grid-world environments.
Causally informed policies improved performance.
Introduced a new method for causal acquisition via cost-function minimization.
Abstract
Many open problems in machine learning are intrinsically related to causality, however, the use of causal analysis in machine learning is still in its early stage. Within a general reinforcement learning setting, we consider the problem of building a general reinforcement learning agent which uses experience to construct a causal graph of the environment, and use this graph to inform its policy. Our approach has three characteristics: First, we learn a simple, coarse-grained causal graph, in which the variables reflect states at many time instances, and the interventions happen at the level of policies, rather than individual actions. Secondly, we use mediation analysis to obtain an optimization target. By minimizing this target, we define the causal variables. Thirdly, our approach relies on estimating conditional expectations rather the familiar expected return from reinforcement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing
