Reinforcement Learning of Causal Variables Using Mediation Analysis

Tue Herlau; Rasmus Larsen

arXiv:2010.15745·cs.LG·May 18, 2022

Reinforcement Learning of Causal Variables Using Mediation Analysis

Tue Herlau, Rasmus Larsen

PDF

Open Access 1 Video

TL;DR

This paper introduces a reinforcement learning method that constructs causal graphs using mediation analysis, enabling agents to learn environment causality and improve policy performance without strict state restrictions.

Contribution

It presents a novel approach combining causal graph learning with reinforcement learning through mediation analysis and generalized Bellman's equations.

Findings

01

Learned plausible causal graphs in grid-world environments.

02

Causally informed policies improved performance.

03

Introduced a new method for causal acquisition via cost-function minimization.

Abstract

Many open problems in machine learning are intrinsically related to causality, however, the use of causal analysis in machine learning is still in its early stage. Within a general reinforcement learning setting, we consider the problem of building a general reinforcement learning agent which uses experience to construct a causal graph of the environment, and use this graph to inform its policy. Our approach has three characteristics: First, we learn a simple, coarse-grained causal graph, in which the variables reflect states at many time instances, and the interventions happen at the level of policies, rather than individual actions. Secondly, we use mediation analysis to obtain an optimization target. By minimizing this target, we define the causal variables. Thirdly, our approach relies on estimating conditional expectations rather the familiar expected return from reinforcement…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Reinforcement Learning of Causal Variables Using Mediation Analysis· underline

Taxonomy

TopicsMobile Crowdsensing and Crowdsourcing