Causal Bandits for Linear Structural Equation Models

Burak Varici; Karthikeyan Shanmugam; Prasanna Sattigeri; and Ali Tajer

arXiv:2208.12764·stat.ML·April 4, 2023·5 cites

Causal Bandits for Linear Structural Equation Models

Burak Varici, Karthikeyan Shanmugam, Prasanna Sattigeri, and Ali Tajer

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces new algorithms for causal bandit problems in linear SEMs that do not require full knowledge of interventional distributions, achieving regret bounds that scale with graph complexity.

Contribution

It proposes the first algorithms for causal bandits in linear SEMs that estimate model parameters instead of all interventional distributions, reducing complexity.

Findings

01

Regret scales as O(d^{L+1/2} \u221A NT) under certain assumptions.

02

Algorithms work in both frequentist (UCB) and Bayesian (Thompson Sampling) settings.

03

Lower bounds match the upper bounds' scaling, indicating near-optimality.

Abstract

This paper studies the problem of designing an optimal sequence of interventions in a causal graphical model to minimize cumulative regret with respect to the best intervention in hindsight. This is, naturally, posed as a causal bandit problem. The focus is on causal bandits for linear structural equation models (SEMs) and soft interventions. It is assumed that the graph's structure is known and has $N$ nodes. Two linear mechanisms, one soft intervention and one observational, are assumed for each node, giving rise to $2^{N}$ possible interventions. Majority of the existing causal bandit algorithms assume that at least the interventional distributions of the reward node's parents are fully specified. However, there are $2^{N}$ such distributions (one corresponding to each intervention), acquiring which becomes prohibitive even in moderate-sized graphs. This paper dispenses with the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bvarici/causal-bandits-linear-sem
noneOfficial

Videos

Causal Bandits for Linear Structural Equation Models· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Advanced Causal Inference Techniques