Causal Bandits for Linear Structural Equation Models
Burak Varici, Karthikeyan Shanmugam, Prasanna Sattigeri, and Ali Tajer

TL;DR
This paper introduces new algorithms for causal bandit problems in linear SEMs that do not require full knowledge of interventional distributions, achieving regret bounds that scale with graph complexity.
Contribution
It proposes the first algorithms for causal bandits in linear SEMs that estimate model parameters instead of all interventional distributions, reducing complexity.
Findings
Regret scales as O(d^{L+1/2} \u221A NT) under certain assumptions.
Algorithms work in both frequentist (UCB) and Bayesian (Thompson Sampling) settings.
Lower bounds match the upper bounds' scaling, indicating near-optimality.
Abstract
This paper studies the problem of designing an optimal sequence of interventions in a causal graphical model to minimize cumulative regret with respect to the best intervention in hindsight. This is, naturally, posed as a causal bandit problem. The focus is on causal bandits for linear structural equation models (SEMs) and soft interventions. It is assumed that the graph's structure is known and has nodes. Two linear mechanisms, one soft intervention and one observational, are assumed for each node, giving rise to possible interventions. Majority of the existing causal bandit algorithms assume that at least the interventional distributions of the reward node's parents are fully specified. However, there are such distributions (one corresponding to each intervention), acquiring which becomes prohibitive even in moderate-sized graphs. This paper dispenses with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Advanced Causal Inference Techniques
