Linear Causal Bandits: Unknown Graph and Soft Interventions

Zirui Yan; Ali Tajer

arXiv:2411.02383·stat.ML·November 5, 2024

Linear Causal Bandits: Unknown Graph and Soft Interventions

Zirui Yan, Ali Tajer

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper develops new algorithms and theoretical bounds for causal bandit problems with unknown graphs and stochastic interventions, advancing understanding of regret minimization in complex causal settings.

Contribution

It introduces the first regret bounds for causal bandits with unknown graphs and stochastic interventions, along with a computationally efficient algorithm for this setting.

Findings

01

Regret scales as ((cd)^{L-rac{1}{2}}\u007d ext{T} + d + RN)

02

Established a minimax lower bound of (d^{L-rac{3}{2}} ext{T})

03

Graph size N has diminishing impact on regret as T increases

Abstract

Designing causal bandit algorithms depends on two central categories of assumptions: (i) the extent of information about the underlying causal graphs and (ii) the extent of information about interventional statistical models. There have been extensive recent advances in dispensing with assumptions on either category. These include assuming known graphs but unknown interventional distributions, and the converse setting of assuming unknown graphs but access to restrictive hard/ $do$ interventions, which removes the stochasticity and ancestral dependencies. Nevertheless, the problem in its general form, i.e., unknown graph and unknown stochastic intervention models, remains open. This paper addresses this problem and establishes that in a graph with $N$ nodes, maximum in-degree $d$ and maximum causal path length $L$ , after $T$ interaction rounds the regret upper bound scales…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ZiruiYan/Linear-Causal-Bandit-Unknown-Graph
noneOfficial

Videos

Linear Causal Bandits: Unknown Graph and Soft Interventions· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Explainable Artificial Intelligence (XAI)