A Causal Bandit Approach to Learning Good Atomic Interventions in   Presence of Unobserved Confounders

Aurghya Maiti; Vineet Nair; Gaurav Sinha

arXiv:2107.02772·cs.LG·May 20, 2022

A Causal Bandit Approach to Learning Good Atomic Interventions in Presence of Unobserved Confounders

Aurghya Maiti, Vineet Nair, Gaurav Sinha

PDF

Open Access

TL;DR

This paper introduces algorithms for learning optimal interventions in causal Bayesian networks with unobserved confounders, achieving near-optimal regret bounds and outperforming existing methods by leveraging causal graph structures.

Contribution

It presents the first simple and cumulative regret minimization algorithms for causal Bayesian networks with unobserved confounders and general causal graphs.

Findings

01

Achieves $ ilde{O}( oot{M}{T})$ simple regret bound for certain causal graphs.

02

Demonstrates algorithms outperform standard MAB approaches by utilizing causal side-information.

03

Provides experimental validation comparing new algorithms with existing methods.

Abstract

We study the problem of determining the best intervention in a Causal Bayesian Network (CBN) specified only by its causal graph. We model this as a stochastic multi-armed bandit (MAB) problem with side-information, where the interventions correspond to the arms of the bandit instance. First, we propose a simple regret minimization algorithm that takes as input a semi-Markovian causal graph with atomic interventions and possibly unobservable variables, and achieves $\tilde{O} (M / T)$ expected simple regret, where $M$ is dependent on the input CBN and could be very small compared to the number of arms. We also show that this is almost optimal for CBNs described by causal graphs having an $n$ -ary tree structure. Our simple regret minimization results, both upper and lower bound, subsume previous results in the literature, which assumed additional structural restrictions on the input…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning