A Causal Bandit Approach to Learning Good Atomic Interventions in Presence of Unobserved Confounders
Aurghya Maiti, Vineet Nair, Gaurav Sinha

TL;DR
This paper introduces algorithms for learning optimal interventions in causal Bayesian networks with unobserved confounders, achieving near-optimal regret bounds and outperforming existing methods by leveraging causal graph structures.
Contribution
It presents the first simple and cumulative regret minimization algorithms for causal Bayesian networks with unobserved confounders and general causal graphs.
Findings
Achieves $ ilde{O}( oot{M}{T})$ simple regret bound for certain causal graphs.
Demonstrates algorithms outperform standard MAB approaches by utilizing causal side-information.
Provides experimental validation comparing new algorithms with existing methods.
Abstract
We study the problem of determining the best intervention in a Causal Bayesian Network (CBN) specified only by its causal graph. We model this as a stochastic multi-armed bandit (MAB) problem with side-information, where the interventions correspond to the arms of the bandit instance. First, we propose a simple regret minimization algorithm that takes as input a semi-Markovian causal graph with atomic interventions and possibly unobservable variables, and achieves expected simple regret, where is dependent on the input CBN and could be very small compared to the number of arms. We also show that this is almost optimal for CBNs described by causal graphs having an -ary tree structure. Our simple regret minimization results, both upper and lower bound, subsume previous results in the literature, which assumed additional structural restrictions on the input…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning
