Partial Structure Discovery is Sufficient for No-regret Learning in Causal Bandits
Muhammad Qasim Elahi, Mahsa Ghasemi, Murat Kocaoglu

TL;DR
This paper investigates causal bandit problems with unknown causal graphs, showing that discovering a partial causal structure suffices for no-regret learning, and introduces algorithms with sample complexity guarantees.
Contribution
It characterizes the necessary and sufficient components of the causal graph for optimal learning and proposes a two-stage algorithm combining causal discovery with bandit optimization.
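As a rough illustration of the two-stage idea, the sketch below first narrows the arm set to ancestors of the reward node and then runs a standard UCB1 bandit over those arms. It is not the authors' algorithm. The discovery stage is replaced by a lookup in a toy graph that is assumed to be known, latent confounders are ignored, and every name here (`ancestors`, `ucb1`, the variables `X1` to `X4`, the reward probabilities) is hypothetical.

```python
import math
import random


def ancestors(parents, node):
    """Return all ancestors of `node` in a DAG given as child -> list of parents.

    Stand-in for the discovery stage: the graph is assumed known here,
    whereas the paper treats it as unknown.
    """
    seen = set()
    stack = list(parents.get(node, []))
    while stack:
        v = stack.pop()
        if v not in seen:
            seen.add(v)
            stack.extend(parents.get(v, []))
    return seen


def ucb1(arms, pull, horizon):
    """Run UCB1 over `arms` for `horizon` rounds; `pull(arm)` returns a reward in [0, 1]."""
    counts = {a: 0 for a in arms}
    means = {a: 0.0 for a in arms}
    for t in range(1, horizon + 1):
        untried = [a for a in arms if counts[a] == 0]
        if untried:
            arm = untried[0]  # play every arm once before using the index
        else:
            arm = max(
                arms,
                key=lambda a: means[a] + math.sqrt(2 * math.log(t) / counts[a]),
            )
        reward = pull(arm)
        counts[arm] += 1
        means[arm] += (reward - means[arm]) / counts[arm]  # running average
    return counts, means


if __name__ == "__main__":
    random.seed(0)
    # Toy graph (hypothetical): X1 -> X2 -> Y, and X3 -> X4 unrelated to Y.
    toy_parents = {"Y": ["X2"], "X2": ["X1"], "X4": ["X3"]}
    candidate_arms = sorted(ancestors(toy_parents, "Y"))  # stage 1: prune arms
    # Hypothetical Bernoulli reward probability for intervening on each variable.
    reward_prob = {"X1": 0.3, "X2": 0.8}
    counts, means = ucb1(
        candidate_arms,
        lambda a: 1.0 if random.random() < reward_prob[a] else 0.0,
        horizon=3000,
    )  # stage 2: bandit over the pruned arm set
    print(counts)
```

The point of the sketch is only the division of labor: structure information shrinks the set of interventions, and the bandit stage pays exploration cost only on what remains.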
Findings
Identifies minimal causal information needed for regret minimization.
Provides a sample complexity guarantee for causal graph learning.
Establishes a sublinear regret bound for the two-phase approach.
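For readers unfamiliar with the term, "sublinear regret" can be stated with the standard stochastic-bandit definition. The notation below is an assumption for illustration and is not taken from the paper: $\mu_a$ is the expected reward of arm $a$, $\mu^* = \max_a \mu_a$, and $a_t$ is the arm played in round $t$.

```latex
R_T \;=\; T\,\mu^* \;-\; \mathbb{E}\!\left[\sum_{t=1}^{T} \mu_{a_t}\right],
\qquad
\text{sublinear regret:}\quad \lim_{T \to \infty} \frac{R_T}{T} = 0 .
```

A learner with sublinear regret is what "no-regret" refers to: its average per-round loss relative to the best arm vanishes as the horizon grows.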
Abstract
Causal knowledge about the relationships among decision variables and a reward variable in a bandit setting can accelerate the learning of an optimal decision. Current works often assume the causal graph is known, which may not always be available a priori. Motivated by this challenge, we focus on the causal bandit problem in scenarios where the underlying causal graph is unknown and may include latent confounders. While intervention on the parents of the reward node is optimal in the absence of latent confounders, this is not necessarily the case in general. Instead, one must consider a set of possibly optimal arms/interventions, each being a special subset of the ancestors of the reward node, making causal discovery beyond the parents of the reward node essential. For regret minimization, we identify that discovering the full causal structure is unnecessary; however, no existing work…
Taxonomy
Topics: Advanced Bandit Algorithms Research · Data Stream Mining Techniques · Misinformation and Its Impacts
Methods: Sparse Evolutionary Training · Focus
