Efficient Transfer Learning via Causal Bounds
Xueping Gong, Wei You, Jiheng Zhang

TL;DR
This paper introduces a method for transfer learning that uses causal bounds derived from ambiguity sets of structural causal models to improve decision-making under heterogeneity and confounding, with theoretical guarantees and empirical validation.
Contribution
It develops a novel approach to bounding causal effects using ambiguity sets, efficient sampling algorithms, and integrates these bounds into bandit algorithms for improved online learning performance.
Findings
Causal bounds can significantly reduce regret in data-scarce regimes.
The proposed sampler converges almost surely to true causal effect limits.
Embedding causal bounds into bandit algorithms achieves optimal regret bounds.
Abstract
Transfer learning seeks to accelerate sequential decision-making by leveraging offline data from related agents. However, data from heterogeneous sources that differ in observed features, distributions, or unobserved confounders often render causal effects non-identifiable and bias naive estimators. We address this by forming ambiguity sets of structural causal models defined via integral constraints on their joint densities. Optimizing any causal effect over these sets leads to generally non-convex programs whose solutions tightly bound the range of possible effects under heterogeneity or confounding. To solve these programs efficiently, we develop a hit-and-run sampler that explores the entire ambiguity set and, when paired with a local optimization oracle, produces causal bound estimates that converge almost surely to the true limits. We further accommodate estimation error by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Domain Adaptation and Few-Shot Learning · Smart Grid Energy Management
