A Convex Framework for Confounding Robust Inference
Kei Ishikawa, Niao He, Takafumi Kanamori

TL;DR
This paper introduces a convex optimization-based estimator for policy evaluation in offline contextual bandits with unobserved confounders, providing sharper bounds and enabling robust policy learning.
Contribution
It proposes a general convex programming framework for sharp policy value bounds under confounding, extending to sensitivity analysis, model selection, and robust policy learning.
Findings
Provides a convex estimator with sharp lower bounds on policy value.
Enables sensitivity analysis with f-divergence and model selection via cross-validation.
Offers theoretical guarantees through M-estimation techniques.
Abstract
We study policy evaluation of offline contextual bandits subject to unobserved confounders. Sensitivity analysis methods are commonly used to estimate the policy value under the worst-case confounding over a given uncertainty set. However, existing work often resorts to some coarse relaxation of the uncertainty set for the sake of tractability, leading to overly conservative estimation of the policy value. In this paper, we propose a general estimator that provides a sharp lower bound of the policy value using convex programming. The generality of our estimator enables various extensions such as sensitivity analysis with f-divergence, model selection with cross validation and information criterion, and robust policy learning with the sharp lower bound. Furthermore, our estimation method can be reformulated as an empirical risk minimization problem thanks to the strong duality, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Advanced Causal Inference Techniques · Health Systems, Economic Evaluations, Quality of Life
