Uncertainty Quantification and Causal Considerations for Off-Policy Decision Making
Muhammad Faaiz Taufiq

TL;DR
This paper introduces new methods for off-policy evaluation that reduce variance, quantify uncertainty with predictive intervals, and address causal unidentifiability, enhancing robustness in decision-making under uncertainty.
Contribution
It presents three novel approaches: a variance-reducing estimator, a conformal prediction method for uncertainty quantification, and bounds for causal unidentifiability in sequential decisions.
Findings
The Marginal Ratio estimator reduces variance in importance sampling.
Conformal Off-Policy Prediction provides finite-sample predictive intervals.
New bounds for causal unidentifiability help assess model reliability.
Abstract
Off-policy evaluation (OPE) is a critical challenge in robust decision-making that seeks to assess the performance of a new policy using data collected under a different policy. However, the existing OPE methodologies suffer from several limitations arising from statistical uncertainty as well as causal considerations. In this thesis, we address these limitations by presenting three different works. Firstly, we consider the problem of high variance in the importance-sampling-based OPE estimators. We introduce the Marginal Ratio (MR) estimator, a novel OPE method that reduces variance by focusing on the marginal distribution of outcomes rather than direct policy shifts, improving robustness in contextual bandits. Next, we propose Conformal Off-Policy Prediction (COPP), a principled approach for uncertainty quantification in OPE that provides finite-sample predictive intervals, ensuring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Decision Making · Climate Change Policy and Economics
