Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework
Wenzhuo Zhou, Yuhan Li, Ruoqing Zhu, Annie Qu

TL;DR
This paper introduces a unified error quantification framework for high-confidence off-policy evaluation in infinite-horizon MDPs, addressing distributional shift and error tradeoffs to produce tight, robust confidence intervals.
Contribution
It proposes a novel unified error analysis and estimator that jointly quantifies modeling and sampling errors, breaking tradeoffs to improve CI tightness and robustness against distributional shifts.
Findings
Achieves the tightest possible confidence intervals under distributional shift.
Proves sample efficiency and error robustness in non-linear function approximation.
Demonstrates effectiveness on synthetic data and mobile health datasets.
Abstract
We study high-confidence off-policy evaluation in the context of infinite-horizon Markov decision processes, where the objective is to establish a confidence interval (CI) for the target policy value using only offline data pre-collected from unknown behavior policies. This task faces two primary challenges: providing a comprehensive and rigorous error quantification in CI estimation, and addressing the distributional shift that results from discrepancies between the distribution induced by the target policy and the offline data-generating process. Motivated by an innovative unified error analysis, we jointly quantify the two sources of estimation errors: the misspecification error on modeling marginalized importance weights and the statistical uncertainty due to sampling, within a single interval. This unified framework reveals a previously hidden tradeoff between the errors, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNutritional Studies and Diet · Statistical Methods and Inference
