Off-policy evaluation beyond overlap: partial identification through   smoothness

Samir Khan; Martin Saveski; Johan Ugander

arXiv:2305.11812·stat.ME·March 12, 2024·2 cites

Off-policy evaluation beyond overlap: partial identification through smoothness

Samir Khan, Martin Saveski, Johan Ugander

PDF

Open Access

TL;DR

This paper develops a method for off-policy evaluation that provides bounds on policy value without overlap assumptions, using Lipschitz smoothness to formulate and solve linear programs efficiently.

Contribution

It introduces a novel partial identification approach for OPE under non-parametric smoothness assumptions, with closed-form solutions and optimal convergence rates.

Findings

01

Bounds are tighter with smoothness assumptions.

02

Linear programs can be solved efficiently.

03

Method achieves minimax optimal convergence rates.

Abstract

Off-policy evaluation (OPE) is the problem of estimating the value of a target policy using historical data collected under a different logging policy. OPE methods typically assume overlap between the target and logging policy, enabling solutions based on importance weighting and/or imputation. In this work, we approach OPE without assuming either overlap or a well-specified model by considering a strategy based on partial identification under non-parametric assumptions on the conditional mean function, focusing especially on Lipschitz smoothness. Under such smoothness assumptions, we formulate a pair of linear programs whose optimal values upper and lower bound the contributions of the no-overlap region to the off-policy value. We show that these linear programs have a concise closed form solution that can be computed efficiently and that their solutions converge, under the Lipschitz…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEnvironmental Impact and Sustainability · Infrastructure Maintenance and Monitoring · Advanced Causal Inference Techniques