Confidence Intervals for Policy Evaluation in Adaptive Experiments

Vitor Hadad; David A. Hirshberg; Ruohan Zhan; Stefan Wager; Susan; Athey

arXiv:1911.02768·stat.ML·February 16, 2021

Confidence Intervals for Policy Evaluation in Adaptive Experiments

Vitor Hadad, David A. Hirshberg, Ruohan Zhan, Stefan Wager, Susan, Athey

PDF

1 Repo

TL;DR

This paper introduces adaptive reweighting estimators for policy evaluation in adaptive experiments, improving accuracy, variance control, and confidence interval coverage, especially when estimating parameters different from the original trial target.

Contribution

The paper proposes a novel adaptive reweighting scheme for inverse propensity weighting estimators, addressing heavy tails and bias in adaptive experiment inference.

Findings

01

Estimators achieve asymptotically correct coverage.

02

Variance is reduced compared to existing methods.

03

Methods outperform alternatives in RMSE and coverage in experiments.

Abstract

Adaptive experiment designs can dramatically improve statistical efficiency in randomized trials, but they also complicate statistical inference. For example, it is now well known that the sample mean is biased in adaptive trials. Inferential challenges are exacerbated when our parameter of interest differs from the parameter the trial was designed to target, such as when we are interested in estimating the value of a sub-optimal treatment after running a trial to determine the optimal treatment using a stochastic bandit design. In this context, typical estimators that use inverse propensity weighting to eliminate sampling bias can be problematic: their distributions become skewed and heavy-tailed as the propensity scores decay to zero. In this paper, we present a class of estimators that overcome these issues. Our approach is to adaptively reweight the terms of an augmented inverse…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gsbDBI/adaptive-confidence-intervals
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTest