Uncertainty in Online Experiments with Dependent Data: An Evaluation of Bootstrap Methods
Eytan Bakshy, Dean Eckles

TL;DR
This paper evaluates bootstrap methods for accurately estimating uncertainty in online experiments with dependent data, emphasizing the importance of accounting for user-item dependence to maintain correct Type I error rates.
Contribution
It develops a framework for understanding dependence effects and assesses bootstrap methods on real datasets, providing practical guidance for large-scale online experiment analysis.
Findings
Single-unit dependence often suffices for error control
Multiway bootstrap improves error rate accuracy in experiments with effects
Dependence structures significantly impact confidence interval validity
Abstract
Many online experiments exhibit dependence between users and items. For example, in online advertising, observations that have a user or an ad in common are likely to be associated. Because of this, even in experiments involving millions of subjects, the difference in mean outcomes between control and treatment conditions can have substantial variance. Previous theoretical and simulation results demonstrate that not accounting for this kind of dependence structure can result in confidence intervals that are too narrow, leading to inaccurate hypothesis tests. We develop a framework for understanding how dependence affects uncertainty in user-item experiments and evaluate how bootstrap methods that account for differing levels of dependence perform in practice. We use three real datasets describing user behaviors on Facebook - user responses to ads, search results, and News Feed stories…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Consumer Market Behavior and Pricing · Advanced Bandit Algorithms Research
