Enhancing External Validity of Experiments with Ongoing Sampling
Chen Wang, Shichao Han, Shan Huang

TL;DR
This paper introduces a dynamic sampling framework for online experiments that improves external validity by adjusting for temporal shifts, validated through real-world and synthetic experiments on WeChat.
Contribution
It proposes a stage-specific estimator for PATE that adapts during ongoing sampling, using survival analysis to identify stages without prior population knowledge.
Findings
Validated on WeChat with real-world online experiments.
Effective in synthetic and platform-wide A/B tests.
Enhances generalizability of experimental results.
Abstract
Participants in online experiments often enroll over time, which can compromise sample representativeness due to temporal shifts in covariates. This issue is particularly critical in A/B tests, online controlled experiments extensively used to evaluate product updates, since these tests are cost-sensitive and typically short in duration. We propose a novel framework that dynamically assesses sample representativeness by dividing the ongoing sampling process into three stages. We then develop stage-specific estimators for Population Average Treatment Effects (PATE), ensuring that experimental results remain generalizable across varying experiment durations. Leveraging survival analysis, we develop a heuristic function that identifies these stages without requiring prior knowledge of population or sample characteristics, thereby keeping implementation costs low. Our approach bridges the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
