Learning Across Experiments and Time: Tackling Heterogeneity in A/B Testing
Xinran Li

TL;DR
This paper introduces a local empirical Bayes method for A/B testing that adaptively pools data across time and experiments, improving estimate stability and accuracy amidst heterogeneity and nonstationarity.
Contribution
It proposes a novel local pooling framework that accounts for temporal and cross-experiment heterogeneity, outperforming traditional global pooling methods.
Findings
Reduces variance of treatment effect estimates.
Avoids bias caused by nonstationarity and heterogeneity.
Enhances decision-making reliability in A/B testing.
Abstract
A/B testing plays a central role in data-driven product development, guiding launch decisions for new features and designs. However, treatment effect estimates are often noisy due to short horizons, early stopping, and slowly accumulating long-tail metrics, making early conclusions unreliable. A natural remedy is to pool information across related experiments, but naive pooling potentially fails: within experiments, treatment effects may evolve over time, so mixing early and late outcomes without accounting for nonstationarity induces bias; across experiments, heterogeneity in product, user population, or season dilutes the signal with unrelated noise. These issues highlight the need for pooling strategies that adapt to both temporal evolution and cross-experiment variability. To address these challenges, we propose a local empirical Bayes framework that adapts to both temporal and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Statistical Methods in Clinical Trials · Advanced Multi-Objective Optimization Algorithms
