STATE: A Robust ATE Estimator of Heavy-Tailed Metrics for Variance   Reduction in Online Controlled Experiments

Hao Zhou; Kun Sun; Shaoming Li; Yangfeng Fan; Guibin Jiang; Jiaqi; Zheng; Tao Li

arXiv:2407.16337·cs.LG·July 24, 2024

STATE: A Robust ATE Estimator of Heavy-Tailed Metrics for Variance Reduction in Online Controlled Experiments

Hao Zhou, Kun Sun, Shaoming Li, Yangfeng Fan, Guibin Jiang, Jiaqi, Zheng, Tao Li

PDF

TL;DR

This paper introduces STATE, a robust estimator for heavy-tailed metrics in online experiments that significantly reduces variance and outperforms existing methods, enabling faster and more reliable data-driven decisions.

Contribution

The paper proposes a novel heavy-tailed metric estimator using Student's t-distribution and machine learning, extending variance reduction techniques to ratio metrics with proven effectiveness.

Findings

01

Achieves over 50% variance reduction in synthetic and real data.

02

Outperforms state-of-the-art estimators CUPAC and MLRATE.

03

Enables halving the number of observations or experimental duration.

Abstract

Online controlled experiments play a crucial role in enabling data-driven decisions across a wide range of companies. Variance reduction is an effective technique to improve the sensitivity of experiments, achieving higher statistical power while using fewer samples and shorter experimental periods. However, typical variance reduction methods (e.g., regression-adjusted estimators) are built upon the intuitional assumption of Gaussian distributions and cannot properly characterize the real business metrics with heavy-tailed distributions. Furthermore, outliers diminish the correlation between pre-experiment covariates and outcome metrics, greatly limiting the effectiveness of variance reduction. In this paper, we develop a novel framework that integrates the Student's t-distribution with machine learning tools to fit heavy-tailed metrics and construct a robust average treatment effect…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.