Ensure A/B Test Quality at Scale with Automated Randomization Validation   and Sample Ratio Mismatch Detection

Keyu Nie; Zezhong Zhang; Bingquan Xu; Tao Yuan

arXiv:2208.07766·stat.AP·March 10, 2023·CIKM

Ensure A/B Test Quality at Scale with Automated Randomization Validation and Sample Ratio Mismatch Detection

Keyu Nie, Zezhong Zhang, Bingquan Xu, Tao Yuan

PDF

TL;DR

This paper presents automated methods for ensuring A/B test quality at large scale, focusing on randomization validation with PSI and sample ratio mismatch detection to improve trustworthiness and efficiency.

Contribution

It introduces automated processes for randomization validation and sample ratio mismatch detection, enhancing large-scale experiment quality monitoring.

Findings

01

Automated randomization validation reduces false positives.

02

Sample ratio mismatch detection improves experiment trustworthiness.

03

Methods enable scalable and efficient experiment monitoring.

Abstract

eBay's experimentation platform runs hundreds of A/B tests on any given day. The platform integrates with the tracking infrastructure and customer experience servers, provides the sampling service for experiments, and has the responsibility to monitor the progress of each A/B test. There are many challenges especially when it is required to ensure experiment quality at the large scale. We discuss two automated test quality monitoring processes and methodologies, namely randomization validation using population stability index (PSI) and sample ratio mismatch (a.k.a. sample delta) detection using sequential analysis. The automated processes assist the experimentation platform to run high quality and trustworthy tests not only effectively on a large scale, but also efficiently by minimizing false positive monitoring alarms to experimenters.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.