Equivalence Test in Multi-dimensional Space with Applications in A/B Testing
Jing Miao, Hongyuan Yuan, Zhenyu Yan

TL;DR
This paper introduces a non-parametric statistical framework for validating the randomness of data splits in multi-dimensional A/B testing, ensuring the integrity of online experiments and other applications.
Contribution
It proposes a novel randomized chi-square test for multi-dimensional data, compares it with existing methods, and demonstrates its effectiveness on real and simulated datasets.
Findings
All three methods show promising power in detecting distribution differences.
The proposed tests are applicable to both categorical and continuous data.
The methodology is validated on datasets from Adobe Experience Cloud.
Abstract
In this paper, we provide a statistical testing framework to check whether a random sample splitting in a multi-dimensional space is carried out in a valid way, which could be directly applied to A/B testing and multivariate testing to ensure the online traffic split is truly random with respect to the covariates. We believe this is an important step of quality control that is missing in many real world online experiments. Here, we propose a randomized chi-square test method, compared with propensity score and distance components (DISCO) test methods, to test the hypothesis that the post-split categorical data sets have the same multi-dimensional distribution. The methods can be easily generalized to continuous data. We also propose a resampling procedure to adjust for multiplicity which in practice often has higher power than some existing method such as Holm's procedure. We try the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Statistical Methods in Clinical Trials · Mental Health Research Topics
