Scalable Analysis of Bipartite Experiments
Liang Shi, Edvard Bakhitov, Kenneth Hung, Brian Karrer, Charlie Walker, Monica Bhole, Okke Schrijvers

TL;DR
This paper introduces scalable, statistically valid methods for analyzing bipartite experiments on large online platforms, improving inference accuracy and computational efficiency in the presence of complex interference graphs.
Contribution
It presents a covariate-adjusted estimator, a new inference method with proven asymptotic validity, and a linear-time algorithm suitable for large-scale data analysis.
Findings
CA-ERL reduces variance by 60-90% in real experiments.
The randomization inference method achieves correct coverage.
The linear-time algorithm is easily implementable in query engines.
Abstract
Bipartite Experiments are randomized experiments where the treatment is applied to a set of units (randomization units) that is different from the units of analysis, and randomization units and analysis units are connected through a bipartite graph. The scale of experimentation at large online platforms necessitates both accurate inference in the presence of a large bipartite interference graph, as well as a highly scalable implementation. In this paper, we describe new methods for inference that enable practical, scalable analysis of bipartite experiments: (1) We propose CA-ERL, a covariate-adjusted variant of the exposure-reweighted-linear (ERL) estimator [9], which empirically yields 60-90% variance reduction. (2) We introduce a randomization-based method for inference and prove asymptotic validity of a Wald-type confidence interval under graph sparsity assumptions. (3) We present a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimal Experimental Design Methods
