Scalable Analysis of Bipartite Experiments

Liang Shi; Edvard Bakhitov; Kenneth Hung; Brian Karrer; Charlie Walker; Monica Bhole; Okke Schrijvers

arXiv:2402.11070·stat.ME·November 5, 2025·1 cites

Scalable Analysis of Bipartite Experiments

Liang Shi, Edvard Bakhitov, Kenneth Hung, Brian Karrer, Charlie Walker, Monica Bhole, Okke Schrijvers

PDF

Open Access

TL;DR

This paper introduces scalable, statistically valid methods for analyzing bipartite experiments on large online platforms, improving inference accuracy and computational efficiency in the presence of complex interference graphs.

Contribution

It presents a covariate-adjusted estimator, a new inference method with proven asymptotic validity, and a linear-time algorithm suitable for large-scale data analysis.

Findings

01

CA-ERL reduces variance by 60-90% in real experiments.

02

The randomization inference method achieves correct coverage.

03

The linear-time algorithm is easily implementable in query engines.

Abstract

Bipartite Experiments are randomized experiments where the treatment is applied to a set of units (randomization units) that is different from the units of analysis, and randomization units and analysis units are connected through a bipartite graph. The scale of experimentation at large online platforms necessitates both accurate inference in the presence of a large bipartite interference graph, as well as a highly scalable implementation. In this paper, we describe new methods for inference that enable practical, scalable analysis of bipartite experiments: (1) We propose CA-ERL, a covariate-adjusted variant of the exposure-reweighted-linear (ERL) estimator [9], which empirically yields 60-90% variance reduction. (2) We introduce a randomization-based method for inference and prove asymptotic validity of a Wald-type confidence interval under graph sparsity assumptions. (3) We present a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimal Experimental Design Methods