Statistical Properties of Exclusive and Non-exclusive Online Randomized Experiments using Bucket Reuse
M{\aa}rten Schultzberg, Oskar Kjellin, Johan Rydberg

TL;DR
This paper analyzes the statistical effects of exclusive and non-exclusive randomized experiments that use bucket reuse, focusing on bias, estimator properties, and implications for inference in large populations.
Contribution
It connects bucket sampling to complex sampling theory, derives bias from restricted sampling, and provides simulation-based recommendations for bias evaluation and management.
Findings
Bucket sampling properties are linked to complex sampling theory.
Restricted sampling introduces quantifiable bias.
Simulations support theoretical bias estimates and handling strategies.
Abstract
Randomized experiments is a key part of product development in the tech industry. It is often necessary to run programs of exclusive experiments, i.e., experiments that cannot be run on the same units during the same time. These programs implies restriction on the random sampling, as units that are currently in an experiment cannot be sampled into a new one. Moreover, to technically enable this type of coordination with large populations, the units in the population are often grouped into 'buckets' and sampling is then performed on the bucket level. This paper investigates some statistical implications of both the restricted sampling and the bucket-level sampling. The contribution of this paper is threefold: First, bucket sampling is connected to the existing literature on randomized experiments in complex sampling designs which enables establishing properties of the difference-in-means…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
