Choosing the Better Bandit Algorithm under Data Sharing: When Do A/B Experiments Work?
Shuangning Li, Chonghuan Wang, Jingyan Wang

TL;DR
This paper examines the impact of data sharing on A/B experiments for recommendation algorithms, analyzing when interference biases decision-making and proposing a detection method to improve algorithm comparison accuracy.
Contribution
It formalizes the effect of data sharing interference within a bandit framework and provides theoretical conditions for when A/B test results are reliable.
Findings
Interference can bias the estimated treatment effect in recommendation systems.
The level of exploration influences the reliability of A/B comparisons under data sharing.
A ramp-up experiment procedure can detect when data sharing leads to incorrect algorithm assessments.
Abstract
We study A/B experiments that are designed to compare the performance of two recommendation algorithms. Prior work has observed that the stable unit treatment value assumption (SUTVA) often does not hold in large-scale recommendation systems, and hence the estimate for the global treatment effect (GTE) is biased. Specifically, units under the treatment and control algorithms contribute to a shared pool of data that subsequently train both algorithms, resulting in interference between the two groups. In this paper, we investigate when such interference may affect our decision making on which algorithm is better. We formalize this insight under a multi-armed bandit framework and theoretically characterize when the sign of the difference-in-means estimator of the GTE under data sharing aligns with or contradicts the sign of the true GTE. Our analysis identifies the level of exploration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · AI in Service Interactions
