Choosing the Better Bandit Algorithm under Data Sharing: When Do A/B Experiments Work?

Shuangning Li; Chonghuan Wang; Jingyan Wang

arXiv:2507.11891·stat.ML·February 24, 2026

Choosing the Better Bandit Algorithm under Data Sharing: When Do A/B Experiments Work?

Shuangning Li, Chonghuan Wang, Jingyan Wang

PDF

Open Access 1 Repo

TL;DR

This paper examines the impact of data sharing on A/B experiments for recommendation algorithms, analyzing when interference biases decision-making and proposing a detection method to improve algorithm comparison accuracy.

Contribution

It formalizes the effect of data sharing interference within a bandit framework and provides theoretical conditions for when A/B test results are reliable.

Findings

01

Interference can bias the estimated treatment effect in recommendation systems.

02

The level of exploration influences the reliability of A/B comparisons under data sharing.

03

A ramp-up experiment procedure can detect when data sharing leads to incorrect algorithm assessments.

Abstract

We study A/B experiments that are designed to compare the performance of two recommendation algorithms. Prior work has observed that the stable unit treatment value assumption (SUTVA) often does not hold in large-scale recommendation systems, and hence the estimate for the global treatment effect (GTE) is biased. Specifically, units under the treatment and control algorithms contribute to a shared pool of data that subsequently train both algorithms, resulting in interference between the two groups. In this paper, we investigate when such interference may affect our decision making on which algorithm is better. We formalize this insight under a multi-armed bandit framework and theoretically characterize when the sign of the difference-in-means estimator of the GTE under data sharing aligns with or contradicts the sign of the true GTE. Our analysis identifies the level of exploration…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jingyanw/bandit-data-sharing
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · AI in Service Interactions