Benefits and Costs of Adaptive Sampling
Yu-Shiou Willy Lin, Dae Woong Ham, Iavor Bojinov

TL;DR
This paper investigates when adaptive sampling in multi-armed bandits improves estimation accuracy over uniform sampling, proposing policies that balance inference quality and experimentation costs, with theoretical and empirical validation.
Contribution
It characterizes conditions for MSE improvements with adaptive Neyman allocation and introduces the SARP and NARP policies balancing inference and regret objectives.
Findings
Adaptive Neyman allocation improves MSE at modest sample sizes with variance heterogeneity.
Proposed policies converge to the optimal rate as the sampling budget increases.
Simulations show improved estimation precision over uniform sampling with controlled performance loss.
Abstract
Multi-armed bandits are widely used for sequential experimentation in clinical trials, recommendation systems, and online platforms. While regret minimization and valid inference from adaptively collected data have each been studied extensively, a basic question remains: when does adaptivity \emph{improve estimation precision} relative to uniform designs, and how should inference be balanced against the online cost of experimentation? We first study arm-level mean estimation under mean-squared-error (MSE) objectives. We characterize when an adaptive Neyman allocation, which allocates samples according to arm variance, yields strict MSE improvements over uniform sampling. When there is variance heterogeneity across arms, these improvements arise at modest sample sizes, clarifying that adaptivity can be preferable for inference not only asymptotically, but also in many practical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
