Understanding Memory-Regret Trade-Off for Streaming Stochastic   Multi-Armed Bandits

Yuchen He; Zichun Ye; Chihao Zhang

arXiv:2405.19752·cs.LG·July 9, 2024

Understanding Memory-Regret Trade-Off for Streaming Stochastic Multi-Armed Bandits

Yuchen He, Zichun Ye, Chihao Zhang

PDF

Open Access

TL;DR

This paper characterizes the optimal regret for streaming stochastic multi-armed bandits with memory constraints, providing algorithms and bounds that depend on the number of passes, arms, and memory size.

Contribution

It offers a complete characterization of the optimal regret in the streaming multi-armed bandit problem with memory limitations, including matching upper and lower bounds.

Findings

01

Designed an algorithm with specific regret bounds.

02

Proved a matching lower bound for the regret.

03

Results are tight up to a logarithmic factor.

Abstract

We study the stochastic multi-armed bandit problem in the $P$ -pass streaming model. In this problem, the $n$ arms are present in a stream and at most $m < n$ arms and their statistics can be stored in the memory. We give a complete characterization of the optimal regret in terms of $m, n$ and $P$ . Specifically, we design an algorithm with $\tilde{O} ((n - m)^{1 + \frac{2 ^{P} - 2}{2 ^{P + 1} - 1}} n^{\frac{2 - 2 ^{P + 1}}{2 ^{P + 1} - 1}} T^{\frac{2 ^{P}}{2 ^{P + 1} - 1}})$ regret and complement it with an $\tilde{Ω} ((n - m)^{1 + \frac{2 ^{P} - 2}{2 ^{P + 1} - 1}} n^{\frac{2 - 2 ^{P + 1}}{2 ^{P + 1} - 1}} T^{\frac{2 ^{P}}{2 ^{P + 1} - 1}})$ lower bound when the number of rounds $T$ is sufficiently large. Our results are tight up to a logarithmic factor in $n$ and $P$ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Smart Grid Energy Management