On reducing the order of arm-passes bandit streaming algorithms under   memory bottleneck

Santanu Rathod

arXiv:2112.06130·cs.LG·December 14, 2021

On reducing the order of arm-passes bandit streaming algorithms under memory bottleneck

Santanu Rathod

PDF

Open Access

TL;DR

This paper improves multi-arm bandit streaming algorithms under memory constraints by reducing the number of passes needed to achieve near-optimal regret, offering more efficient solutions in resource-limited settings.

Contribution

It introduces algorithms that decrease the number of streaming passes required for low-regret bandit performance under memory bottlenecks, including 2-pass algorithms with initial conditions.

Findings

01

Reduced the number of passes for near-optimal regret by a logarithmic factor.

02

Developed 2-pass algorithms achieving similar regret bounds under initial conditions.

03

Enhanced efficiency of bandit algorithms in memory-constrained streaming environments.

Abstract

In this work we explore multi-arm bandit streaming model, especially in cases where the model faces resource bottleneck. We build over existing algorithms conditioned by limited arm memory at any instance of time. Specifically, we improve the amount of streaming passes it takes for a bandit algorithm to incur a $O (T lo g (T))$ regret by a logarithmic factor, and also provide 2-pass algorithms with some initial conditions to incur a similar order of regret.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Auction Theory and Applications