Multi-User Multi-Armed Bandits for Uncoordinated Spectrum Access

Meghana Bande; Venugopal V. Veeravalli

arXiv:1807.00867·cs.LG·January 31, 2019

Multi-User Multi-Armed Bandits for Uncoordinated Spectrum Access

Meghana Bande, Venugopal V. Veeravalli

PDF

TL;DR

This paper develops algorithms for uncoordinated spectrum access using multi-user multi-armed bandit frameworks, addressing unknown user counts, collisions, and both stochastic and adversarial reward settings, with proven regret guarantees.

Contribution

It introduces novel algorithms for multi-user spectrum access that handle unknown user numbers, collisions, and dynamic environments, with theoretical regret bounds in stochastic and adversarial cases.

Findings

01

System-wide constant regret in stochastic setting

02

Sub-linear regret of order O(T^{3/4}) in adversarial setting

03

Algorithms extend to dynamic user scenarios with sub-linear regret

Abstract

A multi-user multi-armed bandit (MAB) framework is used to develop algorithms for uncoordinated spectrum access. The number of users is assumed to be unknown to each user. A stochastic setting is first considered, where the rewards on a channel are the same for each user. In contrast to prior work, it is assumed that the number of users can possibly exceed the number of channels, and that rewards can be non-zero even under collisions. The proposed algorithm consists of an estimation phase and an allocation phase. It is shown that if every user adopts the algorithm, the system wide regret is constant with time with high probability. The regret guarantees hold for any number of users and channels, in particular, even when the number of users is less than the number of channels. Next, an adversarial multi-user MAB framework is considered, where the rewards on the channels are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.