Multi-Player Bandits Revisited

Lilian Besson (IETR; SEQUEL); Emilie Kaufmann (CRIStAL; SEQUEL)

arXiv:1711.02317·stat.ML·April 30, 2019·19 cites

Multi-Player Bandits Revisited

Lilian Besson (IETR, SEQUEL), Emilie Kaufmann (CRIStAL, SEQUEL)

PDF

Open Access

TL;DR

This paper advances multi-player multi-armed bandit algorithms by introducing new algorithms with improved regret bounds, analyzing their performance under various feedback levels, and proposing heuristics suitable for IoT applications.

Contribution

It introduces two new algorithms with strong theoretical guarantees, improves regret lower bounds, and explores a sensing-free heuristic for decentralized multi-player bandits.

Findings

01

RandTopM and MCTopM outperform existing algorithms empirically.

02

Theoretical guarantees include asymptotic optimality in selecting suboptimal arms.

03

The Selfish heuristic operates without sensing information, suitable for IoT networks.

Abstract

Multi-player Multi-Armed Bandits (MAB) have been extensively studied in the literature, motivated by applications to Cognitive Radio systems. Driven by such applications as well, we motivate the introduction of several levels of feedback for multi-player MAB algorithms. Most existing work assume that sensing information is available to the algorithm. Under this assumption, we improve the state-of-the-art lower bound for the regret of any decentralized algorithms and introduce two algorithms, RandTopM and MCTopM, that are shown to empirically outperform existing algorithms. Moreover, we provide strong theoretical guarantees for these algorithms, including a notion of asymptotic optimality in terms of the number of selections of bad arms. We then introduce a promising heuristic, called Selfish, that can operate without sensing information, which is crucial for emerging applications to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Game Theory and Applications · Mobile Crowdsensing and Crowdsourcing