Selfish Robustness and Equilibria in Multi-Player Bandits
Etienne Boursier, Vianney Perchet

TL;DR
This paper introduces algorithms for multi-player bandit problems that are robust against selfish players aiming to maximize their own rewards, achieving low regret and handling different observation settings.
Contribution
It presents the first algorithms with logarithmic regret robust to selfish players in multi-player bandits, including strategies for various observation scenarios.
Findings
Algorithms achieve logarithmic regret with selfish players when rewards are observed.
Implicit communication strategies enable robustness in homogeneous and heterogeneous settings.
Impossibility results show limitations when only rewards are observed or means vary arbitrarily.
Abstract
Motivated by cognitive radios, stochastic multi-player multi-armed bandits gained a lot of interest recently. In this class of problems, several players simultaneously pull arms and encounter a collision - with 0 reward - if some of them pull the same arm at the same time. While the cooperative case where players maximize the collective reward (obediently following some fixed protocol) has been mostly considered, robustness to malicious players is a crucial and challenging concern. Existing approaches consider only the case of adversarial jammers whose objective is to blindly minimize the collective reward. We shall consider instead the more natural class of selfish players whose incentives are to maximize their individual rewards, potentially at the expense of the social welfare. We provide the first algorithm robust to selfish players (a.k.a. Nash equilibrium) with a logarithmic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Selfish Robustness and Equilibria in Multi-Player Bandits· youtube
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Cognitive Radio Networks and Spectrum Sensing
