Network of Bandits insure Privacy of end-users
Rapha\"el F\'eraud

TL;DR
This paper introduces a privacy-preserving distributed bandit algorithm that enables edge devices to collaboratively identify the best option without exposing individual data, balancing communication efficiency and speed.
Contribution
It proposes the Distributed Median Elimination algorithm and its extension, enhancing privacy, efficiency, and practical performance in distributed multi-armed bandit settings.
Findings
Optimal in transmitted bits
Near optimal speed-up factor
Significant practical improvements
Abstract
In order to distribute the best arm identification task as close as possible to the user's devices, on the edge of the Radio Access Network, we propose a new problem setting, where distributed players collaborate to find the best arm. This architecture guarantees privacy to end-users since no events are stored. The only thing that can be observed by an adversary through the core network is aggregated information across users. We provide a first algorithm, Distributed Median Elimination, which is optimal in term of number of transmitted bits and near optimal in term of speed-up factor with respect to an optimal algorithm run independently on each player. In practice, this first algorithm cannot handle the trade-off between the communication cost and the speed-up factor, and requires some knowledge about the distribution of players. Extended Distributed Median Elimination overcomes these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Machine Learning and Algorithms
