Distributed Differential Privacy in Multi-Armed Bandits
Sayak Ray Chowdhury, Xingyu Zhou

TL;DR
This paper introduces a new distributed differential privacy algorithm for multi-armed bandits that achieves pure-DP with minimal regret, using secure computation and noise addition, outperforming previous shuffle protocols.
Contribution
It proposes a novel bandit algorithm with pure-DP guarantees under distributed trust, matching central trust privacy costs without requiring a trusted server.
Findings
Achieves pure-DP in distributed setting with minimal regret.
Ensures Rényi differential privacy using Skellam noise.
Matches privacy costs of centralized trust models.
Abstract
We consider the standard -armed bandit problem under a distributed trust model of differential privacy (DP), which enables to guarantee privacy without a trustworthy server. Under this trust model, previous work largely focus on achieving privacy using a shuffle protocol, where a batch of users data are randomly permuted before sending to a central server. This protocol achieves () or approximate-DP guarantee by sacrificing an additional additive cost in -step cumulative regret. In contrast, the optimal privacy cost for achieving a stronger () or pure-DP guarantee under the widely used central trust model is only , where, however, a trusted server is required. In this work, we aim to obtain a pure-DP guarantee under distributed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Age of Information Optimization
