Differentially Private Stochastic Linear Bandits: (Almost) for Free
Osama A. Hanna, Antonious M. Girgis, Christina Fragouli, Suhas Diggavi

TL;DR
This paper introduces differentially private algorithms for stochastic linear bandits across various models, achieving near-optimal regret bounds and demonstrating that privacy can be maintained with minimal impact on performance.
Contribution
The authors develop differentially private algorithms for linear bandits in central, local, and shuffled models, achieving regret bounds close to non-private algorithms, thus nearly providing privacy for free.
Findings
Achieve regret of ((T}+rac{1}{\u03B5}) in the central model.
Match non-private regret for constant B5 in the local model, with penalties for small B5.
Attain regret of (T+rac{1}{B5}) in the shuffled model, outperforming previous algorithms.
Abstract
In this paper, we propose differentially private algorithms for the problem of stochastic linear bandits in the central, local and shuffled models. In the central model, we achieve almost the same regret as the optimal non-private algorithms, which means we get privacy for free. In particular, we achieve a regret of matching the known lower bound for private linear bandits, while the best previously known algorithm achieves . In the local case, we achieve a regret of which matches the non-private regret for constant , but suffers a regret penalty when is small. In the shuffled model, we also achieve regret of %for small as in the central case, while the best previously known algorithm suffers a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Age of Information Optimization
