Differentially Private Stochastic Linear Bandits: (Almost) for Free

Osama A. Hanna; Antonious M. Girgis; Christina Fragouli; Suhas Diggavi

arXiv:2207.03445·cs.LG·July 8, 2022

Differentially Private Stochastic Linear Bandits: (Almost) for Free

Osama A. Hanna, Antonious M. Girgis, Christina Fragouli, Suhas Diggavi

PDF

Open Access

TL;DR

This paper introduces differentially private algorithms for stochastic linear bandits across various models, achieving near-optimal regret bounds and demonstrating that privacy can be maintained with minimal impact on performance.

Contribution

The authors develop differentially private algorithms for linear bandits in central, local, and shuffled models, achieving regret bounds close to non-private algorithms, thus nearly providing privacy for free.

Findings

01

Achieve regret of ((T}+rac{1}{\u03B5}) in the central model.

02

Match non-private regret for constant B5 in the local model, with penalties for small B5.

03

Attain regret of (T+rac{1}{B5}) in the shuffled model, outperforming previous algorithms.

Abstract

In this paper, we propose differentially private algorithms for the problem of stochastic linear bandits in the central, local and shuffled models. In the central model, we achieve almost the same regret as the optimal non-private algorithms, which means we get privacy for free. In particular, we achieve a regret of $\tilde{O} (T + \frac{1}{ϵ})$ matching the known lower bound for private linear bandits, while the best previously known algorithm achieves $\tilde{O} (\frac{1}{ϵ} T)$ . In the local case, we achieve a regret of $\tilde{O} (\frac{1}{ϵ} T)$ which matches the non-private regret for constant $ϵ$ , but suffers a regret penalty when $ϵ$ is small. In the shuffled model, we also achieve regret of $\tilde{O} (T + \frac{1}{ϵ})$ %for small $ϵ$ as in the central case, while the best previously known algorithm suffers a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Age of Information Optimization