Cascading Bandit under Differential Privacy

Kun Wang; Jing Dong; Baoxiang Wang; Shuai Li; Shuo Shao

arXiv:2105.11126·cs.LG·June 7, 2021

Cascading Bandit under Differential Privacy

Kun Wang, Jing Dong, Baoxiang Wang, Shuai Li, Shuo Shao

PDF

Open Access

TL;DR

This paper introduces new differentially private algorithms for cascading bandits, achieving improved regret bounds under both central and local privacy models, with extensive experiments validating the theoretical results.

Contribution

The paper proposes novel differentially private algorithms for cascading bandits with improved regret bounds and extends the results to combinatorial semi-bandits.

Findings

01

Achieves regret of (rac{\u2113 T}{})^{1+} under DP, improving previous bounds.

02

Provides regret bounds of (rac{K\u2206 T}) under LDP, balancing privacy and error probability.

03

Validates theoretical results with extensive experiments.

Abstract

This paper studies \emph{differential privacy (DP)} and \emph{local differential privacy (LDP)} in cascading bandits. Under DP, we propose an algorithm which guarantees $ϵ$ -indistinguishability and a regret of $O ((\frac{l o g T}{ϵ})^{1 + ξ})$ for an arbitrarily small $ξ$ . This is a significant improvement from the previous work of $O (\frac{l o g ^{3} T}{ϵ})$ regret. Under ( $ϵ$ , $δ$ )-LDP, we relax the $K^{2}$ dependence through the tradeoff between privacy budget $ϵ$ and error probability $δ$ , and obtain a regret of $O (\frac{K l o g ( 1/ δ ) l o g T}{ϵ ^{2}})$ , where $K$ is the size of the arm subset. This result holds for both Gaussian mechanism and Laplace mechanism by analyses on the composition. Our results extend to combinatorial semi-bandit. We show respective lower bounds for DP and LDP cascading bandits.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Age of Information Optimization · Advanced Bandit Algorithms Research