Cascading Bandit under Differential Privacy
Kun Wang, Jing Dong, Baoxiang Wang, Shuai Li, Shuo Shao

TL;DR
This paper introduces new differentially private algorithms for cascading bandits, achieving improved regret bounds under both central and local privacy models, with extensive experiments validating the theoretical results.
Contribution
The paper proposes novel differentially private algorithms for cascading bandits with improved regret bounds and extends the results to combinatorial semi-bandits.
Findings
Achieves regret of (rac{\u2113 T}{})^{1+} under DP, improving previous bounds.
Provides regret bounds of (rac{K\u2206 T}) under LDP, balancing privacy and error probability.
Validates theoretical results with extensive experiments.
Abstract
This paper studies \emph{differential privacy (DP)} and \emph{local differential privacy (LDP)} in cascading bandits. Under DP, we propose an algorithm which guarantees -indistinguishability and a regret of for an arbitrarily small . This is a significant improvement from the previous work of regret. Under (,)-LDP, we relax the dependence through the tradeoff between privacy budget and error probability , and obtain a regret of , where is the size of the arm subset. This result holds for both Gaussian mechanism and Laplace mechanism by analyses on the composition. Our results extend to combinatorial semi-bandit. We show respective lower bounds for DP and LDP cascading bandits.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Age of Information Optimization · Advanced Bandit Algorithms Research
