Optimal Rates of (Locally) Differentially Private Heavy-tailed   Multi-Armed Bandits

Youming Tao; Yulian Wu; Peng Zhao; Di Wang

arXiv:2106.02575·cs.LG·March 25, 2022·6 cites

Optimal Rates of (Locally) Differentially Private Heavy-tailed Multi-Armed Bandits

Youming Tao, Yulian Wu, Peng Zhao, Di Wang

PDF

Open Access

TL;DR

This paper develops and analyzes differentially private algorithms for heavy-tailed multi-armed bandit problems, achieving near-optimal regret rates in both central and local privacy models, and introduces new estimators and hard instances.

Contribution

It introduces the first near-optimal private algorithms for heavy-tailed MABs under both central and local differential privacy models, with new estimators and lower bounds.

Findings

01

Algorithms achieve near-optimal regret rates.

02

Heavy-tailed rewards require different privacy techniques.

03

Experimental results support theoretical guarantees.

Abstract

In this paper we investigate the problem of stochastic multi-armed bandits (MAB) in the (local) differential privacy (DP/LDP) model. Unlike previous results that assume bounded/sub-Gaussian reward distributions, we focus on the setting where each arm's reward distribution only has $(1 + v)$ -th moment with some $v \in (0, 1]$ . In the first part, we study the problem in the central $ϵ$ -DP model. We first provide a near-optimal result by developing a private and robust Upper Confidence Bound (UCB) algorithm. Then, we improve the result via a private and robust version of the Successive Elimination (SE) algorithm. Finally, we establish the lower bound to show that the instance-dependent regret of our improved algorithm is optimal. In the second part, we study the problem in the $ϵ$ -LDP model. We propose an algorithm that can be seen as locally private and robust version of SE…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Privacy-Preserving Technologies in Data · Age of Information Optimization