When Privacy Meets Partial Information: A Refined Analysis of Differentially Private Bandits

Achraf Azize; Debabrota Basu

arXiv:2209.02570 · cs.LG · November 7, 2022

TL;DR

This paper analyzes the impact of differential privacy on multi-armed bandit problems, establishing regret bounds, identifying privacy regimes, and proposing near-optimal private algorithms.

Contribution

It provides the first minimax and problem-dependent regret lower bounds for differentially private bandits and introduces the AdaP-UCB and AdaP-KLUCB algorithms, which come with near-optimal regret guarantees.

Findings

1. Regret bounds depend on the privacy level $\epsilon$.
2. The high-privacy regime increases problem hardness.
3. The proposed AdaP-KLUCB matches the lower bounds.

Abstract

We study the problem of multi-armed bandits with $\epsilon$-global Differential Privacy (DP). First, we prove the minimax and problem-dependent regret lower bounds for stochastic and linear bandits that quantify the hardness of bandits with $\epsilon$-global DP. These bounds suggest the existence of two hardness regimes depending on the privacy budget $\epsilon$. In the high-privacy regime (small $\epsilon$), the hardness depends on a coupled effect of privacy and partial information about the reward distributions. In the low-privacy regime (large $\epsilon$), bandits with $\epsilon$-global DP are not harder than bandits without privacy. For stochastic bandits, we further propose a generic framework to design a near-optimal $\epsilon$-global DP extension of an index-based optimistic bandit algorithm. The framework consists of three ingredients: the Laplace mechanism, arm-dependent…
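
The abstract names the Laplace mechanism as one ingredient of the framework. As a rough illustration of that ingredient alone, the sketch below privately releases the empirical mean of one arm's rewards. It is not the paper's AdaP-UCB or AdaP-KLUCB, and it assumes rewards lie in [0, 1], so that changing one reward moves the mean of n rewards by at most 1/n. The names `laplace_scale` and `private_mean` are hypothetical helpers chosen for this example.

```python
import random


def laplace_scale(n, epsilon):
    """Noise scale for the mean of n rewards in [0, 1] under epsilon-DP.

    Assumption: each reward lies in [0, 1], so the sensitivity of the
    mean to a change in one reward is 1 / n.
    """
    if n <= 0:
        raise ValueError("need at least one reward")
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    return 1.0 / (n * epsilon)


def private_mean(rewards, epsilon, rng):
    """Return the empirical mean plus Laplace noise (Laplace mechanism)."""
    clipped = [min(1.0, max(0.0, float(r))) for r in rewards]
    scale = laplace_scale(len(clipped), epsilon)
    # The difference of two i.i.d. exponentials with mean `scale`
    # follows a Laplace distribution with location 0 and that scale.
    noise = rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)
    return sum(clipped) / len(clipped) + noise


if __name__ == "__main__":
    rng = random.Random(0)
    # Simulated Bernoulli(0.7) rewards from a single arm (illustrative data).
    rewards = [1.0 if rng.random() < 0.7 else 0.0 for _ in range(200)]
    for eps in (0.1, 1.0, 10.0):
        print(f"epsilon={eps}: private mean = {private_mean(rewards, eps, rng):.3f}")
```

The noise scale 1/(n·ε) shrinks as either the number of samples or the privacy budget grows. This gives some intuition for the two regimes described above: for small $\epsilon$ the added noise can dominate the estimate, while for large $\epsilon$ it becomes negligible next to ordinary sampling error.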

Videos

When Privacy Meets Partial Information: A Refined Analysis of Differentially Private Bandits (SlidesLive)

Taxonomy

Topics: Privacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research