Differentially Private Contextual Linear Bandits

Roshan Shariff; Or Sheffet

arXiv:1810.00068 · cs.LG · October 2, 2018 · 27 cites

Open Access

TL;DR

This paper studies private learning in the contextual linear bandit problem. It shows that the standard definition of differential privacy forces linear regret, adopts joint differential privacy instead, converts the linear-UCB algorithm into a jointly private one by adding noise, and establishes regret bounds and lower bounds for private algorithms.

Contribution

The paper adopts joint differential privacy for contextual linear bandits, adapts linear-UCB by injecting noise, and provides regret bounds together with fundamental lower bounds for private algorithms.

Findings

01

Joint differential privacy can be achieved with controlled regret.

02

Adding Gaussian or Wishart noise maintains privacy while bounding regret.

03

The paper gives the first lower bound on the additional regret that a private algorithm for the MAB problem must incur.
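To make the Gaussian and Wishart noise in the findings above concrete, here is a minimal Python sketch of two ways to perturb a Gram matrix. The function names, the noise scales, and the degrees-of-freedom value are all illustrative assumptions; they are not the paper's calibration and carry no privacy guarantee as written.

```python
import numpy as np

def gaussian_noise_matrix(d, sigma, rng):
    """Symmetric d x d matrix; entries on and above the diagonal are i.i.d. N(0, sigma^2)."""
    z = rng.normal(0.0, sigma, size=(d, d))
    upper = np.triu(z)                 # diagonal and upper triangle
    return upper + np.triu(z, k=1).T   # mirror the strict upper triangle below

def wishart_noise_matrix(d, dof, scale, rng):
    """Wishart-style sample G^T G, positive semidefinite by construction."""
    g = rng.normal(0.0, scale, size=(dof, d))
    return g.T @ g

rng = np.random.default_rng(0)
d = 4
gram = np.eye(d)  # stand-in for a sum of outer products of feature vectors

# Hypothetical noise parameters, chosen only for illustration.
noisy_gaussian = gram + gaussian_noise_matrix(d, sigma=0.5, rng=rng)
noisy_wishart = gram + wishart_noise_matrix(d, dof=2 * d, scale=0.5, rng=rng)
```

One design difference is visible even in this toy version: the Wishart-style perturbation can never push eigenvalues below those of the original matrix, while a symmetric Gaussian perturbation can produce negative eigenvalues, so a scheme built on it would likely need an additional shift or regularizer to keep the matrix invertible.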

Abstract

We study the contextual linear bandit problem, a version of the standard stochastic multi-armed bandit (MAB) problem where a learner sequentially selects actions to maximize a reward which also depends on a user-provided per-round context. Though the context is chosen arbitrarily or adversarially, the reward is assumed to be a stochastic function of a feature vector that encodes the context and selected action. Our goal is to devise private learners for the contextual linear bandit problem. We first show that using the standard definition of differential privacy results in linear regret. So instead, we adopt the notion of joint differential privacy, where we assume that the action chosen on day $t$ is only revealed to user $t$ and thus needn't be kept private that day, only on following days. We give a general scheme converting the classic linear-UCB algorithm into a joint…
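The idea of running linear-UCB on perturbed statistics can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the paper's scheme: the function `noisy_linucb_choose`, the confidence multiplier `beta`, the regularizer `reg`, and the noise drawn once up front are all hypothetical choices. A real jointly private algorithm would need noise that is calibrated and refreshed as data accumulates, which is not shown here.

```python
import numpy as np

def noisy_linucb_choose(features, gram, reward_vec, noise_gram, noise_vec,
                        beta=1.0, reg=1.0):
    """Pick the action with the highest upper confidence bound.

    features: (k, d) array with one feature vector per action for this round.
    gram, reward_vec: running sums of x x^T and reward * x from past rounds.
    noise_gram, noise_vec: perturbations added before the statistics are used.
    """
    d = gram.shape[0]
    v = gram + noise_gram + reg * np.eye(d)   # perturbed, regularized Gram matrix
    u = reward_vec + noise_vec                # perturbed reward-weighted sum
    theta_hat = np.linalg.solve(v, u)         # ridge-style estimate
    v_inv = np.linalg.inv(v)
    # Confidence width sqrt(x^T V^{-1} x) for each candidate action.
    widths = np.sqrt(np.einsum("kd,de,ke->k", features, v_inv, features))
    return int(np.argmax(features @ theta_hat + beta * widths))

rng = np.random.default_rng(0)
d, k, rounds = 3, 5, 200
theta_star = np.array([0.5, -0.2, 0.3])       # hypothetical true parameter

gram = np.zeros((d, d))
reward_vec = np.zeros(d)

# Illustrative noise: a positive semidefinite matrix and a Gaussian vector.
g = rng.normal(0.0, 0.1, size=(2 * d, d))
noise_gram = g.T @ g
noise_vec = rng.normal(0.0, 0.1, size=d)

chosen = []
for t in range(rounds):
    # Fresh context each round, encoded as unit-norm feature vectors per action.
    features = rng.normal(size=(k, d))
    features /= np.linalg.norm(features, axis=1, keepdims=True)
    a = noisy_linucb_choose(features, gram, reward_vec, noise_gram, noise_vec)
    x = features[a]
    reward = x @ theta_star + rng.normal(0.0, 0.1)
    gram += np.outer(x, x)                    # update statistics with the chosen action only
    reward_vec += reward * x
    chosen.append(a)
```

In this sketch the learner only ever consults the perturbed quantities `v` and `u` when choosing an action, which mirrors the intuition that later users' recommendations should depend on earlier users' data only through noisy statistics.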

Taxonomy

Topics: Privacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research