Differentially Private Kernelized Contextual Bandits

Nikola Pavlovic; Sudeep Salgia; Qing Zhao

arXiv:2501.07046·stat.ML·January 14, 2025

TL;DR

This paper introduces a differentially private algorithm for kernelized contextual bandits that achieves improved error bounds by leveraging a novel reward estimator with low sensitivity, balancing privacy and learning accuracy.

Contribution

It proposes a new differentially private algorithm for kernelized contextual bandits with an innovative reward estimator, improving error bounds and privacy-utility trade-offs.

Findings

01

Achieves an error rate of O(√(γ_T/T) + γ_T/(Tε)) after T queries for a large class of kernel families.

02

Introduces a reward estimator with high utility and low sensitivity.

03

Provides theoretical guarantees under joint differential privacy constraints.

Abstract

We consider the problem of contextual kernel bandits with stochastic contexts, where the underlying reward function belongs to a known Reproducing Kernel Hilbert Space (RKHS). We study this problem under the additional constraint of joint differential privacy, where the agent needs to ensure that the sequence of query points is differentially private with respect to both the sequence of contexts and rewards. We propose a novel algorithm that improves upon the state of the art and achieves an error rate of $O\left(\sqrt{\frac{\gamma_T}{T}} + \frac{\gamma_T}{T\varepsilon}\right)$ after $T$ queries for a large class of kernel families, where $\gamma_T$ represents the effective dimensionality of the kernel and $\varepsilon > 0$ is the privacy parameter. Our results are based on a novel estimator for the reward function that simultaneously enjoys high utility along with a…
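
To get a feel for the rate O(√(γ_T/T) + γ_T/(Tε)), the two terms can be compared numerically. The sketch below is illustrative only: it drops constants and logarithmic factors, treats γ_T as a user-supplied number rather than deriving it from a kernel, and the function names `error_rate_terms` and `crossover_epsilon` are hypothetical, not taken from the paper. Setting the two terms equal gives ε = √(γ_T/T); for any larger ε the privacy term is the smaller of the two.

```python
import math
from typing import Tuple


def error_rate_terms(gamma_T: float, T: int, eps: float) -> Tuple[float, float]:
    """Return (learning_term, privacy_term) of the rate, with constants dropped.

    learning_term = sqrt(gamma_T / T)    # the eps-independent part
    privacy_term  = gamma_T / (T * eps)  # the part that grows as eps shrinks
    """
    if gamma_T <= 0 or T <= 0 or eps <= 0:
        raise ValueError("gamma_T, T and eps must all be positive")
    learning_term = math.sqrt(gamma_T / T)
    privacy_term = gamma_T / (T * eps)
    return learning_term, privacy_term


def crossover_epsilon(gamma_T: float, T: int) -> float:
    """Value of eps at which the two terms are equal: sqrt(gamma_T / T)."""
    if gamma_T <= 0 or T <= 0:
        raise ValueError("gamma_T and T must be positive")
    return math.sqrt(gamma_T / T)


if __name__ == "__main__":
    # Assumed, purely illustrative values (not from the paper).
    gamma_T, T, eps = 100.0, 10_000, 1.0
    learning, privacy = error_rate_terms(gamma_T, T, eps)
    print(f"learning term: {learning:.4f}, privacy term: {privacy:.4f}")
    print(f"terms balance at eps = {crossover_epsilon(gamma_T, T):.4f}")
```

With the assumed values γ_T = 100, T = 10,000 and ε = 1, the learning term is 0.1 and the privacy term is 0.01, so in this regime the privacy term is an order of magnitude smaller than the learning term, up to the constants ignored here.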
