Differential Privacy in Kernelized Contextual Bandits via Random Projections
Nikola Pavlovic, Sudeep Salgia, Qing Zhao

TL;DR
This paper introduces a differentially private algorithm for kernelized contextual bandits that leverages random projections to reduce sensitivity, achieving optimal regret bounds while preserving privacy.
Contribution
The paper proposes a novel private kernel-ridge regression estimator using private covariance estimation and random projections, enabling state-of-the-art privacy-preserving bandit performance.
Findings
Achieves regret bounds of (\u007f\u007f\u007f\u007f\u007f\u007f\u007f\u007f\u007f\u007f\u007f\u007f\u007f\u007f\u007f\u007f\u007f\u007f\u007f\u007f\u007f\u007f",
State-of-the-art performance guarantees in different privacy models.
Abstract
We consider the problem of contextual kernel bandits with stochastic contexts, where the underlying reward function belongs to a known Reproducing Kernel Hilbert Space. We study this problem under an additional constraint of Differential Privacy, where the agent needs to ensure that the sequence of query points is differentially private with respect to both the sequence of contexts and rewards. We propose a novel algorithm that achieves the state-of-the-art cumulative regret of and over a time horizon of in the joint and local models of differential privacy, respectively, where is the effective dimension of the kernel and is the privacy parameter. The key…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Advanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques
