Differential Privacy in Kernelized Contextual Bandits via Random Projections

Nikola Pavlovic; Sudeep Salgia; Qing Zhao

arXiv:2507.13639 · stat.ML · July 21, 2025

Open Access

TL;DR

This paper introduces a differentially private algorithm for kernelized contextual bandits that uses random projections to reduce sensitivity, achieving state-of-the-art regret bounds while preserving privacy.

Contribution

The paper proposes a novel private kernel-ridge regression estimator using private covariance estimation and random projections, enabling state-of-the-art privacy-preserving bandit performance.
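To make the ingredients named above concrete, here is a minimal sketch of a ridge regression that combines a Gaussian random projection with noise added to the regression statistics. It is not the paper's estimator: the random Fourier feature map, the function names (`make_feature_map`, `noisy_projected_ridge`), and the `noise_std` parameter are all assumptions introduced for illustration, and no attempt is made to calibrate the noise to a formal privacy guarantee.

```python
import numpy as np


def make_feature_map(dim, n_features, lengthscale, rng):
    """Return a random Fourier feature map approximating an RBF kernel.

    Assumption: a finite-dimensional stand-in for the kernel feature map.
    """
    W = rng.normal(scale=1.0 / lengthscale, size=(dim, n_features))
    b = rng.uniform(0.0, 2.0 * np.pi, size=n_features)

    def phi(X):
        return np.sqrt(2.0 / n_features) * np.cos(X @ W + b)

    return phi


def noisy_projected_ridge(Phi, y, proj_dim, reg, noise_std, rng):
    """Ridge regression on randomly projected features with noisy statistics.

    noise_std is a hypothetical knob; it is NOT calibrated to any
    (epsilon, delta) guarantee in this sketch.
    """
    n_features = Phi.shape[1]
    # Gaussian random projection down to proj_dim dimensions.
    P = rng.normal(size=(n_features, proj_dim)) / np.sqrt(proj_dim)
    Z = Phi @ P
    # Symmetrized Gaussian noise on the projected covariance matrix.
    E = rng.normal(scale=noise_std, size=(proj_dim, proj_dim))
    cov = Z.T @ Z + (E + E.T) / 2.0
    # Gaussian noise on the projected cross-moment vector.
    mom = Z.T @ y + rng.normal(scale=noise_std, size=proj_dim)
    theta = np.linalg.solve(cov + reg * np.eye(proj_dim), mom)
    return P, theta


# Usage on synthetic data (all values are illustrative).
rng = np.random.default_rng(0)
X = rng.uniform(-1.0, 1.0, size=(200, 2))
y = np.sin(3.0 * X[:, 0]) + 0.1 * rng.normal(size=200)
phi = make_feature_map(dim=2, n_features=100, lengthscale=0.5, rng=rng)
P, theta = noisy_projected_ridge(phi(X), y, proj_dim=20, reg=1.0,
                                 noise_std=0.1, rng=rng)
predictions = phi(X) @ P @ theta
```

Working in the projected space means noise is added to a `proj_dim`-sized covariance and moment vector rather than to statistics of the full feature dimension, which is one intuitive reading of why projection can help; the paper's actual construction and analysis may differ.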

Findings

01

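Achieves cumulative regret bounds of $\widetilde{\mathcal{O}}(\sqrt{\gamma_T T} + \frac{\gamma_T}{\varepsilon_{DP}})$ in the joint model and $\widetilde{\mathcal{O}}(\sqrt{\gamma_T T} + \frac{\gamma_T \sqrt{T}}{\varepsilon_{DP}})$ in the local model of differential privacy.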

02

State-of-the-art performance guarantees in both the joint and local models of differential privacy.

Abstract

We consider the problem of contextual kernel bandits with stochastic contexts, where the underlying reward function belongs to a known Reproducing Kernel Hilbert Space. We study this problem under an additional constraint of Differential Privacy, where the agent needs to ensure that the sequence of query points is differentially private with respect to both the sequence of contexts and rewards. We propose a novel algorithm that achieves the state-of-the-art cumulative regret of $\widetilde{\mathcal{O}}(\sqrt{\gamma_T T} + \frac{\gamma_T}{\varepsilon_{DP}})$ and $\widetilde{\mathcal{O}}(\sqrt{\gamma_T T} + \frac{\gamma_T \sqrt{T}}{\varepsilon_{DP}})$ over a time horizon of $T$ in the joint and local models of differential privacy, respectively, where $\gamma_T$ is the effective dimension of the kernel and $\varepsilon_{DP} > 0$ is the privacy parameter. The key…
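As a rough illustration of how the two privacy models scale, the sketch below evaluates the leading-order terms of the regret bounds, reading them as $\sqrt{\gamma_T T} + \gamma_T / \varepsilon_{DP}$ (joint) and $\sqrt{\gamma_T T} + \gamma_T \sqrt{T} / \varepsilon_{DP}$ (local). Constants and logarithmic factors are dropped, the helper name `regret_terms` is hypothetical, and the numeric inputs are made up; in practice $\gamma_T$ itself grows with $T$ in a kernel-dependent way.

```python
import math


def regret_terms(gamma_T, T, eps_dp, model="joint"):
    """Leading-order terms of the stated bounds (constants and logs dropped).

    Returns (non_private_term, privacy_cost_term).
    """
    non_private = math.sqrt(gamma_T * T)
    if model == "joint":
        privacy_cost = gamma_T / eps_dp
    elif model == "local":
        privacy_cost = gamma_T * math.sqrt(T) / eps_dp
    else:
        raise ValueError("model must be 'joint' or 'local'")
    return non_private, privacy_cost


# Illustrative values only: gamma_T = 4, T = 10_000, eps_dp = 0.5.
joint = regret_terms(4.0, 10_000, 0.5, model="joint")
local = regret_terms(4.0, 10_000, 0.5, model="local")
```

Under this reading, the privacy cost in the joint model does not grow with $\sqrt{T}$ beyond its dependence on $\gamma_T$, whereas in the local model it carries an extra $\sqrt{T}$ factor and can dominate the non-private term when $\varepsilon_{DP}$ is small.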

Taxonomy

Topics: Privacy-Preserving Technologies in Data · Advanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques