Connecting Thompson Sampling and UCB: Towards More Efficient Trade-offs Between Privacy and Regret

Bingshan Hu; Zhiming Huang; Tianyue H. Zhang; Mathias L\'ecuyer; Nidhi Hegde

arXiv:2505.02383·cs.LG·June 6, 2025

Connecting Thompson Sampling and UCB: Towards More Efficient Trade-offs Between Privacy and Regret

Bingshan Hu, Zhiming Huang, Tianyue H. Zhang, Mathias L\'ecuyer, Nidhi Hegde

PDF

Open Access 1 Video

TL;DR

This paper introduces DP-TS-UCB, a private bandit algorithm that balances privacy and regret by leveraging Gaussian mechanisms and exploring their connections, with theoretical guarantees.

Contribution

It proposes a novel parametrized private bandit algorithm connecting Thompson Sampling and UCB, enabling adjustable privacy-regret trade-offs with theoretical analysis.

Findings

01

Achieves a privacy-regret trade-off controlled by parameter α

02

Provides regret bounds of O(K log^{α+1}(T)/Δ)

03

Establishes links between exploration mechanisms in Thompson Sampling and UCB

Abstract

We address differentially private stochastic bandit problems from the angles of exploring the deep connections among Thompson Sampling with Gaussian priors, Gaussian mechanisms, and Gaussian differential privacy (GDP). We propose DP-TS-UCB, a novel parametrized private bandit algorithm that enables to trade off privacy and regret. DP-TS-UCB satisfies $\tilde{O} (T^{0.25 (1 - α)})$ -GDP and enjoys an $O (K ln^{α + 1} (T) /Δ)$ regret bound, where $α \in [0, 1]$ controls the trade-off between privacy and regret. Theoretically, our DP-TS-UCB relies on anti-concentration bounds of Gaussian distributions and links exploration mechanisms in Thompson Sampling-based algorithms and Upper Confidence Bound-based algorithms, which may be of independent interest.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Connecting Thompson Sampling and UCB: Towards More Efficient Trade-offs Between Privacy and Regret· slideslive

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Ethics in Clinical Research