DP-NCB: Privacy Preserving Fair Bandits
Dhruv Sarkar, Nishant Pandey, Sayak Ray Chowdhury

TL;DR
This paper introduces DP-NCB, a novel algorithm that ensures both differential privacy and fairness in multi-armed bandit problems, addressing a critical gap in socially sensitive decision-making applications.
Contribution
The paper presents DP-NCB, the first unified framework achieving both differential privacy and Nash fairness in bandit algorithms with theoretical guarantees and practical effectiveness.
Findings
DP-NCB achieves order-optimal Nash regret under privacy constraints.
The framework works under both global and local privacy models.
Simulations show DP-NCB outperforms existing baselines in fairness and privacy.
Abstract
Multi-armed bandit algorithms are fundamental tools for sequential decision-making under uncertainty, with widespread applications across domains such as clinical trials and personalized decision-making. As bandit algorithms are increasingly deployed in these socially sensitive settings, it becomes critical to protect user data privacy and ensure fair treatment across decision rounds. While prior work has independently addressed privacy and fairness in bandit settings, the question of whether both objectives can be achieved simultaneously has remained largely open. Existing privacy-preserving bandit algorithms typically optimize average regret, a utilitarian measure, whereas fairness-aware approaches focus on minimizing Nash regret, which penalizes inequitable reward distributions, but often disregard privacy concerns. To bridge this gap, we introduce Differentially Private Nash…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
