(Private) Kernelized Bandits with Distributed Biased Feedback
Fengjiao Li, Xingyu Zhou, and Bo Ji

TL;DR
This paper introduces a distributed kernelized bandit algorithm that effectively manages biased feedback from a subset of users, reducing communication and computation costs while providing privacy guarantees and achieving sublinear regret.
Contribution
The paper proposes the DPBE algorithm for distributed kernelized bandits with biased feedback, incorporating privacy models and demonstrating improved efficiency and theoretical regret bounds.
Findings
Achieves sublinear regret of O(T^{1-rac{eta}{2}} + \u221A{rac{ ext{gamma}_T T}{}})
Reduces communication and computation costs compared to existing algorithms
Provides privacy guarantees under various differential privacy models
Abstract
In this paper, we study kernelized bandits with distributed biased feedback. This problem is motivated by several real-world applications (such as dynamic pricing, cellular network configuration, and policy making), where users from a large population contribute to the reward of the action chosen by a central entity, but it is difficult to collect feedback from all users. Instead, only biased feedback (due to user heterogeneity) from a subset of users may be available. In addition to such partial biased feedback, we are also faced with two practical challenges due to communication cost and computation complexity. To tackle these challenges, we carefully design a new \emph{distributed phase-then-batch-based elimination (\texttt{DPBE})} algorithm, which samples users in phases for collecting feedback to reduce the bias and employs \emph{maximum variance reduction} to select actions in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Privacy-Preserving Technologies in Data · Age of Information Optimization
