(Private) Kernelized Bandits with Distributed Biased Feedback

Fengjiao Li; Xingyu Zhou; and Bo Ji

arXiv:2301.12061·cs.LG·February 8, 2023

(Private) Kernelized Bandits with Distributed Biased Feedback

Fengjiao Li, Xingyu Zhou, and Bo Ji

PDF

Open Access

TL;DR

This paper introduces a distributed kernelized bandit algorithm that effectively manages biased feedback from a subset of users, reducing communication and computation costs while providing privacy guarantees and achieving sublinear regret.

Contribution

The paper proposes the DPBE algorithm for distributed kernelized bandits with biased feedback, incorporating privacy models and demonstrating improved efficiency and theoretical regret bounds.

Findings

01

Achieves sublinear regret of O(T^{1-rac{eta}{2}} + \u221A{rac{ ext{gamma}_T T}{}})

02

Reduces communication and computation costs compared to existing algorithms

03

Provides privacy guarantees under various differential privacy models

Abstract

In this paper, we study kernelized bandits with distributed biased feedback. This problem is motivated by several real-world applications (such as dynamic pricing, cellular network configuration, and policy making), where users from a large population contribute to the reward of the action chosen by a central entity, but it is difficult to collect feedback from all users. Instead, only biased feedback (due to user heterogeneity) from a subset of users may be available. In addition to such partial biased feedback, we are also faced with two practical challenges due to communication cost and computation complexity. To tackle these challenges, we carefully design a new \emph{distributed phase-then-batch-based elimination (\texttt{DPBE})} algorithm, which samples users in phases for collecting feedback to reduce the bias and employs \emph{maximum variance reduction} to select actions in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Privacy-Preserving Technologies in Data · Age of Information Optimization