Anonymous Bandits for Multi-User Systems

Hossein Esfandiari; Vahab Mirrokni; Jon Schneider

arXiv:2210.12198·cs.LG·October 25, 2022

Anonymous Bandits for Multi-User Systems

Hossein Esfandiari, Vahab Mirrokni, Jon Schneider

PDF

Open Access 1 Video

TL;DR

This paper introduces anonymous bandits, a new online learning framework for multi-user systems that ensures user privacy through k-anonymity, and provides the first algorithms with sublinear regret bounds.

Contribution

It extends bandit algorithms to satisfy k-anonymity constraints, enabling online user clustering without individual user data, and establishes regret bounds for this setting.

Findings

01

First sublinear regret algorithms for anonymous bandits

02

Lower bounds demonstrating the difficulty of the problem

03

Framework enables privacy-preserving online learning

Abstract

In this work, we present and study a new framework for online learning in systems with multiple users that provide user anonymity. Specifically, we extend the notion of bandits to obey the standard $k$ -anonymity constraint by requiring each observation to be an aggregation of rewards for at least $k$ users. This provides a simple yet effective framework where one can learn a clustering of users in an online fashion without observing any user's individual decision. We initiate the study of anonymous bandits and provide the first sublinear regret algorithms and lower bounds for this setting.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Anonymous Bandits for Multi-User Systems· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Privacy-Preserving Technologies in Data · Mobile Crowdsensing and Crowdsourcing