Multi-Receiver Online Bayesian Persuasion
Matteo Castiglioni, Alberto Marchesi, Andrea Celli, Nicola Gatti

TL;DR
This paper explores online Bayesian persuasion with multiple receivers, establishing computational limits and designing polynomial-time algorithms for submodular utility functions to minimize regret.
Contribution
It introduces the first online Bayesian persuasion model with multiple receivers and develops polynomial-time algorithms for submodular utilities, while proving hardness results for other utility types.
Findings
No polynomial-time no-α-regret algorithms for supermodular or anonymous utilities.
A polynomial-time no-(1 - 1/e)-regret algorithm exists for submodular utilities.
A general online gradient descent scheme with an efficient projection oracle is proposed.
Abstract
Bayesian persuasion studies how an informed sender should partially disclose information to influence the behavior of a self-interested receiver. Classical models make the stringent assumption that the sender knows the receiver's utility. This can be relaxed by considering an online learning framework in which the sender repeatedly faces a receiver of an unknown, adversarially selected type. We study, for the first time, an online Bayesian persuasion setting with multiple receivers. We focus on the case with no externalities and binary actions, as customary in offline models. Our goal is to design no-regret algorithms for the sender with polynomial per-iteration running time. First, we prove a negative result: for any , there is no polynomial-time no--regret algorithm when the sender's utility function is supermodular or anonymous. Then, we focus on the case…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Misinformation and Its Impacts
