Learning to Persuade on the Fly: Robustness Against Ignorance
You Zu, Krishnamurthy Iyer, Haifeng Xu

TL;DR
This paper develops an online learning algorithm for persuasion in repeated settings with unknown distributions, ensuring robustness and low regret, advancing the understanding of adaptive persuasion strategies under uncertainty.
Contribution
It introduces a robust persuasion algorithm that learns the unknown distribution on the fly, achieving near-optimal regret bounds in a repeated persuasion setting.
Findings
Achieves $O( oot T ext{log} T)$ regret with high probability.
Maintains a set of candidate distributions for robust persuasion.
Proves regret bounds are tight up to logarithmic factors.
Abstract
Motivated by information sharing in online platforms, we study repeated persuasion between a sender and a stream of receivers where at each time, the sender observes a payoff-relevant state drawn independently and identically from an unknown distribution, and shares state information with the receivers who each choose an action. The sender seeks to persuade the receivers into taking actions aligned with the sender's preference by selectively sharing state information. However, in contrast to the standard models, neither the sender nor the receivers know the distribution, and the sender has to persuade while learning the distribution on the fly. We study the sender's learning problem of making persuasive action recommendations to achieve low regret against the optimal persuasion mechanism with the knowledge of the distribution. To do this, we first propose and motivate a persuasiveness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Misinformation and Its Impacts · Mobile Crowdsensing and Crowdsourcing
