Optimal Algorithm for Bayesian Incentive-Compatible Exploration
Lee Cohen, Yishay Mansour

TL;DR
This paper presents an optimal algorithm for a social planner to guide myopic selfish agents in exploration tasks, ensuring Bayesian incentive compatibility without monetary incentives, with a focus on deterministic actions with limited support.
Contribution
It introduces the first optimal protocol for Bayesian incentive-compatible exploration in deterministic settings with limited support, featuring correlated randomization and a structured exploration order.
Findings
The protocol always explores more beneficial actions first.
Randomization is correlated across agents and actions.
Progress advances the understanding of incentive-compatible exploration algorithms.
Abstract
We consider a social planner faced with a stream of myopic selfish agents. The goal of the social planner is to maximize the social welfare, however, it is limited to using only information asymmetry (regarding previous outcomes) and cannot use any monetary incentives. The planner recommends actions to agents, but her recommendations need to be Bayesian Incentive Compatible to be followed by the agents. Our main result is an optimal algorithm for the planner, in the case that the actions realizations are deterministic and have limited support, making significant important progress on this open problem. Our optimal protocol has two interesting features. First, it always completes the exploration of a priori more beneficial actions before exploring a priori less beneficial actions. Second, the randomization in the protocol is correlated across agents and actions (and not independent at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
