Characterizing Truthful Multi-Armed Bandit Mechanisms
Moshe Babaioff, Yogeshwer Sharma, Aleksandrs Slivkins

TL;DR
This paper studies truthful mechanisms in multi-armed bandit settings for online advertising, revealing structural constraints and higher regret compared to non-strategic algorithms, and proposes mechanisms that approach theoretical regret bounds.
Contribution
It characterizes the structural properties of truthful bandit mechanisms and provides a near-optimal mechanism with regret close to the lower bound.
Findings
Truthful mechanisms must separate exploration from exploitation.
Such mechanisms incur higher regret than non-strategic algorithms.
A mechanism is proposed that nearly matches the regret lower bound.
Abstract
We consider a multi-round auction setting motivated by pay-per-click auctions for Internet advertising. In each round the auctioneer selects an advertiser and shows her ad, which is then either clicked or not. An advertiser derives value from clicks; the value of a click is her private information. Initially, neither the auctioneer nor the advertisers have any information about the likelihood of clicks on the advertisements. The auctioneer's goal is to design a (dominant strategies) truthful mechanism that (approximately) maximizes the social welfare. If the advertisers bid their true private values, our problem is equivalent to the "multi-armed bandit problem", and thus can be viewed as a strategic version of the latter. In particular, for both problems the quality of an algorithm can be characterized by "regret", the difference in social welfare between the algorithm and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Optimization and Search Problems
