Online Allocation and Learning in the Presence of Strategic Agents
Steven Yin, Shipra Agrawal, Assaf Zeevi

TL;DR
This paper develops an online allocation mechanism for sequentially arriving items among strategic agents with unknown valuation distributions, ensuring approximate incentive compatibility and sublinear regret in utility compared to optimal offline policies.
Contribution
It introduces a novel online learning-based allocation mechanism that accounts for strategic behavior and unknown valuation distributions, bridging online auction design and learning.
Findings
Mechanism is approximately Bayesian incentive compatible.
Guarantees sublinear regret for agents' utility.
Works with unknown valuation distributions and strategic agents.
Abstract
We study the problem of allocating sequentially arriving items among homogeneous agents under the constraint that each agent must receive a pre-specified fraction of all items, with the objective of maximizing the agents' total valuation of items allocated to them. The agents' valuations for the item in each round are assumed to be i.i.d. but their distribution is a priori unknown to the central planner. Therefore, the central planner needs to implicitly learn these distributions from the observed values in order to pick a good allocation policy. However, an added challenge here is that the agents are strategic with incentives to misreport their valuations in order to receive better allocations. This sets our work apart both from the online auction design settings which typically assume known valuation distributions and/or involve payments, and from the online learning settings…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAuction Theory and Applications · Advanced Bandit Algorithms Research · Consumer Market Behavior and Pricing
