Bandit Sequential Posted Pricing via Half-Concavity
Sahil Singla, Yifan Wang

TL;DR
This paper develops bandit learning algorithms for sequential posted pricing with multiple buyers, achieving nearly optimal regret bounds by leveraging a new half-concavity property of the revenue function.
Contribution
It introduces a novel half-concavity property of the revenue function and provides nearly optimal regret bounds for multi-buyer posted pricing in the bandit setting.
Findings
Achieves ilde{O}( ext{poly}(n)\sqrt{T}) regret for regular distributions.
Achieves ilde{O}( ext{poly}(n)\ T^{2/3}) regret for general distributions.
First known regret bounds for multi-buyer posted pricing with regular distributions.
Abstract
Sequential posted pricing auctions are popular because of their simplicity in practice and their tractability in theory. A usual assumption in their study is that the Bayesian prior distributions of the buyers are known to the seller, while in reality these priors can only be accessed from historical data. To overcome this assumption, we study sequential posted pricing in the bandit learning model, where the seller interacts with buyers over rounds: In each round the seller posts prices for the buyers and the first buyer with a valuation higher than the price takes the item. The only feedback that the seller receives in each round is the revenue. Our main results obtain nearly-optimal regret bounds for single-item sequential posted pricing in the bandit learning model. In particular, we achieve an regret for buyers with (Myerson's)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Machine Learning and Algorithms
