A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design
Rui Ai, Boxiang Lyu, Zhaoran Wang, Zhuoran Yang, Michael I. Jordan

TL;DR
This paper introduces a novel reinforcement learning-based mechanism for multi-phase second-price auctions that effectively handles untruthful bidding, unknown market noise, and nonlinear revenue, achieving low regret bounds.
Contribution
The paper develops the CLUB algorithm, combining buffer periods and an extended LSVI-UCB, to optimize reserve prices in complex auction settings with multiple challenges.
Findings
Achieves $ ilde{O}(H^{5/2}\sqrt{K})$ regret with known noise
Achieves $ ilde{O}(H^{3}\sqrt{K})$ regret with unknown noise
Effectively incentivizes truthful bidding despite strategic manipulation
Abstract
We study reserve price optimization in multi-phase second price auctions, where the seller's prior actions affect the bidders' later valuations through a Markov Decision Process (MDP). Compared to the bandit setting in existing works, the setting in ours involves three challenges. First, from the seller's perspective, we need to efficiently explore the environment in the presence of potentially untruthful bidders who aim to manipulate the seller's policy. Second, we want to minimize the seller's revenue regret when the market noise distribution is unknown. Third, the seller's per-step revenue is an unknown, nonlinear random variable, and cannot even be directly observed from the environment but realized values. We propose a mechanism addressing all three challenges. To address the first challenge, we use a combination of a new technique named "buffer periods" and inspirations from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Advanced Bandit Algorithms Research · Smart Grid Energy Management
