Adaptive Bidding Policies for First-Price Auctions with Budget Constraints under Non-stationarity
Yige Wang, Jiashuo Jiang

TL;DR
This paper develops an adaptive bidding policy for first-price auctions with budget constraints, analyzing its performance under non-stationary conditions and different information settings, with regret bounds of order ~O(sqrt(T)).
Contribution
It introduces a dual-gradient-descent bidding policy that adapts to non-stationarity and different information scenarios, providing near-optimal regret guarantees.
Findings
Regret is ~O(sqrt(T)) plus a Wasserstein variation term in uninformative setting.
Predictions of budget allocation can eliminate the variation term, maintaining ~O(sqrt(T)) regret.
Refined per-period budget plans achieve exactly ~O(sqrt(T)) regret.
Abstract
In this paper, we study how a budget-constrained bidder should learn to bid adaptively in repeated first-price auctions to maximize cumulative payoff. This problem arises from the recent industry-wide shift from second-price auctions to first-price auctions in display advertising, which renders truthful bidding suboptimal. We propose a simple dual-gradient-descent-based bidding policy that maintains a dual variable for the budget constraint as the bidder consumes the budget. We analyze two settings based on the bidder's knowledge of future private values: (i) an uninformative setting where all distributional knowledge (potentially non-stationary) is entirely unknown, and (ii) an informative setting where a prediction of budget allocation is available in advance. We characterize the performance loss (regret) relative to an optimal policy with complete information. For uninformative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
