No-regret Learning in Repeated First-Price Auctions with Budget Constraints
Rui Ai, Chang Wang, Chenchen Li, Jinshan Zhang, Wenhan Huang, Xiaotie, Deng

TL;DR
This paper studies online bidding strategies in repeated first-price auctions with budget constraints, proposing algorithms that achieve sublinear regret bounds and extend to general utility functions.
Contribution
It introduces RL-based algorithms for budget-constrained bidders in first-price auctions with provable regret guarantees, addressing a key open problem.
Findings
Achieves (\u221a T) regret with full bid visibility.
Achieves (T^{7/12}) regret with limited bid information.
Extends analysis to general bounded utility functions.
Abstract
Recently the online advertising market has exhibited a gradual shift from second-price auctions to first-price auctions. Although there has been a line of works concerning online bidding strategies in first-price auctions, it still remains open how to handle budget constraints in the problem. In the present paper, we initiate the study for a buyer with budgets to learn online bidding strategies in repeated first-price auctions. We propose an RL-based bidding algorithm against the optimal non-anticipating strategy under stationary competition. Our algorithm obtains -regret if the bids are all revealed at the end of each round. With the restriction that the buyer only sees the winning bid after each round, our modified algorithm obtains -regret by techniques developed from survival analysis. Our analysis extends to the more general…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Smart Grid Energy Management
