Online learning in repeated auctions

Jonathan Weed; Vianney Perchet; Philippe Rigollet

arXiv:1511.05720·cs.GT·November 19, 2015·38 cites


Open Access

TL;DR

This paper develops online learning strategies for repeated Vickrey auctions, enabling bidders to adapt their bids over time with provable regret bounds in both stochastic and adversarial settings.

Contribution

To the authors' knowledge, it gives the first complete set of bidding strategies for repeated Vickrey auctions with bandit feedback, covering both the stochastic and the adversarial model.

Findings

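01 Logarithmic regret in the stochastic model, when competing against well-behaved adversaries

02 Sublinear regret in the adversarial model, measured against the best fixed bid in hindsight

03 Matching minimax lower bounds established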

Abstract

Motivated by online advertising auctions, we consider repeated Vickrey auctions where goods of unknown value are sold sequentially and bidders only learn (potentially noisy) information about a good's value once it is purchased. We adopt an online learning approach with bandit feedback to model this problem and derive bidding strategies for two models: stochastic and adversarial. In the stochastic model, the observed values of the goods are random variables centered around the true value of the good. In this case, logarithmic regret is achievable when competing against well-behaved adversaries. In the adversarial model, the goods need not be identical and we simply compare our performance against that of the best fixed bid in hindsight. We show that sublinear regret is also achievable in this case and prove matching minimax lower bounds. To our knowledge, this is the first complete set…
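The stochastic setting described in the abstract can be illustrated with a small simulation. The sketch below is not taken from the paper: it assumes a fixed true value, Bernoulli observations, a highest competing bid drawn uniformly from [0, 1], and an optimistic (UCB-style) bid equal to the empirical mean plus a confidence bonus. The function name simulate_ucb_bidder and all parameters are hypothetical. It shows the two features the abstract emphasizes: the second-price payment rule, and bandit feedback, where a noisy value is observed only after winning.

```python
import math
import random


def simulate_ucb_bidder(true_value=0.6, horizon=5000, seed=0):
    """Simulate one bidder in a repeated Vickrey (second-price) auction.

    Assumptions (illustrative, not from the paper): the good's value is
    fixed at `true_value`, observations are Bernoulli(true_value), and the
    highest competing bid is uniform on [0, 1) in every round.
    """
    rng = random.Random(seed)
    wins = 0          # number of auctions won so far
    value_sum = 0.0   # sum of noisy values observed after wins
    regret = 0.0      # cumulative regret against always bidding true_value

    for t in range(1, horizon + 1):
        if wins == 0:
            bid = 1.0  # nothing observed yet: bid the maximum to explore
        else:
            mean = value_sum / wins
            bonus = math.sqrt(2.0 * math.log(t) / wins)
            bid = min(1.0, mean + bonus)  # optimistic estimate of the value

        rival = rng.random()  # highest competing bid this round

        # Second-price rule: the winner pays the rival bid, so the expected
        # gain from winning is true_value - rival. Bidding true_value is the
        # benchmark: it wins exactly when that gain is positive.
        oracle_gain = max(true_value - rival, 0.0)
        my_gain = (true_value - rival) if bid > rival else 0.0
        regret += oracle_gain - my_gain

        if bid > rival:
            # Bandit feedback: a noisy value is revealed only after a win.
            observation = 1.0 if rng.random() < true_value else 0.0
            wins += 1
            value_sum += observation

    estimate = value_sum / wins if wins else float("nan")
    return regret, wins, estimate


if __name__ == "__main__":
    regret, wins, estimate = simulate_ucb_bidder()
    print(f"regret={regret:.2f} wins={wins} estimate={estimate:.3f}")
```

Overbidding is what makes learning possible here: a bidder who bids too low never wins and therefore never observes the value, so an optimistic bid trades a small expected loss for information.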


Taxonomy

Topics: Advanced Bandit Algorithms Research · Auction Theory and Applications · Mobile Crowdsensing and Crowdsourcing