Fast Rate Learning in Stochastic First Price Bidding
Juliette Achddou (DI-ENS, VALDA ), Olivier Capp\'e (VALDA, DI-ENS),, Aur\'elien Garivier (UMPA-ENSL)

TL;DR
This paper introduces new algorithms for first-price auction bidding that significantly reduce regret in stochastic environments, outperforming existing methods and providing practical convergence guarantees.
Contribution
It presents novel algorithms achieving near-logarithmic regret when the opponent's bid distribution is known and polynomial regret when it must be learned, with new theoretical insights into utility behavior.
Findings
Algorithms outperform existing methods in simulations.
Regret can be as low as log^2(T) with known distribution.
Convergence is faster than previous approaches.
Abstract
First-price auctions have largely replaced traditional bidding approaches based on Vickrey auctions in programmatic advertising. As far as learning is concerned, first-price auctions are more challenging because the optimal bidding strategy does not only depend on the value of the item but also requires some knowledge of the other bids. They have already given rise to several works in sequential learning, many of which consider models for which the value of the buyer or the opponents' maximal bid is chosen in an adversarial manner. Even in the simplest settings, this gives rise to algorithms whose regret grows as with respect to the time horizon . Focusing on the case where the buyer plays against a stationary stochastic environment, we show how to achieve significantly lower regret: when the opponents' maximal bid distribution is known we provide an algorithm whose regret…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Machine Learning and Algorithms
