Stochastic Bandits for Multi-platform Budget Optimization in Online Advertising
Vashist Avadhanula, Riccardo Colini-Baldeschi, Stefano Leonardi,, Karthik Abinav Sankararaman, Okke Schrijvers

TL;DR
This paper develops and analyzes algorithms for multi-platform online advertising budget optimization modeled as a stochastic bandits with knapsacks problem, providing regret bounds and empirical validation.
Contribution
It extends existing bandit algorithms to multi-platform bidding, deriving regret bounds for both discrete and continuous bid spaces, and validates the approach with real-world data.
Findings
Algorithms outperform benchmarks on real-world data
Regret bounds are nearly tight with established lower bounds
Improved regret bounds for specific regimes of the problem
Abstract
We study the problem of an online advertising system that wants to optimally spend an advertiser's given budget for a campaign across multiple platforms, without knowing the value for showing an ad to the users on those platforms. We model this challenging practical application as a Stochastic Bandits with Knapsacks problem over rounds of bidding with the set of arms given by the set of distinct bidding -tuples, where is the number of platforms. We modify the algorithm proposed in Badanidiyuru \emph{et al.,} to extend it to the case of multiple platforms to obtain an algorithm for both the discrete and continuous bid-spaces. Namely, for discrete bid spaces we give an algorithm with regret , where is the performance of the optimal algorithm that knows the distributions. For continuous bid spaces the regret of our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
