Online and Scalable Model Selection with Multi-Armed Bandits
Jiayi Xie, Michael Tashman, John Hoffman, Lee Winikor, Rouzbeh Gerami

TL;DR
This paper introduces AMS, an online scalable system using Multi-Armed Bandits for real-time selection of bidding models in RTB, improving campaign performance without relying on offline data.
Contribution
The paper presents AMS, a novel online model selection system that dynamically allocates traffic to models based on real-world metrics, addressing non-stationarity and offline data limitations.
Findings
AMS effectively improves ad campaign performance in live tests.
The system enables safe, real-time introduction of new models.
Traffic allocation adapts to model performance, minimizing risks.
Abstract
Many online applications running on live traffic are powered by machine learning models, for which training, validation, and hyper-parameter tuning are conducted on historical data. However, it is common for models demonstrating strong performance in offline analysis to yield poorer performance when deployed online. This problem is a consequence of the difficulty of training on historical data in non-stationary environments. Moreover, the machine learning metrics used for model selection may not sufficiently correlate with real-world business metrics used to determine the success of the applications being tested. These problems are particularly prominent in the Real-Time Bidding (RTB) domain, in which ML models power bidding strategies, and a change in models will likely affect performance of the advertising campaigns. In this work, we present Automatic Model Selector (AMS), a system…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Mobile Crowdsensing and Crowdsourcing
