Online Matching: A Real-time Bandit System for Large-scale Recommendations
Xinyang Yi, Shao-Chuan Wang, Ruining He, Hariharan Chandrasekaran,, Charles Wu, Lukasz Heldt, Lichan Hong, Minmin Chen, Ed H. Chi

TL;DR
This paper introduces Online Matching, a scalable real-time bandit system for large-scale recommender systems, combining offline and online learning to improve content discovery and user engagement.
Contribution
It proposes a hybrid offline-online framework and a novel Diag-LinUCB algorithm for scalable, real-time bandit updates in large-scale recommender systems.
Findings
Enhanced content discovery in YouTube live experiments
Improved exploration of new items and user interests
Scalable distributed bandit updates achieved
Abstract
The last decade has witnessed many successes of deep learning-based models for industry-scale recommender systems. These models are typically trained offline in a batch manner. While being effective in capturing users' past interactions with recommendation platforms, batch learning suffers from long model-update latency and is vulnerable to system biases, making it hard to adapt to distribution shift and explore new items or user interests. Although online learning-based approaches (e.g., multi-armed bandits) have demonstrated promising theoretical results in tackling these challenges, their practical real-time implementation in large-scale recommender systems remains limited. First, the scalability of online approaches in servicing a massive online traffic while ensuring timely updates of bandit parameters poses a significant challenge. Additionally, exploring uncertainty in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Recommender Systems and Techniques · Data Stream Mining Techniques
