Data Efficiency for Large Recommendation Models

Kshitij Jain; Jingru Xie; Kevin Regan; Cheng Chen; Jie Han; Steve Li,; Zhuoshu Li; Todd Phillips; Myles Sussman; Matt Troup; Angel Yu; Jia Zhuo

arXiv:2410.18111·cs.IR·October 28, 2024

Data Efficiency for Large Recommendation Models

Kshitij Jain, Jingru Xie, Kevin Regan, Cheng Chen, Jie Han, Steve Li,, Zhuoshu Li, Todd Phillips, Myles Sussman, Matt Troup, Angel Yu, Jia Zhuo

PDF

Open Access

TL;DR

This paper provides principles and frameworks to optimize training data efficiency for large recommendation models, reducing costs and improving R&D velocity in high-scale online advertising systems.

Contribution

It introduces data convergence concepts and methods to accelerate convergence, guiding practitioners to balance data volume and model size effectively.

Findings

01

Strategies successfully deployed in Google's Ads CTR models

02

Frameworks applicable beyond large recommendation models

03

Guidelines for balancing data volume and model complexity

Abstract

Large recommendation models (LRMs) are fundamental to the multi-billion dollar online advertising industry, processing massive datasets of hundreds of billions of examples before transitioning to continuous online training to adapt to rapidly changing user behavior. The massive scale of data directly impacts both computational costs and the speed at which new methods can be evaluated (R&D velocity). This paper presents actionable principles and high-level frameworks to guide practitioners in optimizing training data requirements. These strategies have been successfully deployed in Google's largest Ads CTR prediction models and are broadly applicable beyond LRMs. We outline the concept of data convergence, describe methods to accelerate this convergence, and finally, detail how to optimally balance training data volume with model size.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings