ERCache: An Efficient and Reliable Caching Framework for Large-Scale User Representations in Meta's Ads System
Fang Zhou, Yaning Huang, Dong Liang, Dai Li, Zhongke Zhang, Kai Wang,, Xiao Xin, Abdallah Aboelela, Zheliang Jiang, Yang Wang, Jeff Song, Wei Zhang,, Chen Liang, Huayu Li, ChongLin Sun, Hang Yang, Lei Qu, Zhan Shu, Mindi Yuan,, Emanuele Maccherani, Taha Hayat, John Guo

TL;DR
ERCache is a caching framework designed to efficiently manage large-scale user representations in Meta's ad system, balancing model complexity, freshness, and SLAs, leading to resource savings and reliable service.
Contribution
The paper introduces ERCache, a novel caching framework that effectively handles user representations at scale, considering staleness and service requirements, and has been successfully deployed at Meta.
Findings
Supports over 30 ranking models with efficient resource use
Balances model complexity, freshness, and SLAs effectively
Deployed at Meta for over six months with positive results
Abstract
The increasing complexity of deep learning models used for calculating user representations presents significant challenges, particularly with limited computational resources and strict service-level agreements (SLAs). Previous research efforts have focused on optimizing model inference but have overlooked a critical question: is it necessary to perform user model inference for every ad request in large-scale social networks? To address this question and these challenges, we first analyze user access patterns at Meta and find that most user model inferences occur within a short timeframe. T his observation reveals a triangular relationship among model complexity, embedding freshness, and service SLAs. Building on this insight, we designed, implemented, and evaluated ERCache, an efficient and robust caching framework for large-scale user representations in ads recommendation systems on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Digital Marketing and Social Media · Human Mobility and Location-Based Analysis
Methodstravel james
