RPAF: A Reinforcement Prediction-Allocation Framework for Cache Allocation in Large-Scale Recommender Systems
Shuo Su, Xiaoshuang Chen, Yao Wang, Yulin Wu, Ziqiang Zhang, Kaiqiao, Zhan, Ben Wang, Kun Gai

TL;DR
This paper introduces RPAF, a reinforcement learning framework that optimizes cache allocation in large-scale recommender systems to enhance user engagement within limited computational resources.
Contribution
It proposes a novel two-stage reinforcement learning framework with prediction and allocation stages, addressing cache value-strategy dependency and streaming allocation challenges.
Findings
RPAF significantly improves user engagement under computational constraints.
The framework effectively balances real-time and cached recommendations.
The PoolRank algorithm enhances streaming allocation performance.
Abstract
Modern recommender systems are built upon computation-intensive infrastructure, and it is challenging to perform real-time computation for each request, especially in peak periods, due to the limited computational resources. Recommending by user-wise result caches is widely used when the system cannot afford a real-time recommendation. However, it is challenging to allocate real-time and cached recommendations to maximize the users' overall engagement. This paper shows two key challenges to cache allocation, i.e., the value-strategy dependency and the streaming allocation. Then, we propose a reinforcement prediction-allocation framework (RPAF) to address these issues. RPAF is a reinforcement-learning-based two-stage framework containing prediction and allocation stages. The prediction stage estimates the values of the cache choices considering the value-strategy dependency, and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Caching and Content Delivery · Advanced Wireless Network Optimization
