RPAF: A Reinforcement Prediction-Allocation Framework for Cache   Allocation in Large-Scale Recommender Systems

Shuo Su; Xiaoshuang Chen; Yao Wang; Yulin Wu; Ziqiang Zhang; Kaiqiao; Zhan; Ben Wang; Kun Gai

arXiv:2409.13175·cs.LG·September 23, 2024

RPAF: A Reinforcement Prediction-Allocation Framework for Cache Allocation in Large-Scale Recommender Systems

Shuo Su, Xiaoshuang Chen, Yao Wang, Yulin Wu, Ziqiang Zhang, Kaiqiao, Zhan, Ben Wang, Kun Gai

PDF

Open Access

TL;DR

This paper introduces RPAF, a reinforcement learning framework that optimizes cache allocation in large-scale recommender systems to enhance user engagement within limited computational resources.

Contribution

It proposes a novel two-stage reinforcement learning framework with prediction and allocation stages, addressing cache value-strategy dependency and streaming allocation challenges.

Findings

01

RPAF significantly improves user engagement under computational constraints.

02

The framework effectively balances real-time and cached recommendations.

03

The PoolRank algorithm enhances streaming allocation performance.

Abstract

Modern recommender systems are built upon computation-intensive infrastructure, and it is challenging to perform real-time computation for each request, especially in peak periods, due to the limited computational resources. Recommending by user-wise result caches is widely used when the system cannot afford a real-time recommendation. However, it is challenging to allocate real-time and cached recommendations to maximize the users' overall engagement. This paper shows two key challenges to cache allocation, i.e., the value-strategy dependency and the streaming allocation. Then, we propose a reinforcement prediction-allocation framework (RPAF) to address these issues. RPAF is a reinforcement-learning-based two-stage framework containing prediction and allocation stages. The prediction stage estimates the values of the cache choices considering the value-strategy dependency, and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Caching and Content Delivery · Advanced Wireless Network Optimization