MSN: A Memory-based Sparse Activation Scaling Framework for Large-scale Industrial Recommendation
Shikang Wu, Hui Lu, Jinqiu Jin, Zheng Chai, Shiyong Hong, Junjie Zhang, Shanlei Mu, Kaiyuan Ma, Tianyi Liu, Yuchao Zheng, Zhe Wang, Jingjian Lin

TL;DR
MSN introduces a memory-based sparse activation framework for large-scale recommendation systems, enhancing personalization and efficiency by dynamically retrieving representations from a large memory while controlling computational costs.
Contribution
The paper proposes MSN, a novel framework that integrates large parameterized memory with sparse activation and efficient retrieval mechanisms for improved industrial recommendation models.
Findings
MSN improves recommendation accuracy over state-of-the-art models.
MSN maintains low computational overhead suitable for industrial deployment.
MSN achieves significant online performance gains in Douyin Search Ranking.
Abstract
Scaling deep learning recommendation models is an effective way to improve model expressiveness. Existing approaches often incur substantial computational overhead, making them difficult to deploy in large-scale industrial systems under strict latency constraints. Recent sparse activation scaling methods, such as Sparse Mixture-of-Experts, reduce computation by activating only a subset of parameters, but still suffer from high memory access costs and limited personalization capacity due to the large size and small number of experts. To address these challenges, we propose MSN, a memory-based sparse activation scaling framework for recommendation models. MSN dynamically retrieves personalized representations from a large parameterized memory and integrates them into downstream feature interaction modules via a memory gating mechanism, enabling fine-grained personalization with low…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks
