MSN: A Memory-based Sparse Activation Scaling Framework for Large-scale Industrial Recommendation

Shikang Wu; Hui Lu; Jinqiu Jin; Zheng Chai; Shiyong Hong; Junjie Zhang; Shanlei Mu; Kaiyuan Ma; Tianyi Liu; Yuchao Zheng; Zhe Wang; Jingjian Lin

arXiv:2602.07526·cs.IR·February 10, 2026

MSN: A Memory-based Sparse Activation Scaling Framework for Large-scale Industrial Recommendation

Shikang Wu, Hui Lu, Jinqiu Jin, Zheng Chai, Shiyong Hong, Junjie Zhang, Shanlei Mu, Kaiyuan Ma, Tianyi Liu, Yuchao Zheng, Zhe Wang, Jingjian Lin

PDF

Open Access

TL;DR

MSN introduces a memory-based sparse activation framework for large-scale recommendation systems, enhancing personalization and efficiency by dynamically retrieving representations from a large memory while controlling computational costs.

Contribution

The paper proposes MSN, a novel framework that integrates large parameterized memory with sparse activation and efficient retrieval mechanisms for improved industrial recommendation models.

Findings

01

MSN improves recommendation accuracy over state-of-the-art models.

02

MSN maintains low computational overhead suitable for industrial deployment.

03

MSN achieves significant online performance gains in Douyin Search Ranking.

Abstract

Scaling deep learning recommendation models is an effective way to improve model expressiveness. Existing approaches often incur substantial computational overhead, making them difficult to deploy in large-scale industrial systems under strict latency constraints. Recent sparse activation scaling methods, such as Sparse Mixture-of-Experts, reduce computation by activating only a subset of parameters, but still suffer from high memory access costs and limited personalization capacity due to the large size and small number of experts. To address these challenges, we propose MSN, a memory-based sparse activation scaling framework for recommendation models. MSN dynamically retrieves personalized representations from a large parameterized memory and integrates them into downstream feature interaction modules via a memory gating mechanism, enabling fine-grained personalization with low…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks