MALLOC: Benchmarking the Memory-aware Long Sequence Compression for Large Sequential Recommendation

Qihang Yu; Kairui Fu; Zhaocheng Du; Yuxuan Si; Kaiyuan Li; Weihao Zhao; Zhicheng Zhang; Jieming Zhu; Quanyu Dai; Zhenhua Dong; Shengyu Zhang; Kun Kuang; Fei Wu

arXiv:2601.20234 · cs.IR · January 30, 2026

Open Access

TL;DR

MALLOC is a benchmark designed to evaluate memory-aware long sequence compression techniques in large-scale recommendation systems, addressing the challenge of balancing memory usage and computational efficiency.

Contribution

This paper introduces MALLOC, a comprehensive benchmark and systematic classification for memory management techniques in long sequence recommendation models.

Findings

01. Memory-aware compression improves efficiency without sacrificing accuracy.

02. Systematic evaluation reveals trade-offs between memory usage and model performance.

03. MALLOC provides a reproducible platform for future research in large-scale recommendation memory management.

Abstract

The scaling law, which indicates that model performance improves with increasing dataset and model capacity, has fueled a growing trend in expanding recommendation models in both industry and academia. However, the advent of large-scale recommenders also brings significantly higher computational costs, particularly under the long-sequence dependencies inherent in the user intent of recommendation systems. Current approaches often rely on pre-storing the intermediate states of the past behavior for each user, thereby reducing the quadratic re-computation cost for the following requests. Despite their effectiveness, these methods often treat memory merely as a medium for acceleration, without adequately considering the space overhead it introduces. This presents a critical challenge in real-world recommendation systems with billions of users, each of whom might initiate thousands of…

Taxonomy

Topics: Recommender Systems and Techniques · Explainable Artificial Intelligence (XAI) · Stochastic Gradient Optimization Techniques