Accelerating Generative Recommendation via Simple Categorical User Sequence Compression

Qijiong Liu; Lu Fan; Zhongzhou Liu; Xiaoyu Dong; Yuankai Luo; Guoyuan An; Nuo Chen; Wei Guo; Yong Liu; Xiao-Ming Wu

arXiv:2601.19158·cs.IR·January 28, 2026

Accelerating Generative Recommendation via Simple Categorical User Sequence Compression

Qijiong Liu, Lu Fan, Zhongzhou Liu, Xiaoyu Dong, Yuankai Luo, Guoyuan An, Nuo Chen, Wei Guo, Yong Liu, Xiao-Ming Wu

PDF

Open Access

TL;DR

This paper introduces a straightforward method for compressing user history sequences in generative recommenders using item categories, significantly reducing computational costs while maintaining or improving accuracy.

Contribution

The proposed categorical compression technique efficiently reduces sequence length and computational load without sacrificing recommendation quality, outperforming existing models like HSTU.

Findings

01

Up to 6x reduction in computational cost

02

Up to 39% increase in accuracy at similar sequence length

03

Effective on large-scale datasets

Abstract

Although generative recommenders demonstrate improved performance with longer sequences, their real-time deployment is hindered by substantial computational costs. To address this challenge, we propose a simple yet effective method for compressing long-term user histories by leveraging inherent item categorical features, thereby preserving user interests while enhancing efficiency. Experiments on two large-scale datasets demonstrate that, compared to the influential HSTU model, our approach achieves up to a 6x reduction in computational cost and up to 39% higher accuracy at comparable cost (i.e., similar sequence length).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Explainable Artificial Intelligence (XAI)