Bending the Scaling Law Curve in Large-Scale Recommendation Systems

Qin Ding; Kevin Course; Linjian Ma; Jianhui Sun; Ruochen Liu; Zhao Zhu; Chunxing Yin; Wei Li; Dai Li; Yu Shi; Xuan Cao; Ze Yang; Han Li; Xing Liu; Bi Xue; Hongwei Li; Rui Jian; Daisy Shi He; Jing Qian; Matt Ma; Qunshu Zhang; Rui Li

arXiv:2602.16986·cs.IR·February 23, 2026

Bending the Scaling Law Curve in Large-Scale Recommendation Systems

Qin Ding, Kevin Course, Linjian Ma, Jianhui Sun, Ruochen Liu, Zhao Zhu, Chunxing Yin, Wei Li, Dai Li, Yu Shi, Xuan Cao, Ze Yang, Han Li, Xing Liu, Bi Xue, Hongwei Li, Rui Jian, Daisy Shi He, Jing Qian, Matt Ma, Qunshu Zhang, Rui Li

PDF

Open Access

TL;DR

This paper introduces ULTRA-HSTU, a novel recommendation model that significantly improves training and inference efficiency while enhancing recommendation quality through innovative sequence design and sparse attention mechanisms, deployed at scale.

Contribution

The paper presents ULTRA-HSTU, a new sequential recommendation model with end-to-end co-design, achieving over 5x faster training, 21x faster inference, and superior recommendation performance.

Findings

01

Over 5x faster training scaling

02

21x faster inference scaling

03

4-8% improvements in user engagement

Abstract

Learning from user interaction history through sequential models has become a cornerstone of large-scale recommender systems. Recent advances in large language models have revealed promising scaling laws, sparking a surge of research into long-sequence modeling and deeper architectures for recommendation tasks. However, many recent approaches rely heavily on cross-attention mechanisms to address the quadratic computational bottleneck in sequential modeling, which can limit the representational power gained from self-attention. We present ULTRA-HSTU, a novel sequential recommendation model developed through end-to-end model and system co-design. By innovating in the design of input sequences, sparse attention mechanisms, and model topology, ULTRA-HSTU achieves substantial improvements in both model quality and efficiency. Comprehensive benchmarking demonstrates that ULTRA-HSTU achieves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Explainable Artificial Intelligence (XAI) · Topic Modeling