Farewell to Item IDs: Unlocking the Scaling Potential of Large Ranking Models via Semantic Tokens

Zhen Zhao; Tong Zhang; Jie Xu; Qingliang Cai; Qile Zhang; Leyuan Yang; Daorui Xiao; Xiaojia Chang

arXiv:2601.22694·cs.IR·February 2, 2026

Open Access

TL;DR

This paper introduces TRM, a framework that replaces item IDs with semantic tokens in large ranking models for recommendation and search, reducing sparse storage by 33%, raising AUC by 0.85%, and scaling better as model capacity grows.

Contribution

The paper proposes a novel semantic token-based approach (TRM) that overcomes the limitations of item ID embeddings, enabling better scalability and stability in large ranking models.

Findings

01. 33% reduction in sparse storage

02. 0.85% increase in AUC

03. Improved user engagement metrics in deployment

Abstract

Recent studies on scaling up ranking models have achieved substantial improvements for recommendation systems and search engines. However, most large-scale ranking systems rely on item IDs, where each item is treated as an independent categorical symbol and mapped to a learned embedding. As items rapidly appear and disappear, these embeddings become difficult to train and maintain. This instability impedes effective learning of neural network parameters and limits the scalability of ranking models. In this paper, we show that semantic tokens possess greater scaling potential than item IDs. Our proposed framework, TRM, improves the token generation and application pipeline, leading to a 33% reduction in sparse storage while achieving a 0.85% AUC increase. Extensive experiments further show that TRM consistently outperforms state-of-the-art models as model capacity scales. Finally,…
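
The contrast the abstract draws between item-ID embeddings and semantic tokens can be illustrated with a minimal sketch. This is not TRM's pipeline, which the abstract does not describe; it assumes a generic scheme in which each item is represented by a short tuple of tokens, one per codebook level, and the item vector is the sum of the token rows. The class names, table sizes, and the example token tuple are all hypothetical, and the parameter counts are illustrative only and unrelated to the 33% figure reported above.

```python
import random


def make_table(rows, dim, seed):
    """Build a small randomly initialised embedding table (list of rows)."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(rows)]


class ItemIdEmbedding:
    """One learned row per item ID: the table grows with the catalogue."""

    def __init__(self, num_items, dim, seed=0):
        self.table = make_table(num_items, dim, seed)

    def lookup(self, item_id):
        return self.table[item_id]

    def num_params(self):
        return len(self.table) * len(self.table[0])


class SemanticTokenEmbedding:
    """Hypothetical token scheme: an item is a tuple of tokens, one per
    codebook level, and its vector is the sum of the matching token rows."""

    def __init__(self, num_levels, codebook_size, dim, seed=0):
        self.dim = dim
        self.tables = [
            make_table(codebook_size, dim, seed + level)
            for level in range(num_levels)
        ]

    def lookup(self, tokens):
        vec = [0.0] * self.dim
        for level, token in enumerate(tokens):
            row = self.tables[level][token]
            vec = [a + b for a, b in zip(vec, row)]
        return vec

    def num_params(self):
        return sum(len(table) * self.dim for table in self.tables)


# Illustrative sizes only (assumed, not taken from the paper).
id_emb = ItemIdEmbedding(num_items=10_000, dim=8)
tok_emb = SemanticTokenEmbedding(num_levels=3, codebook_size=256, dim=8)

id_params = id_emb.num_params()    # 10_000 * 8
tok_params = tok_emb.num_params()  # 3 * 256 * 8

# A newly added item needs no new row: it reuses existing token rows.
new_item_vec = tok_emb.lookup((17, 203, 5))
```

In this sketch the ID table needs a fresh, untrained row for every new item, whereas the token tables have a fixed size and a new item reuses rows that were already trained on other items. That is one plausible reading of why token-based inputs could be more stable as items appear and disappear.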

Taxonomy

Topics: Recommender Systems and Techniques · Information Retrieval and Search Behavior · Expert finding and Q&A systems