Discrimination Is Generation: Unifying Ranking and Retrieval from a Tokenizer Perspective

Shuli Wang; Junwei Yin; Changhao Li; Senjie Kou; Chi Wang; Yinqiu Huang; Yinhua Zhu; Haitao Wang; Xingxing Wang

arXiv:2605.14853·cs.IR·May 15, 2026

Discrimination Is Generation: Unifying Ranking and Retrieval from a Tokenizer Perspective

Shuli Wang, Junwei Yin, Changhao Li, Senjie Kou, Chi Wang, Yinqiu Huang, Yinhua Zhu, Haitao Wang, Xingxing Wang

PDF

TL;DR

This paper introduces DIG, a unified model that embeds tokenizers within a discriminative ranking framework, aligning retrieval and ranking tasks for improved recommendation quality.

Contribution

DIG is the first end-to-end training approach that unifies tokenization and ranking, enhancing both retrieval and ranking performance in generative recommendation systems.

Findings

01

DIG improves ranking and retrieval metrics across multiple datasets.

02

Unified training leads to better recommendation quality than separate models.

03

The approach demonstrates effectiveness on both public benchmarks and industrial datasets.

Abstract

Semantic IDs (SIDs) define the generation space of generative recommendation and directly determine its personalization ceiling. However, existing tokenizers are trained independently with retrieval objectives, leaving personalization signals fully decoupled from the SID construction process -- a fundamental gap that causes generative retrieval to persistently lag behind discriminative ranking. In this paper, we rethink the essence of SIDs: \emph{ranking seeks argmax in item space while retrieval seeks argmax in token space; both are the same problem solved at different granularities.} Based on this insight, we propose \DIG (\textbf{D}iscrimination \textbf{I}s \textbf{G}eneration), which embeds the tokenizer inside a discriminative ranking model for end-to-end training -- the ranker naturally becomes a retrieval model, yielding two models from a single training run. \DIG is organized…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.