Differentiable Semantic ID for Generative Recommendation

Junchen Fu; Xuri Ge; Alexandros Karatzoglou; Ioannis Arapakis; Suzan Verberne; Joemon M. Jose; Zhaochun Ren

arXiv:2601.19711·cs.IR·April 15, 2026

TL;DR

This paper introduces DIGER, a method that makes semantic IDs differentiable for generative recommendation, enabling direct optimization of recommendation accuracy and addressing codebook collapse.

Contribution

The paper proposes injecting Gumbel noise, together with decay strategies, into differentiable semantic indexing, which improves both recommendation performance and code utilization.
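To make the idea of a decay strategy concrete, here is a minimal sketch of two noise-scale schedules. It assumes the decay applies to the scale of the injected Gumbel noise; the function names, the linear and exponential shapes, and the default start and end values are illustrative assumptions, not the schedules defined in the paper.

```python
def linear_decay(step, total_steps, start=1.0, end=0.0):
    """Hypothetical schedule: noise scale falls linearly from start to end."""
    # Clamp progress to [0, 1] so steps outside the range stay at the endpoints.
    frac = min(max(step / total_steps, 0.0), 1.0)
    return start + (end - start) * frac


def exponential_decay(step, total_steps, start=1.0, end=0.01):
    """Hypothetical schedule: noise scale falls geometrically from start to end."""
    frac = min(max(step / total_steps, 0.0), 1.0)
    # Requires start > 0 and end > 0 so the ratio is well defined.
    return start * (end / start) ** frac


# Noise scale at the start, midpoint, and end of a 100-step run.
linear_points = [linear_decay(s, 100) for s in (0, 50, 100)]
exponential_points = [exponential_decay(s, 100) for s in (0, 50, 100)]
```

Both schedules start with a large noise scale, which favors exploration over codes, and end with a small one, which lets assignments settle. The exponential variant drops faster early on, so it spends fewer steps in the high-noise regime.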

Findings

01 Consistent improvements on multiple datasets.

02 Effective mitigation of codebook collapse.

03 Demonstrated benefits of aligning indexing with recommendation objectives.

Abstract

Generative recommendation provides a novel paradigm in which each item is represented by a discrete semantic ID (SID) learned from rich content. Most existing methods treat SIDs as predefined and train recommenders under static indexing. In practice, SIDs are typically optimized only for content reconstruction rather than recommendation accuracy. This leads to an objective mismatch: the system optimizes an indexing loss to learn the SID and a recommendation loss for interaction prediction, but because the tokenizer is trained independently, the recommendation loss cannot update it. A natural approach is to make semantic indexing differentiable so that recommendation gradients can directly influence SID learning, but this often causes codebook collapse, where only a few codes are used. We attribute this issue to early deterministic assignments that limit codebook exploration, resulting…
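The link between early deterministic assignments and codebook collapse can be illustrated with a toy experiment. The sketch below uses the hard Gumbel-max draw (argmax over logits plus Gumbel noise) rather than a differentiable relaxation, and it is not the authors' implementation: the item count, codebook size, logit values, and all function names are assumptions chosen for illustration.

```python
import math
import random


def gumbel_noise(rng):
    """Draw one standard Gumbel(0, 1) sample by inverse transform."""
    # Clamp away from 0 and 1 so both logarithms stay finite.
    u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
    return -math.log(-math.log(u))


def assign_code(logits, noise_scale, rng):
    """Return the index of the largest logit after adding scaled Gumbel noise.

    With noise_scale = 0 this is the plain deterministic argmax.
    """
    perturbed = [l + noise_scale * gumbel_noise(rng) for l in logits]
    return max(range(len(perturbed)), key=perturbed.__getitem__)


def code_usage(all_logits, noise_scale, seed=0):
    """Count how many distinct codes are selected across all items."""
    rng = random.Random(seed)
    return len({assign_code(l, noise_scale, rng) for l in all_logits})


# Toy setup (assumed): 200 items, 8 codes, and an initialization in which
# code 0 has a small head start for every item.
init_rng = random.Random(42)
items = [
    [0.5 if k == 0 else init_rng.uniform(-0.1, 0.1) for k in range(8)]
    for _ in range(200)
]

deterministic_usage = code_usage(items, noise_scale=0.0)
noisy_usage = code_usage(items, noise_scale=1.0)
```

In this toy setup the deterministic assignment sends every item to code 0, because its head start is never challenged. Adding Gumbel noise turns each assignment into a sample from the softmax over the logits, so the other codes also receive items and could, in a trained system, receive gradient signal.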

Code & Models

Repositories

junchen-fu/DIGER (GitHub)
