Reasoning over Semantic IDs Enhances Generative Recommendation
Yingzhi He, Yan Sun, Junfei Tan, Yuxin Chen, Xiaoyu Kong, Chunxu Shen, Xiang Wang, An Zhang, Tat-Seng Chua

TL;DR
This paper introduces SIDReasoner, a framework that enhances semantic ID-based generative recommendation by improving reasoning and alignment with language, leading to better accuracy, interpretability, and cross-domain generalization.
Contribution
The paper proposes a novel two-stage framework that strengthens SID-language alignment and guides reasoning without relying on explicit reasoning annotations.
Findings
Improved recommendation accuracy across three datasets.
Enhanced interpretability of the recommendation process.
Better cross-domain generalization capabilities.
Abstract
Recent advances in generative recommendation have leveraged pretrained LLMs by formulating sequential recommendation as autoregressive generation over a unified token space comprising language tokens and itemic identifiers, where each item is represented by a compact sequence of discrete tokens, namely Semantic IDs (SIDs). This SID-based formulation enables efficient decoding over large-scale item corpora and provides a natural interface for LLM-based recommenders to leverage rich world knowledge. Meanwhile, breakthroughs in LLM reasoning motivate reasoning-enhanced recommendation, yet effective reasoning over SIDs remains underexplored and challenging. Itemic tokens are not natively meaningful to LLMs; moreover, recommendation-oriented SID reasoning is hard to evaluate, making high-quality supervision scarce. To address these challenges, we propose SIDReasoner, a two-stage framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Recommender Systems and Techniques · Advanced Graph Neural Networks
