Recommender Systems with Generative Retrieval
Shashank Rajput, Nikhil Mehta, Anima Singh, Raghunandan H. Keshavan,, Trung Vu, Lukasz Heldt, Lichan Hong, Yi Tay, Vinh Q. Tran, Jonah Samost,, Maciej Kula, Ed H. Chi, Maheswaran Sathiamoorthy

TL;DR
This paper introduces a novel generative retrieval method for recommender systems that uses semantic IDs and sequence-to-sequence models, outperforming state-of-the-art models and improving generalization to new items.
Contribution
It proposes the first Semantic ID-based generative model for recommendation, enhancing retrieval accuracy and generalization in large-scale systems.
Findings
Outperforms current SOTA models on various datasets
Improves retrieval for items with no prior interactions
Enhances model generalization with Semantic IDs
Abstract
Modern recommender systems perform large-scale retrieval by first embedding queries and item candidates in the same unified space, followed by approximate nearest neighbor search to select top candidates given a query embedding. In this paper, we propose a novel generative retrieval approach, where the retrieval model autoregressively decodes the identifiers of the target candidates. To that end, we create semantically meaningful tuple of codewords to serve as a Semantic ID for each item. Given Semantic IDs for items in a user session, a Transformer-based sequence-to-sequence model is trained to predict the Semantic ID of the next item that the user will interact with. To the best of our knowledge, this is the first Semantic ID-based generative model for recommendation tasks. We show that recommender systems trained with the proposed paradigm significantly outperform the current SOTA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling · Advanced Image and Video Retrieval Techniques
