Masked Diffusion for Generative Recommendation
Kulin Shah, Bhuvesh Kumar, Neil Shah, Liam Collins

TL;DR
This paper introduces a masked diffusion approach for generative recommendation using semantic IDs, enabling parallel decoding and outperforming traditional autoregressive models especially in data-limited scenarios.
Contribution
It proposes a novel masked diffusion method for sequence modeling in generative recommendation, improving inference efficiency and performance over autoregressive models.
Findings
Outperforms autoregressive models in various settings.
More effective in data-constrained environments.
Enables parallel prediction of multiple SIDs.
Abstract
Generative recommendation (GR) with semantic IDs (SIDs) has emerged as a promising alternative to traditional recommendation approaches due to its performance gains, capitalization on semantic information provided through language model embeddings, and inference and storage efficiency. Existing GR with SIDs works frame the probability of a sequence of SIDs corresponding to a user's interaction history using autoregressive modeling. While this has led to impressive next item prediction performances in certain settings, these autoregressive GR with SIDs models suffer from expensive inference due to sequential token-wise decoding, potentially inefficient use of training data and bias towards learning short-context relationships among tokens. Inspired by recent breakthroughs in NLP, we propose to instead model and learn the probability of a user's sequence of SIDs using masked diffusion.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling · Machine Learning in Healthcare
