Order-agnostic Identifier for Large Language Model-based Generative Recommendation
Xinyu Lin, Haihan Shi, Wenjie Wang, Fuli Feng, Qifan Wang, See-Kiong Ng, Tat-Seng Chua

TL;DR
This paper introduces SETRec, a novel order-agnostic set identifier paradigm for LLM-based generative recommendation, combining collaborative filtering and semantic info for efficient, scalable item recommendation.
Contribution
The paper proposes a set identifier paradigm and SETRec model that integrate CF and semantic info, using order-agnostic tokens for improved efficiency and scalability in LLM recommendation.
Findings
SETRec outperforms existing methods in various recommendation scenarios.
SETRec demonstrates superior efficiency and scalability, especially for cold-start items.
Extensive experiments validate the effectiveness of the proposed approach.
Abstract
Leveraging Large Language Models (LLMs) for generative recommendation has attracted significant research interest, where item tokenization is a critical step. It involves assigning item identifiers for LLMs to encode user history and generate the next item. Existing approaches leverage either token-sequence identifiers, representing items as discrete token sequences, or single-token identifiers, using ID or semantic embeddings. Token-sequence identifiers face issues such as the local optima problem in beam search and low generation efficiency due to step-by-step generation. In contrast, single-token identifiers fail to capture rich semantics or encode Collaborative Filtering (CF) information, resulting in suboptimal performance. To address these issues, we propose two fundamental principles for item identifier design: 1) integrating both CF and semantic information to fully capture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Recommender Systems and Techniques · Natural Language Processing Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Inverse Square Root Schedule · SentencePiece · Layer Normalization · Residual Connection · Linear Layer · Dense Connections · Attention Dropout
