CoVE: Compressed Vocabulary Expansion Makes Better LLM-based Recommender Systems
Haochen Zhang, Tianyi Zhang, Junze Yin, Oren Gal, Anshumali Shrivastava, Vladimir Braverman

TL;DR
CoVE leverages the sequence understanding capabilities of large language models by expanding item vocabularies and compressing embeddings, significantly improving recommendation performance and scalability.
Contribution
The paper introduces a novel compressed vocabulary expansion framework that enhances LLM-based recommender systems by utilizing sequence understanding and embedding compression.
Findings
Significant performance improvements on multiple datasets
Effective vocabulary expansion for better recommendation accuracy
Embedding compression enables large-scale industrial application
Abstract
Recommender systems play a pivotal role in providing relevant content to users. With the rapid development of large language models (LLMs), researchers have begun utilizing LLMs to build more powerful recommender systems. However, existing approaches that focus on aligning LLMs with recommendation tasks do not fully leverage their sequential information processing capabilities, leading to suboptimal performance. In this paper, we propose a novel system called compressed vocabulary expansion (CoVE). In CoVE, each item is assigned a unique ID within the expanded vocabulary. Our framework effectively capitalizes on sequence understanding abilities of LLMs, significantly enhancing their performance on recommendation tasks. Additionally, we compress the embedding layer, making CoVE practical for large-scale industrial applications. The effectiveness and performance of CoVE are demonstrated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies
MethodsGloVe Embeddings · Softmax · Long Short-Term Memory · Sequence to Sequence · Bidirectional LSTM · Contextual Word Vectors · Focus
