Semantic Convergence: Harmonizing Recommender Systems via Two-Stage Alignment and Behavioral Semantic Tokenization

Guanghan Li; Xun Zhang; Yufei Zhang; Yifan Yin; Guojun Yin; Wei Lin

arXiv:2412.13771·cs.IR·December 19, 2024

TL;DR

This paper introduces a two-stage framework that aligns traditional recommendation signals with large language models through semantic tokenization and supervised alignment tasks, significantly enhancing recommendation accuracy and scalability.

Contribution

It presents a novel semantic alignment method combining item tokenization and supervised learning to bridge collaborative signals with LLM semantics in recommendation systems.

Findings

1. Improved recall metrics in experiments
2. Enhanced scalability of recommendation systems
3. Reduced inference latency through pre-caching

Abstract

Large language models (LLMs), endowed with exceptional reasoning capabilities, are adept at discerning profound user interests from historical behaviors, thereby presenting a promising avenue for the advancement of recommendation systems. However, a notable discrepancy persists between the sparse collaborative semantics typically found in recommendation systems and the dense token representations within LLMs. In our study, we propose a novel framework that harmoniously merges traditional recommendation models with the prowess of LLMs. We initiate this integration by transforming ItemIDs into sequences that align semantically with the LLM space, through the proposed Alignment Tokenization module. Additionally, we design a series of specialized supervised learning tasks aimed at aligning collaborative signals with the subtleties of natural language semantics. To ensure practical…

Taxonomy

Topics: Topic Modeling · Recommender Systems and Techniques · Advanced Text Analysis Techniques

Methods: ALIGN