Semantic IDs for Joint Generative Search and Recommendation

Gustavo Penha; Edoardo D'Amico; Marco De Nadai; Enrico Palumbo; Alexandre Tamborrino; Ali Vardasbi; Max Lefarov; Shawn Lin; Timothy Heath; Francesco Fabbri; Hugues Bouchard

arXiv:2508.10478·cs.IR·August 15, 2025

Semantic IDs for Joint Generative Search and Recommendation

Gustavo Penha, Edoardo D'Amico, Marco De Nadai, Enrico Palumbo, Alexandre Tamborrino, Ali Vardasbi, Max Lefarov, Shawn Lin, Timothy Heath, Francesco Fabbri, Hugues Bouchard

PDF

TL;DR

This paper investigates how to construct Semantic IDs for large language model-based joint search and recommendation systems, demonstrating that a bi-encoder approach yields effective, generalizable item representations for both tasks.

Contribution

It introduces a unified Semantic ID construction method using bi-encoder embeddings, improving joint task performance and generalizability over task-specific IDs.

Findings

01

Bi-encoder based Semantic IDs outperform task-specific IDs.

02

Unified Semantic ID space balances search and recommendation performance.

03

The approach enhances generalizability across tasks.

Abstract

Generative models powered by Large Language Models (LLMs) are emerging as a unified solution for powering both recommendation and search tasks. A key design choice in these models is how to represent items, traditionally through unique identifiers (IDs) and more recently with Semantic IDs composed of discrete codes, obtained from embeddings. While task-specific embedding models can improve performance for individual tasks, they may not generalize well in a joint setting. In this paper, we explore how to construct Semantic IDs that perform well both in search and recommendation when using a unified model. We compare a range of strategies to construct Semantic IDs, looking into task-specific and cross-tasks approaches, and also whether each task should have its own semantic ID tokens in a joint search and recommendation generative model. Our results show that using a bi-encoder model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.