Mitigating Collaborative Semantic ID Staleness in Generative Retrieval

Vladimir Baikalov; Iskander Bagautdinov; Sergey Muravyov

arXiv:2604.13273·cs.IR·April 16, 2026

Mitigating Collaborative Semantic ID Staleness in Generative Retrieval

Vladimir Baikalov, Iskander Bagautdinov, Sergey Muravyov

PDF

TL;DR

This paper addresses the problem of stale semantic IDs in generative retrieval systems caused by evolving user-item interactions, proposing a lightweight SID alignment update to improve retrieval performance and reduce retraining costs.

Contribution

It introduces a model-agnostic SID alignment update method that maintains compatibility with existing vocabularies, enabling efficient fine-tuning without full retraining.

Findings

01

Consistently improves Recall@K and nDCG@K across three benchmarks.

02

Reduces retriever training compute by approximately 8-9 times.

03

Effectively mitigates SID staleness caused by temporal drift.

Abstract

Generative retrieval with Semantic IDs (SIDs) assigns each item a discrete identifier and treats retrieval as a sequence generation problem rather than a nearest-neighbor search. While content-only SIDs are stable, they do not take into account user-item interaction patterns, so recent systems construct interaction-informed SIDs. However, as interaction patterns drift over time, these identifiers become stale, i.e., their collaborative semantics no longer match recent logs. Prior work typically assumes a fixed SID vocabulary during fine-tuning, or treats SID refresh as a full rebuild that requires retraining. However, SID staleness under temporal drift is rarely analyzed explicitly. To bridge this gap, we study SID staleness under strict chronological evaluation and propose a lightweight, model-agnostic SID alignment update. Given refreshed SIDs derived from recent logs, we align them…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.