Towards Efficient and Generalizable Retrieval: Adaptive Semantic Quantization and Residual Knowledge Transfer

Huimu Wang; Xingzhi Yao; Yiming Qiu; Qinghong Zhang; Haotian Wang; Yufan Cui; Songlin Wang; Sulong Xu; Mingming Li

arXiv:2602.23978·cs.IR·April 29, 2026

Towards Efficient and Generalizable Retrieval: Adaptive Semantic Quantization and Residual Knowledge Transfer

Huimu Wang, Xingzhi Yao, Yiming Qiu, Qinghong Zhang, Haotian Wang, Yufan Cui, Songlin Wang, Sulong Xu, Mingming Li

PDF

TL;DR

This paper introduces the SA^2CRQ framework, which enhances retrieval efficiency and generalization by adaptive quantization and residual knowledge transfer, especially benefiting cold-start and tail items.

Contribution

The paper proposes a novel adaptive quantization method and residual knowledge transfer technique to improve semantic retrieval for head and tail items.

Findings

01

SA^2CRQ improves retrieval accuracy in industrial and public datasets.

02

The framework effectively handles cold-start and tail item retrieval scenarios.

03

Experimental results show consistent performance gains over baselines.

Abstract

While semantic ID-based generative retrieval enables efficient end-to-end modeling in industrial applications, these methods face a persistent trade-off. On one hand, data-rich head items often suffer from ID collisions, which blur their distinct features and degrade downstream tasks. On the other hand, data-sparse tail items especially cold-start items are prone to semantic fragmentation during quantization; they are often mapped as isolated discrete points, which severely hinders their ability to generalize. To address this issue, we propose the Anchored Curriculum with Sequential Adaptive Quantization ( $S A^{2} C R Q$ ) framework. The framework introduces Sequential Adaptive Residual Quantization (SARQ) to dynamically allocate code lengths based on item path entropy, assigning longer, discriminative IDs to head items and shorter, generalizable IDs to tail items. To mitigate data sparsity,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.