Efficient Generative Retrieval for E-commerce Search with Semantic Cluster IDs and Expert-Guided RL
Jianbo Zhu, Xing Fang, Jing Wang, Mingmin Jin, Bokang Wang, Guangxin Song, Zhenyu Xie, Junjie Bai

TL;DR
This paper introduces CQ-SID and EG-GRPO, a generative retrieval framework for e-commerce search that improves recall efficiency and aligns with ranking goals, demonstrated through offline and online experiments.
Contribution
It proposes a hierarchical semantic encoding method and reinforcement learning approach to enhance generative retrieval's practicality and effectiveness in real-world e-commerce.
Findings
CQ-SID reduces beam search complexity by half.
Offline experiments show up to 26.76% gain in semantic hitrate.
Online A/B tests show GMV increase of 1.15%.
Abstract
Generative retrieval offers a promising alternative by unifying the fragmented multi-stage retrieval process into a single end-to-end model. However, its practical adoption in industrial e-commerce search remains challenging, given the massive and dynamic product catalogs, strict latency requirements, and the need to align retrieval with downstream ranking goals. In this work, we propose a retrieval framework tailored for real-world recall scenarios, positioning generative retrieval as a recall-stage supplement rather than an end-to-end replacement. Our method, CQ-SID (Category-and-Query constrained Semantic ID), employs category-aware and query-item contrastive learning along with Residual Quantized VAEs to encode items into hierarchical semantic cluster identifiers, significantly reducing beam search complexity. Additionally, we develop EG-GRPO (Expert-Guided Group Relative Policy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
