Hi-Gen: Generative Retrieval For Large-Scale Personalized E-commerce Search
Yanjing Wu, Yinfu Feng, Jian Wang, Wenji Zhou, Yunan Ye, Rong Xiao and, Jun Xiao

TL;DR
Hi-Gen introduces a hierarchical generative retrieval method tailored for large-scale personalized e-commerce search, effectively encoding efficiency and positional information to improve retrieval accuracy and support real-time online deployment.
Contribution
The paper presents Hi-Gen, a novel hierarchical encoding-decoding approach that incorporates efficiency and positional information, advancing generative retrieval in large-scale e-commerce.
Findings
Achieves 3.30% and 4.62% improvements in Recall@1 over SOTA on public and industry datasets.
Introduces a category-guided hierarchical clustering scheme for docID generation.
Develops variants supporting real-time large-scale online recall.
Abstract
Leveraging generative retrieval (GR) techniques to enhance search systems is an emerging methodology that has shown promising results in recent years. In GR, a text-to-text model maps string queries directly to relevant document identifiers (docIDs), dramatically simplifying the retrieval process. However, when applying most GR models in large-scale E-commerce for personalized item search, we must face two key problems in encoding and decoding. (1) Existing docID generation methods ignore the encoding of efficiency information, which is critical in E-commerce. (2) The positional information is important in decoding docIDs, while prior studies have not adequately discriminated the significance of positional information or well exploited the inherent interrelation among these positions. To overcome these problems, we introduce an efficient Hierarchical encoding-decoding Generative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Data Mining Algorithms and Applications · Web Data Mining and Analysis
