Generative Dense Retrieval: Memory Can Be a Burden
Peiwen Yuan, Xinglin Wang, Shaoxiong Feng, Boyuan Pan, Yiwei Li, Heda, Wang, Xupeng Miao, Kan Li

TL;DR
This paper introduces Generative Dense Retrieval (GDR), a new retrieval paradigm combining coarse cluster-level matching with fine-grained document retrieval, addressing memory and scalability issues of traditional generative retrieval methods.
Contribution
The paper proposes GDR, a hybrid retrieval framework that improves memory efficiency and scalability by combining cluster-based matching with intra-cluster document retrieval.
Findings
GDR achieves an average of 3.0 R@100 improvement on NQ dataset.
GDR demonstrates better scalability compared to traditional generative retrieval.
The proposed strategies enhance intra-cluster matching accuracy.
Abstract
Generative Retrieval (GR), autoregressively decoding relevant document identifiers given a query, has been shown to perform well under the setting of small-scale corpora. By memorizing the document corpus with model parameters, GR implicitly achieves deep interaction between query and document. However, such a memorizing mechanism faces three drawbacks: (1) Poor memory accuracy for fine-grained features of documents; (2) Memory confusion gets worse as the corpus size increases; (3) Huge memory update costs for new documents. To alleviate these problems, we propose the Generative Dense Retrieval (GDR) paradigm. Specifically, GDR first uses the limited memory volume to achieve inter-cluster matching from query to relevant document clusters. Memorizing-free matching mechanism from Dense Retrieval (DR) is then introduced to conduct fine-grained intra-cluster matching from clusters to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
