Generative Retrieval Meets Multi-Graded Relevance

Yubao Tang; Ruqing Zhang; Jiafeng Guo; Maarten de Rijke; Wei Chen,; Xueqi Cheng

arXiv:2409.18409·cs.IR·September 30, 2024

Generative Retrieval Meets Multi-Graded Relevance

Yubao Tang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Wei Chen,, Xueqi Cheng

PDF

Open Access

TL;DR

This paper introduces GR$^2$, a novel framework for generative retrieval that effectively handles multi-graded relevance by optimizing document identifiers and employing constrained contrastive training, improving retrieval accuracy.

Contribution

The paper extends generative retrieval to multi-graded relevance scenarios by developing a framework that ensures distinct, relevant identifiers and uses relevance-aware contrastive training.

Findings

01

GR$^2$ outperforms existing methods on multi-graded relevance datasets.

02

The approach effectively balances relevance and distinctness of document identifiers.

03

Experimental results show improved retrieval performance with multi-graded relevance data.

Abstract

Generative retrieval represents a novel approach to information retrieval. It uses an encoder-decoder architecture to directly produce relevant document identifiers (docids) for queries. While this method offers benefits, current approaches are limited to scenarios with binary relevance data, overlooking the potential for documents to have multi-graded relevance. Extending generative retrieval to accommodate multi-graded relevance poses challenges, including the need to reconcile likelihood probabilities for docid pairs and the possibility of multiple relevant documents sharing the same identifier. To address these challenges, we introduce a framework called GRaded Generative Retrieval (GR $^{2}$ ). GR $^{2}$ focuses on two key components: ensuring relevant and distinct identifiers, and implementing multi-graded constrained contrastive training. First, we create identifiers that are both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling