TL;DR
This paper introduces a novel pseudo-relevance feedback method for multiple representation dense retrieval, significantly improving retrieval effectiveness by augmenting query embeddings with feedback-derived embeddings.
Contribution
It is the first to explore pseudo-relevance feedback for multiple representation dense retrieval, enhancing retrieval performance with a new embedding augmentation technique.
Findings
MAP improved by up to 26% on TREC 2019 queries
MAP improved by up to 10% on TREC 2020 queries
Effective enhancement of dense retrieval with feedback embeddings
Abstract
Pseudo-relevance feedback mechanisms, from Rocchio to the relevance models, have shown the usefulness of expanding and reweighting the users' initial queries using information occurring in an initial set of retrieved documents, known as the pseudo-relevant set. Recently, dense retrieval -- through the use of neural contextual language models such as BERT for analysing the documents' and queries' contents and computing their relevance scores -- has shown a promising performance on several information retrieval tasks still relying on the traditional inverted index for identifying documents relevant to a query. Two different dense retrieval families have emerged: the use of single embedded representations for each passage and query (e.g. using BERT's [CLS] token), or via multiple representations (e.g. using an embedding for each token of the query and document). In this work, we conduct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Adam · Weight Decay · Dropout · WordPiece · Layer Normalization · Linear Warmup With Linear Decay
