On Approximate Nearest Neighbour Selection for Multi-Stage Dense   Retrieval

Craig Macdonald; Nicola Tonellotto

arXiv:2108.11480·cs.IR·August 27, 2021

On Approximate Nearest Neighbour Selection for Multi-Stage Dense Retrieval

Craig Macdonald, Nicola Tonellotto

PDF

1 Repo

TL;DR

This paper explores using approximate nearest neighbour scores to efficiently rank candidate documents in dense retrieval, achieving similar effectiveness with half the candidate set and doubling retrieval speed.

Contribution

It introduces a method to reduce candidate set size in dense retrieval by leveraging ANN scores, improving efficiency without sacrificing effectiveness.

Findings

01

Reducing candidate set to 200 documents maintains effectiveness.

02

Using ANN scores speeds up retrieval by 2x.

03

Effective ranking is possible with smaller candidate sets.

Abstract

Dense retrieval, which describes the use of contextualised language models such as BERT to identify documents from a collection by leveraging approximate nearest neighbour (ANN) techniques, has been increasing in popularity. Two families of approaches have emerged, depending on whether documents and queries are represented by single or multiple embeddings. ColBERT, the exemplar of the latter, uses an ANN index and approximate scores to identify a set of candidate documents for each query embedding, which are then re-ranked using accurate document representations. In this manner, a large number of documents can be retrieved for each query, hindering the efficiency of the approach. In this work, we investigate the use of ANN scores for ranking the candidate documents, in order to decrease the number of candidate documents being fully scored. Experiments conducted on the MSMARCO passage…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

terrierteam/pyterrier_colbert
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Layer Normalization · Linear Warmup With Linear Decay · Dropout · Softmax · Weight Decay · Adam