Lexically-Accelerated Dense Retrieval

Hrishikesh Kulkarni; Sean MacAvaney; Nazli Goharian; Ophir Frieder

arXiv:2307.16779·cs.IR·August 1, 2023

Lexically-Accelerated Dense Retrieval

Hrishikesh Kulkarni, Sean MacAvaney, Nazli Goharian, Ophir Frieder

PDF

1 Repo

TL;DR

LADR is a novel method that enhances dense retrieval efficiency by combining lexical and proximity graph techniques, achieving near-exhaustive accuracy with significantly reduced computational cost.

Contribution

LADR introduces a simple, effective approach that improves dense retrieval efficiency without sacrificing effectiveness, using lexical seeds and a proximity graph exploration.

Findings

01

LADR outperforms existing approximate methods on efficiency and recall.

02

LADR achieves near-exhaustive search accuracy at around 8ms per query.

03

LADR establishes a new effectiveness-efficiency Pareto frontier.

Abstract

Retrieval approaches that score documents based on learned dense vectors (i.e., dense retrieval) rather than lexical signals (i.e., conventional retrieval) are increasingly popular. Their ability to identify related documents that do not necessarily contain the same terms as those appearing in the user's query (thereby improving recall) is one of their key advantages. However, to actually achieve these gains, dense retrieval approaches typically require an exhaustive search over the document collection, making them considerably more expensive at query-time than conventional lexical approaches. Several techniques aim to reduce this computational overhead by approximating the results of a full dense retriever. Although these approaches reasonably approximate the top results, they suffer in terms of recall -- one of the key advantages of dense retrieval. We introduce 'LADR'…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

georgetown-ir-lab/ladr
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.