TL;DR
LADR is a novel method that enhances dense retrieval efficiency by combining lexical and proximity graph techniques, achieving near-exhaustive accuracy with significantly reduced computational cost.
Contribution
LADR introduces a simple, effective approach that improves dense retrieval efficiency without sacrificing effectiveness, using lexical seeds and a proximity graph exploration.
Findings
LADR outperforms existing approximate methods on efficiency and recall.
LADR achieves near-exhaustive search accuracy at around 8ms per query.
LADR establishes a new effectiveness-efficiency Pareto frontier.
Abstract
Retrieval approaches that score documents based on learned dense vectors (i.e., dense retrieval) rather than lexical signals (i.e., conventional retrieval) are increasingly popular. Their ability to identify related documents that do not necessarily contain the same terms as those appearing in the user's query (thereby improving recall) is one of their key advantages. However, to actually achieve these gains, dense retrieval approaches typically require an exhaustive search over the document collection, making them considerably more expensive at query-time than conventional lexical approaches. Several techniques aim to reduce this computational overhead by approximating the results of a full dense retriever. Although these approaches reasonably approximate the top results, they suffer in terms of recall -- one of the key advantages of dense retrieval. We introduce 'LADR'…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
