Adaptive Retrieval and Scalable Indexing for k-NN Search with   Cross-Encoders

Nishant Yadav; Nicholas Monath; Manzil Zaheer; Rob Fergus; Andrew; McCallum

arXiv:2405.03651·cs.IR·May 7, 2024

Adaptive Retrieval and Scalable Indexing for k-NN Search with Cross-Encoders

Nishant Yadav, Nicholas Monath, Manzil Zaheer, Rob Fergus, Andrew, McCallum

PDF

Open Access

TL;DR

This paper introduces a scalable, efficient method for k-NN search using cross-encoders that improves recall and speed by approximating CE scores with a novel sparse-matrix factorization approach, reducing computational costs.

Contribution

The authors propose a new sparse-matrix factorization technique for efficient approximation of CE scores, enabling scalable and accurate k-NN retrieval without extensive finetuning.

Findings

01

Up to 5% recall improvement for k=1

02

Up to 54% recall improvement for k=100

03

Speedup of up to 100x over CUR-based methods

Abstract

Cross-encoder (CE) models which compute similarity by jointly encoding a query-item pair perform better than embedding-based models (dual-encoders) at estimating query-item relevance. Existing approaches perform k-NN search with CE by approximating the CE similarity with a vector embedding space fit either with dual-encoders (DE) or CUR matrix factorization. DE-based retrieve-and-rerank approaches suffer from poor recall on new domains and the retrieval with DE is decoupled from the CE. While CUR-based approaches can be more accurate than the DE-based approach, they require a prohibitively large number of CE calls to compute item embeddings, thus making it impractical for deployment at scale. In this paper, we address these shortcomings with our proposed sparse-matrix factorization based method that efficiently computes latent query and item embeddings to approximate CE scores and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Data Management and Algorithms · Algorithms and Data Compression

MethodsSparse Evolutionary Training · k-Nearest Neighbors