Adaptive Retrieval and Scalable Indexing for k-NN Search with Cross-Encoders
Nishant Yadav, Nicholas Monath, Manzil Zaheer, Rob Fergus, Andrew, McCallum

TL;DR
This paper introduces a scalable, efficient method for k-NN search using cross-encoders that improves recall and speed by approximating CE scores with a novel sparse-matrix factorization approach, reducing computational costs.
Contribution
The authors propose a new sparse-matrix factorization technique for efficient approximation of CE scores, enabling scalable and accurate k-NN retrieval without extensive finetuning.
Findings
Up to 5% recall improvement for k=1
Up to 54% recall improvement for k=100
Speedup of up to 100x over CUR-based methods
Abstract
Cross-encoder (CE) models which compute similarity by jointly encoding a query-item pair perform better than embedding-based models (dual-encoders) at estimating query-item relevance. Existing approaches perform k-NN search with CE by approximating the CE similarity with a vector embedding space fit either with dual-encoders (DE) or CUR matrix factorization. DE-based retrieve-and-rerank approaches suffer from poor recall on new domains and the retrieval with DE is decoupled from the CE. While CUR-based approaches can be more accurate than the DE-based approach, they require a prohibitively large number of CE calls to compute item embeddings, thus making it impractical for deployment at scale. In this paper, we address these shortcomings with our proposed sparse-matrix factorization based method that efficiently computes latent query and item embeddings to approximate CE scores and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Data Management and Algorithms · Algorithms and Data Compression
MethodsSparse Evolutionary Training · k-Nearest Neighbors
