MUVERA: Multi-Vector Retrieval via Fixed Dimensional Encodings
Laxman Dhulipala, Majid Hadian, Rajesh Jayaram, Jason Lee, Vahab, Mirrokni

TL;DR
MUVERA introduces a novel method to convert multi-vector retrieval into single-vector retrieval using fixed dimensional encodings, enabling efficient and accurate information retrieval with theoretical guarantees and improved performance.
Contribution
The paper presents MUVERA, a new retrieval algorithm that approximates multi-vector similarity with single-vector encodings, reducing computational complexity and enabling use of standard MIPS solvers.
Findings
Achieves same recall as state-of-the-art heuristics with fewer candidates.
Retrieves 2-5 times fewer candidates for the same recall.
Improves end-to-end recall by 10% with 90% lower latency.
Abstract
Neural embedding models have become a fundamental component of modern information retrieval (IR) pipelines. These models produce a single embedding per data-point, allowing for fast retrieval via highly optimized maximum inner product search (MIPS) algorithms. Recently, beginning with the landmark ColBERT paper, multi-vector models, which produce a set of embedding per data point, have achieved markedly superior performance for IR tasks. Unfortunately, using these models for IR is computationally expensive due to the increased complexity of multi-vector retrieval and scoring. In this paper, we introduce MUVERA (MUlti-VEctor Retrieval Algorithm), a retrieval mechanism which reduces multi-vector similarity search to single-vector similarity search. This enables the usage of off-the-shelf MIPS solvers for multi-vector retrieval. MUVERA asymmetrically generates Fixed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗NeuML/pubmedbert-base-colbertmodel· 79 dl· ♡ 779 dl♡ 7
- 🤗NeuML/biomedbert-base-colbertmodel· 60 dl· ♡ 360 dl♡ 3
- 🤗NeuML/colbert-muvera-micromodel· 4 dl· ♡ 254 dl♡ 25
- 🤗NeuML/colbert-muvera-smallmodel· 4 dl· ♡ 114 dl♡ 11
- 🤗yjoonjang/colbert-ko-v1model· 15 dl· ♡ 1415 dl♡ 14
- 🤗NeuML/bert-hash-femtomodel· 73 dl· ♡ 1973 dl♡ 19
- 🤗NeuML/bert-hash-nanomodel· 178 dl· ♡ 16178 dl♡ 16
- 🤗NeuML/bert-hash-picomodel· 7 dl· ♡ 37 dl♡ 3
- 🤗NeuML/colbert-muvera-nanomodel· 435 dl· ♡ 3435 dl♡ 3
- 🤗NeuML/colbert-muvera-picomodel· 2 dl· ♡ 12 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques
MethodsSparse Evolutionary Training
