SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes
Minghan Li, Sheng-Chieh Lin, Xueguang Ma, Jimmy Lin

TL;DR
SLIM introduces a sparse, high-dimensional lexical space for multi-vector retrieval, enabling efficient, practical implementations compatible with existing search libraries while maintaining competitive accuracy.
Contribution
It pioneers the use of sparse token representations for multi-vector retrieval, combining inverted index efficiency with late interaction accuracy.
Findings
Achieves competitive accuracy on MS MARCO Passages and BEIR datasets.
Much smaller and faster than ColBERT on CPUs.
Fully compatible with off-the-shelf search libraries like Lucene.
Abstract
This paper introduces Sparsified Late Interaction for Multi-vector (SLIM) retrieval with inverted indexes. Multi-vector retrieval methods have demonstrated their effectiveness on various retrieval datasets, and among them, ColBERT is the most established method based on the late interaction of contextualized token embeddings of pre-trained language models. However, efficient ColBERT implementations require complex engineering and cannot take advantage of off-the-shelf search libraries, impeding their practical use. To address this issue, SLIM first maps each contextualized token vector to a sparse, high-dimensional lexical space before performing late interaction between these sparse token embeddings. We then introduce an efficient two-stage retrieval architecture that includes inverted index retrieval followed by a score refinement module to approximate the sparsified late interaction,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Image Retrieval and Classification Techniques · Neural Networks and Applications
