DESSERT: An Efficient Algorithm for Vector Set Search with Vector Set Queries
Joshua Engels, Benjamin Coleman, Vihan Lakshman, Anshumali Shrivastava

TL;DR
DESSERT is a new approximate algorithm for vector set search that significantly speeds up semantic search tasks with minimal impact on recall, demonstrated within ColBERT on benchmark datasets.
Contribution
We introduce DESSERT, an efficient approximate search algorithm for vector set queries with strong theoretical guarantees and practical performance improvements.
Findings
2-5x speedup in retrieval benchmarks
Minimal loss in recall with DESSERT integration
Effective for semantic search applications
Abstract
We study the problem of with . This task is analogous to traditional near-neighbor search, with the exception that both the query and each element in the collection are of vectors. We identify this problem as a core subroutine for semantic search applications and find that existing solutions are unacceptably slow. Towards this end, we present a new approximate search algorithm, DESSERT (ESSERT ffeciently earches ets of mbeddings via etrieval ables). DESSERT is a general tool with strong theoretical guarantees and excellent empirical performance. When we integrate DESSERT into ColBERT, a state-of-the-art semantic search model, we find a 2-5x speedup on the MS MARCO and LoTTE retrieval benchmarks with minimal loss in recall, underscoring the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
