Similarity search in the blink of an eye with compressed indices

Cecilia Aguerrebere; Ishwar Bhati; Mark Hildebrand; Mariano Tepper,; Ted Willke

arXiv:2304.04759·cs.LG·July 26, 2023·1 cites

Similarity search in the blink of an eye with compressed indices

Cecilia Aguerrebere, Ishwar Bhati, Mark Hildebrand, Mariano Tepper,, Ted Willke

PDF

Open Access 1 Repo

TL;DR

This paper introduces LVQ, a novel vector compression technique, and a high-performance system for graph-based similarity search, significantly improving speed and reducing memory usage for billion-scale datasets.

Contribution

The paper presents LVQ, a new vector compression method, combined with a high-performance system, establishing state-of-the-art results in speed and memory efficiency for similarity search.

Findings

01

LVQ improves search throughput by up to 20.7x in low-memory scenarios.

02

LVQ reduces memory footprint by up to 3x.

03

The combined system outperforms existing methods in both speed and memory usage.

Abstract

Nowadays, data is represented by vectors. Retrieving those vectors, among millions and billions, that are similar to a given query is a ubiquitous problem, known as similarity search, of relevance for a wide range of applications. Graph-based indices are currently the best performing techniques for billion-scale similarity search. However, their random-access memory pattern presents challenges to realize their full potential. In this work, we present new techniques and systems for creating faster and smaller graph-based indices. To this end, we introduce a novel vector compression method, Locally-adaptive Vector Quantization (LVQ), that uses per-vector scaling and scalar quantization to improve search performance with fast similarity computations and a reduced effective bandwidth, while decreasing memory footprint and barely impacting accuracy. LVQ, when combined with a new…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

IntelLabs/ScalableVectorSearch
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Machine Learning in Bioinformatics