Locally-Adaptive Quantization for Streaming Vector Search
Cecilia Aguerrebere, Mark Hildebrand, Ishwar Singh Bhati and, Theodore Willke, Mariano Tepper

TL;DR
This paper evaluates Locally-Adaptive Vector Quantization (LVQ) and its new variants in streaming similarity search, demonstrating significant performance improvements and robustness to data distribution shifts, and introduces an open-source library for high-performance vector search.
Contribution
The paper introduces Turbo LVQ and multi-means LVQ, enhancing LVQ's performance in streaming similarity search and providing the first comprehensive evaluation of LVQ in evolving data scenarios.
Findings
LVQ and variants outperform competitors by up to 9.4x in static data.
LVQ variants improve search speed by up to 28%.
Robust performance under data distribution shifts.
Abstract
Retrieving the most similar vector embeddings to a given query among a massive collection of vectors has long been a key component of countless real-world applications. The recently introduced Retrieval-Augmented Generation is one of the most prominent examples. For many of these applications, the database evolves over time by inserting new data and removing outdated data. In these cases, the retrieval problem is known as streaming similarity search. While Locally-Adaptive Vector Quantization (LVQ), a highly efficient vector compression method, yields state-of-the-art search performance for non-evolving databases, its usefulness in the streaming setting has not been yet established. In this work, we study LVQ in streaming similarity search. In support of our evaluation, we introduce two improvements of LVQ: Turbo LVQ and multi-means LVQ that boost its search performance by up to 28% and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Metaheuristic Optimization Algorithms Research · Algorithms and Data Compression
MethodsLib
