Quake: Adaptive Indexing for Vector Search
Jason Mohoney, Devesh Sarda, Mengze Tang, Shihabur Rahman Chowdhury, Anil Pacaci, Ihab F. Ilyas, Theodoros Rekatsinas, Shivaram Venkataraman

TL;DR
Quake is an adaptive indexing system for vector search that dynamically adjusts to evolving data and access patterns, significantly improving latency and recall in dynamic workloads.
Contribution
It introduces a multi-level partitioning scheme with a cost model and recall estimation to optimize vector search under changing conditions.
Findings
Achieves 1.5-38x query latency reduction on dynamic workloads.
Reduces update latency by 4.5-126x compared to state-of-the-art.
Maintains high recall with adaptive indexing strategies.
Abstract
Vector search, the task of finding the k-nearest neighbors of a query vector against a database of high-dimensional vectors, underpins many machine learning applications, including retrieval-augmented generation, recommendation systems, and information retrieval. However, existing approximate nearest neighbor (ANN) methods perform poorly under dynamic and skewed workloads where data distributions evolve. We introduce Quake, an adaptive indexing system that maintains low latency and high recall in such environments. Quake employs a multi-level partitioning scheme that adjusts to updates and changing access patterns, guided by a cost model that predicts query latency based on partition sizes and access frequencies. Quake also dynamically sets query execution parameters to meet recall targets using a novel recall estimation model. Furthermore, Quake utilizes NUMA-aware intra-query…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Database Systems and Queries · Information Retrieval and Search Behavior
