HAKES: Scalable Vector Database for Embedding Search Service
Guoyu Hu, Shaofeng Cai, Tien Tuan Anh Dinh, Zhongle Xie, Cong Yue, Gang Chen, Beng Chin Ooi

TL;DR
HAKES is a scalable, high-throughput distributed vector database designed for efficient approximate nearest neighbor search in high-dimensional embedding spaces, supporting concurrent read-write workloads with improved recall.
Contribution
The paper introduces a novel two-stage ANN index with dynamic search adaptation, a lightweight parameter tuning method, and a disaggregated architecture for scalable, concurrent vector search.
Findings
Outperforms state-of-the-art indexes in high recall and concurrency.
Achieves up to 16x higher throughput than baselines.
Demonstrates scalability on high-dimensional datasets.
Abstract
Modern deep learning models capture the semantics of complex data by transforming them into high-dimensional embedding vectors. Emerging applications, such as retrieval-augmented generation, use approximate nearest neighbor (ANN) search in the embedding vector space to find similar data. Existing vector databases provide indexes for efficient ANN searches, with graph-based indexes being the most popular due to their low latency and high recall in real-world high-dimensional datasets. However, these indexes are costly to build, suffer from significant contention under concurrent read-write workloads, and scale poorly to multiple servers. Our goal is to build a vector database that achieves high throughput and high recall under concurrent read-write workloads. To this end, we first propose an ANN index with an explicit two-stage design combining a fast filter stage with highly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
