HAKES: Scalable Vector Database for Embedding Search Service

Guoyu Hu; Shaofeng Cai; Tien Tuan Anh Dinh; Zhongle Xie; Cong Yue; Gang Chen; Beng Chin Ooi

arXiv:2505.12524·cs.DB·October 1, 2025

HAKES: Scalable Vector Database for Embedding Search Service

Guoyu Hu, Shaofeng Cai, Tien Tuan Anh Dinh, Zhongle Xie, Cong Yue, Gang Chen, Beng Chin Ooi

PDF

TL;DR

HAKES is a scalable, high-throughput distributed vector database designed for efficient approximate nearest neighbor search in high-dimensional embedding spaces, supporting concurrent read-write workloads with improved recall.

Contribution

The paper introduces a novel two-stage ANN index with dynamic search adaptation, a lightweight parameter tuning method, and a disaggregated architecture for scalable, concurrent vector search.

Findings

01

Outperforms state-of-the-art indexes in high recall and concurrency.

02

Achieves up to 16x higher throughput than baselines.

03

Demonstrates scalability on high-dimensional datasets.

Abstract

Modern deep learning models capture the semantics of complex data by transforming them into high-dimensional embedding vectors. Emerging applications, such as retrieval-augmented generation, use approximate nearest neighbor (ANN) search in the embedding vector space to find similar data. Existing vector databases provide indexes for efficient ANN searches, with graph-based indexes being the most popular due to their low latency and high recall in real-world high-dimensional datasets. However, these indexes are costly to build, suffer from significant contention under concurrent read-write workloads, and scale poorly to multiple servers. Our goal is to build a vector database that achieves high throughput and high recall under concurrent read-write workloads. To this end, we first propose an ANN index with an explicit two-stage design combining a fast filter stage with highly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.