Passing the Baton: High Throughput Distributed Disk-Based Vector Search with BatANN
Nam Anh Dang (1), Ben Landrum (1), Ken Birman (1) ((1) Cornell University)

TL;DR
BatANN is a novel distributed disk-based vector search system that scales efficiently across multiple servers, maintaining high recall and low latency for billion-scale datasets using standard TCP.
Contribution
Introduces BatANN, the first open-source distributed disk-based vector search system with a global graph, achieving near-linear throughput scaling and improved locality.
Findings
BatANN achieves 3.5-5.59x the scatter-gather baseline at 1B points.
It attains 1.44-2.09x the throughput of DistributedANN.
Mean latency remains below 3 ms on standard TCP.
Abstract
Vector search underpins modern information-retrieval systems, including retrieval-augmented generation (RAG) pipelines and search engines over unstructured text and images. As datasets scale to billions of vectors, disk-based vector search has emerged as a practical solution. However, looking to the future, we must anticipate datasets too large for any single server and throughput demands that exceed the limits of locally attached SSDs. We present BatANN, a distributed disk-based approximate nearest neighbor (ANN) system that retains the logarithmic search efficiency of a single global graph while achieving near-linear throughput scaling in the number of servers. Our core innovation is that when accessing a neighborhood which is stored on another machine, we send the full state of the query to the other machine to continue executing there for improved locality. On 1B-point datasets at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
