In-Place Updates of a Graph Index for Streaming Approximate Nearest Neighbor Search
Haike Xu, Magdalen Dobson Manohar, Philip A. Bernstein, Badrish, Chandramouli, Richard Wen, and Harsha Vardhan Simhadri

TL;DR
This paper introduces IP-DiskANN, an innovative graph index update algorithm that processes insertions and deletions in-place, maintaining stable recall and outperforming batch methods in streaming approximate nearest neighbor search scenarios.
Contribution
IP-DiskANN is the first algorithm to enable in-place updates for graph indices in streaming ANNS, eliminating the need for batch consolidation while maintaining high recall stability.
Findings
IP-DiskANN maintains stable recall over long update sequences.
It achieves higher query throughput and update speed than batch consolidation methods.
Outperforms HNSW in various benchmarks.
Abstract
Indices for approximate nearest neighbor search (ANNS) are a basic component for information retrieval and widely used in database, search, recommendation and RAG systems. In these scenarios, documents or other objects are inserted into and deleted from the working set at a high rate, requiring a stream of updates to the vector index. Algorithms based on proximity graph indices are the most efficient indices for ANNS, winning many benchmark competitions. However, it is challenging to update such graph index at a high rate, while supporting stable recall after many updates. Since the graph is singly-linked, deletions are hard because there is no fast way to find in-neighbors of a deleted vertex. Therefore, to update the graph, state-of-the-art algorithms such as FreshDiskANN accumulate deletions in a batch and periodically consolidate, removing edges to deleted vertices and modifying the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Data Stream Mining Techniques · Advanced Image and Video Retrieval Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Adam · Softmax · Dropout · Weight Decay · BART · Linear Layer · WordPiece
