DiskANN++: Efficient Page-based Search over Isomorphic Mapped Graph Index using Query-sensitivity Entry Vertex
Jiongkang Ni, Xiaoliang Xu, Yuxiang Wang, Can Li, Jiajie, Yao, Shihai Xiao, Xuecang Zhang

TL;DR
DiskANN++ enhances large-scale approximate nearest neighbor search by optimizing SSD layout and entry vertex selection, significantly improving query throughput while maintaining high accuracy.
Contribution
It introduces a query-sensitive entry vertex selection and an isomorphic graph mapping to optimize SSD layout and reduce I/O, outperforming DiskANN in efficiency.
Findings
Achieves 1.5X to 2.2X QPS improvement over DiskANN.
Reduces I/O requests through optimized SSD layout.
Maintains comparable search accuracy.
Abstract
Given a vector dataset and a query vector , graph-based Approximate Nearest Neighbor Search (ANNS) aims to build a graph index and approximately return vectors with minimum distances to by searching over . The main drawback of graph-based ANNS is that a graph index would be too large to fit into the memory especially for a large-scale . To solve this, a Product Quantization (PQ)-based hybrid method called DiskANN is proposed to store a low-dimensional PQ index in memory and retain a graph index in SSD, thus reducing memory overhead while ensuring a high search accuracy. However, it suffers from two I/O issues that significantly affect the overall efficiency: (1) long routing path from an entry vertex to the query's neighborhood that results in large number of I/O requests and (2) redundant I/O requests during the routing process.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Data Management and Algorithms · Advanced Image and Video Retrieval Techniques
MethodsConvolution · Non Maximum Suppression · 1x1 Convolution · SSD
