NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing
Yitu Wang, Shiyu Li, Qilin Zheng, Linghao Song, Zongwang Li, Andrew, Chang, Hai "Helen" Li, Yiran Chen

TL;DR
NDSEARCH introduces a near-data processing system with in-storage computing architecture to significantly accelerate graph-traversal-based approximate nearest neighbor search, overcoming memory and I/O bottlenecks in large datasets.
Contribution
The paper presents NDSEARCH, a hardware-software co-designed NDP solution with a novel in-storage computing architecture and customized processing model for efficient ANNS processing.
Findings
Up to 31.7x throughput improvement over CPU
Two orders-of-magnitude higher energy efficiency than CPU and GPU
Significant acceleration compared to state-of-the-art systems
Abstract
Approximate nearest neighbor search (ANNS) is a key retrieval technique for vector database and many data center applications, such as person re-identification and recommendation systems. It is also fundamental to retrieval augmented generation (RAG) for large language models (LLM) now. Among all the ANNS algorithms, graph-traversal-based ANNS achieves the highest recall rate. However, as the size of dataset increases, the graph may require hundreds of gigabytes of memory, exceeding the main memory capacity of a single workstation node. Although we can do partitioning and use solid-state drive (SSD) as the backing storage, the limited SSD I/O bandwidth severely degrades the performance of the system. To address this challenge, we present NDSEARCH, a hardware-software co-designed near-data processing (NDP) solution for ANNS processing. NDSEARCH consists of a novel in-storage computing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Data Management and Algorithms · Advanced Image and Video Retrieval Techniques
