CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs
Hiroyuki Ootomo, Akira Naruse, Corey Nolet, Ray Wang, Tamas Feher,, Yong Wang

TL;DR
CAGRA introduces a GPU-optimized graph construction and search algorithm for approximate nearest neighbor search, achieving significant speedups over existing CPU and GPU methods while maintaining high accuracy.
Contribution
The paper presents a novel parallel graph construction and search algorithm tailored for GPUs, significantly improving efficiency and throughput in ANNS tasks.
Findings
CAGRA is 2.2~27x faster in graph construction than HNSW.
In large-batch searches at 90-95% recall, CAGRA is 33~77x faster than HNSW.
For single queries at 95% recall, CAGRA is 3.4~53x faster than HNSW.
Abstract
Approximate Nearest Neighbor Search (ANNS) plays a critical role in various disciplines spanning data mining and artificial intelligence, from information retrieval and computer vision to natural language processing and recommender systems. Data volumes have soared in recent years and the computational cost of an exhaustive exact nearest neighbor search is often prohibitive, necessitating the adoption of approximate techniques. The balanced performance and recall of graph-based approaches have more recently garnered significant attention in ANNS algorithms, however, only a few studies have explored harnessing the power of GPUs and multi-core processors despite the widespread use of massively parallel and general-purpose computing. To bridge this gap, we introduce a novel parallel computing hardware-based proximity graph and search algorithm. By leveraging the high-performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Data Management and Algorithms · Caching and Content Delivery
