ScaleGANN: Accelerate Large-Scale ANN Indexing by Cost-effective Cloud GPUs
Lan Lu, Peiqi Yin, Isaac Yang, Tao Luo, Hua Fan, Wenchao Zhou, Feifei Li, Boon Thau Loo

TL;DR
ScaleGANN is a system that accelerates large-scale approximate nearest neighbor indexing using cost-effective cloud GPUs, achieving significant speedups and cost savings over existing methods.
Contribution
It introduces a novel divide-and-merge indexing approach, optimized resource allocation, and a task scheduler for efficient large-scale ANN construction on low-cost spot GPUs.
Findings
Up to 9x faster index build time compared to DiskANN.
Achieves 6x cost reduction while maintaining quality.
Effective handling of large, high-dimensional datasets.
Abstract
Graph-based ANNS algorithms have gained increasing research interest and market adoption due to their efficiency and accuracy in retrieval. Existing approaches primarily rely on CPUs for graph index construction and retrieval, but this often requires significant time, especially for large-scale and high-dimensional datasets. Some studies have explored GPU-based solutions. However, GPUs are costly and their limited memory makes handling large datasets challenging. In this paper, we propose a novel end-to-end system ScaleGANN that enables users to efficiently construct graph indexes for large-scale, high-dimensional datasets by leveraging low-cost spot GPU resources in a distributed cloud system. ScaleGANN utilized the idea of divide-and-merge, with an optimized vector partitioning algorithm to further improve the indexing time and space efficiency while guaranteeing good index quality.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
