ScaleGANN: Accelerate Large-Scale ANN Indexing by Cost-effective Cloud GPUs

Lan Lu; Peiqi Yin; Isaac Yang; Tao Luo; Hua Fan; Wenchao Zhou; Feifei Li; Boon Thau Loo

arXiv:2605.10135·cs.DB·May 12, 2026

ScaleGANN: Accelerate Large-Scale ANN Indexing by Cost-effective Cloud GPUs

Lan Lu, Peiqi Yin, Isaac Yang, Tao Luo, Hua Fan, Wenchao Zhou, Feifei Li, Boon Thau Loo

PDF

TL;DR

ScaleGANN is a system that accelerates large-scale approximate nearest neighbor indexing using cost-effective cloud GPUs, achieving significant speedups and cost savings over existing methods.

Contribution

It introduces a novel divide-and-merge indexing approach, optimized resource allocation, and a task scheduler for efficient large-scale ANN construction on low-cost spot GPUs.

Findings

01

Up to 9x faster index build time compared to DiskANN.

02

Achieves 6x cost reduction while maintaining quality.

03

Effective handling of large, high-dimensional datasets.

Abstract

Graph-based ANNS algorithms have gained increasing research interest and market adoption due to their efficiency and accuracy in retrieval. Existing approaches primarily rely on CPUs for graph index construction and retrieval, but this often requires significant time, especially for large-scale and high-dimensional datasets. Some studies have explored GPU-based solutions. However, GPUs are costly and their limited memory makes handling large datasets challenging. In this paper, we propose a novel end-to-end system ScaleGANN that enables users to efficiently construct graph indexes for large-scale, high-dimensional datasets by leveraging low-cost spot GPU resources in a distributed cloud system. ScaleGANN utilized the idea of divide-and-merge, with an optimized vector partitioning algorithm to further improve the indexing time and space efficiency while guaranteeing good index quality.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.