A GPU-Accelerated Framework for Multi-Attribute Range Filtered Approximate Nearest Neighbor Search
Zhonggen Li, Haoran Yu, Zixuan Xu, Yifan Zhu, Yunjun Gao

TL;DR
Garfield is a GPU-accelerated framework for multi-attribute range filtered approximate nearest neighbor search that significantly improves throughput and reduces index size compared to existing CPU-based solutions.
Contribution
It introduces the GMG index and a hardware-aware execution pipeline to overcome bottlenecks in RFANNS, enabling efficient GPU-based indexing and querying.
Findings
Reduces index size by 4.4x
Achieves 119.8x higher throughput than state-of-the-art methods
Supports datasets exceeding GPU memory with out-of-core processing
Abstract
Range-filtered approximate nearest neighbor search (RFANNS) is increasingly critical for modern vector databases. However, existing solutions suffer from severe index inflation and construction overhead. Furthermore, they rely exclusively on CPUs for the heavy indexing and query processing, significantly restricting the throughput due to the limited memory bandwidth and parallelism. In this paper, we present Garfield, a GPU-accelerated framework for multi-attribute range filtered ANNS that overcomes these bottlenecks through designing a lightweight index structure and hardware-aware execution pipeline. Garfield introduces the GMG index, which partitions data into cells and builds local graph indexes. It guarantees linear storage and indexing overhead by adding a constant number of cross-cell edges. For queries, Garfield utilizes a cluster-guided ordering strategy that reorders…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
