A GPU-Accelerated Framework for Multi-Attribute Range Filtered Approximate Nearest Neighbor Search

Zhonggen Li; Haoran Yu; Zixuan Xu; Yifan Zhu; Yunjun Gao

arXiv:2604.20121·cs.DB·April 28, 2026

A GPU-Accelerated Framework for Multi-Attribute Range Filtered Approximate Nearest Neighbor Search

Zhonggen Li, Haoran Yu, Zixuan Xu, Yifan Zhu, Yunjun Gao

PDF

TL;DR

Garfield is a GPU-accelerated framework for multi-attribute range filtered approximate nearest neighbor search that significantly improves throughput and reduces index size compared to existing CPU-based solutions.

Contribution

It introduces the GMG index and a hardware-aware execution pipeline to overcome bottlenecks in RFANNS, enabling efficient GPU-based indexing and querying.

Findings

01

Reduces index size by 4.4x

02

Achieves 119.8x higher throughput than state-of-the-art methods

03

Supports datasets exceeding GPU memory with out-of-core processing

Abstract

Range-filtered approximate nearest neighbor search (RFANNS) is increasingly critical for modern vector databases. However, existing solutions suffer from severe index inflation and construction overhead. Furthermore, they rely exclusively on CPUs for the heavy indexing and query processing, significantly restricting the throughput due to the limited memory bandwidth and parallelism. In this paper, we present Garfield, a GPU-accelerated framework for multi-attribute range filtered ANNS that overcomes these bottlenecks through designing a lightweight index structure and hardware-aware execution pipeline. Garfield introduces the GMG index, which partitions data into cells and builds local graph indexes. It guarantees linear storage and indexing overhead by adding a constant number of cross-cell edges. For queries, Garfield utilizes a cluster-guided ordering strategy that reorders…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.