SIEVE: Effective Filtered Vector Search with Collection of Indexes

Zhaoheng Li; Silu Huang; Wei Ding; Yongjoo Park; Jianjun Chen

arXiv:2507.11907·cs.DB·July 22, 2025

SIEVE: Effective Filtered Vector Search with Collection of Indexes

Zhaoheng Li, Silu Huang, Wei Ding, Yongjoo Park, Jianjun Chen

PDF

TL;DR

SIEVE introduces a collection of specialized indexes for filtered vector search, enabling efficient retrieval with hard predicates by workload-aware index selection, outperforming existing methods in speed and resource usage.

Contribution

The paper presents a novel collection-based indexing approach for filtered vector search, with an analytical model for index selection and construction, improving efficiency and flexibility.

Findings

01

Achieves up to 8.06x speedup over existing methods.

02

Uses as low as 1% build time compared to other indexes.

03

Requires less than 2.15x memory of standard HNSW graph.

Abstract

Many real-world tasks such as recommending videos with the kids tag can be reduced to finding most similar vectors associated with hard predicates. This task, filtered vector search, is challenging as prior state-of-the-art graph-based (unfiltered) similarity search techniques quickly degenerate when hard constraints are considered. That is, effective graph-based filtered similarity search relies on sufficient connectivity for reaching the most similar items within just a few hops. To consider predicates, recent works propose modifying graph traversal to visit only the items that may satisfy predicates. However, they fail to offer the just-a-few-hops property for a wide range of predicates: they must restrict predicates significantly or lose efficiency if only a small fraction of items satisfy predicates. We propose an opposite approach: instead of constraining traversal, we build…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.