ACORN: Performant and Predicate-Agnostic Search Over Vector Embeddings and Structured Data
Liana Patel, Peter Kraft, Carlos Guestrin, Matei Zaharia

TL;DR
ACORN is a novel hybrid search method that efficiently supports predicate-agnostic queries over vector and structured data, outperforming prior approaches in speed and flexibility.
Contribution
ACORN introduces a predicate-agnostic hybrid search approach based on HNSW, enabling efficient, flexible querying over mixed data modalities with broad predicate support.
Findings
Achieves 2-1000x higher throughput at fixed recall compared to prior methods.
Supports a wide range of predicate sets and query semantics.
Demonstrates state-of-the-art performance on multiple benchmark datasets.
Abstract
Applications increasingly leverage mixed-modality data, and must jointly search over vector data, such as embedded images, text and video, as well as structured data, such as attributes and keywords. Proposed methods for this hybrid search setting either suffer from poor performance or support a severely restricted set of search predicates (e.g., only small sets of equality predicates), making them impractical for many applications. To address this, we present ACORN, an approach for performant and predicate-agnostic hybrid search. ACORN builds on Hierarchical Navigable Small Worlds (HNSW), a state-of-the-art graph-based approximate nearest neighbor index, and can be implemented efficiently by extending existing HNSW libraries. ACORN introduces the idea of predicate subgraph traversal to emulate a theoretically ideal, but impractical, hybrid search strategy. ACORN's predicate-agnostic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare
MethodsSparse Evolutionary Training
