HQANN: Efficient and Robust Similarity Search for Hybrid Queries with Structured and Unstructured Constraints
Wei Wu, Junlin He, Yu Qiao, Guoheng Fu, Li Liu, Jin Yu

TL;DR
HQANN is a novel hybrid query processing framework that significantly improves the efficiency of similarity searches involving both structured attributes and unstructured feature vectors, achieving high recall with low latency.
Contribution
It introduces a simple, embedding-compatible framework that fuses vector similarity search with attribute filtering, greatly enhancing hybrid query performance.
Findings
HQANN is 10x faster than existing solutions for the same recall.
It maintains high performance regardless of attribute complexity.
Achieves 99% recall@10 in around 50 microseconds on large datasets.
Abstract
The in-memory approximate nearest neighbor search (ANNS) algorithms have achieved great success for fast high-recall query processing, but are extremely inefficient when handling hybrid queries with unstructured (i.e., feature vectors) and structured (i.e., related attributes) constraints. In this paper, we present HQANN, a simple yet highly efficient hybrid query processing framework which can be easily embedded into existing proximity graph-based ANNS algorithms. We guarantee both low latency and high recall by leveraging navigation sense among attributes and fusing vector similarity search with attribute filtering. Experimental results on both public and in-house datasets demonstrate that HQANN is 10x faster than the state-of-the-art hybrid ANNS solutions to reach the same recall quality and its performance is hardly affected by the complexity of attributes. It can reach 99\%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
