Survey of Filtered Approximate Nearest Neighbor Search over the Vector-Scalar Hybrid Data
Yanjun Lin, Kai Zhang, Zhenying He, Yinan Jing, X. Sean Wang

TL;DR
This survey comprehensively reviews filtered approximate nearest neighbor search over vector-scalar hybrid data, proposing a new classification framework, analyzing query difficulty, and providing practical guidelines for researchers and practitioners.
Contribution
It formally defines hybrid datasets and queries, introduces a pruning-focused classification framework, and analyzes query difficulty, filling gaps in existing literature.
Findings
Proposed a broader, finer-grained classification framework for FANNS algorithms.
Analyzed the distribution-based difficulty of hybrid queries.
Provided practical recommendations and a dataset analysis toolkit.
Abstract
Filtered approximate nearest neighbor search (FANNS), an extension of approximate nearest neighbor search (ANNS) that incorporates scalar filters, has been widely applied to constrained retrieval of vector data. Despite its growing importance, no dedicated survey on FANNS over the vector-scalar hybrid data currently exists, and the field has several problems, including inconsistent definitions of the search problem, insufficient framework for algorithm classification, and incomplete analysis of query difficulty. This survey paper formally defines the concepts of hybrid dataset and hybrid query, as well as the corresponding evaluation metrics. Based on these, a pruning-focused framework is proposed to classify and summarize existing algorithms, providing a broader and finer-grained classification framework compared to the existing ones. In addition, a review is conducted on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Data Management and Algorithms · Remote-Sensing Image Classification
