An In-Depth Study of Filter-Agnostic Vector Search on a PostgreSQL Database System: [Experiments and Analysis]
Duo Lu, Helena Caminal, Manos Chatzakis, Yannis Papakonstantinou, Yannis Chronis, Vaibhav Jain, Fatma \"Ozcan

TL;DR
This paper provides an in-depth analysis of filter-agnostic vector search algorithms within a production PostgreSQL system, revealing that system-level overheads significantly influence algorithm performance and guiding practical algorithm selection.
Contribution
It systematically evaluates FVS algorithms in a real database environment, highlighting the importance of system-aware considerations over theoretical efficiency.
Findings
Graph-based approaches incur high filter checks and overheads.
Clustering-based indexes often outperform graph-based methods in real systems.
Optimal algorithm choice depends on workload and data access costs.
Abstract
Filtered Vector Search (FVS) is critical for supporting semantic search and GenAI applications in modern database systems. However, existing research most often evaluates algorithms in specialized libraries, making optimistic assumptions that do not align with enterprise-grade database systems. Our work challenges this premise by demonstrating that in a production-grade database system, commonly made assumptions do not hold, leading to performance characteristics and algorithmic trade-offs that are fundamentally different from those observed in isolated library settings. This paper presents the first in-depth analysis of filter-agnostic FVS algorithms within a production PostgreSQL-compatible system. We systematically evaluate post-filtering and inline-filtering strategies across a wide range of selectivities and correlations. Our central finding is that the optimal algorithm is not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Cloud Computing and Resource Management · Graph Theory and Algorithms
