SQUASH: Serverless and Distributed Quantization-based Attributed Vector Similarity Search
Joe Oakley, Hakan Ferhatosmanoglu

TL;DR
SQUASH is a novel fully serverless vector similarity search system that efficiently supports hybrid queries, achieves high accuracy, reduces costs, and outperforms existing solutions through innovative storage, pruning, and invocation techniques.
Contribution
It introduces SQUASH, the first fully serverless vector search system with hybrid query support, optimized quantization, and a multi-level pruning workflow for improved performance and cost-efficiency.
Findings
Significant performance improvements over state-of-the-art solutions
Lower operational costs in serverless environments
Effective hybrid query support with high accuracy
Abstract
Vector similarity search presents significant challenges in terms of scalability for large and high-dimensional datasets, as well as in providing native support for hybrid queries. Serverless computing and cloud functions offer attractive benefits such as elasticity and cost-effectiveness, but are difficult to apply to data-intensive workloads. Jointly addressing these two main challenges, we present SQUASH, the first fully serverless vector search solution with rich support for hybrid queries. It features OSQ, an optimized and highly parallelizable quantization-based approach for vectors and attributes. Its segment-based storage mechanism enables significant compression in resource-constrained settings and offers efficient dimensional extraction operations. SQUASH performs a single distributed pass to guarantee the return of sufficiently many vectors satisfying the filter predicate,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Data Management and Algorithms · Neural Networks and Applications
