Efficient Vector Search in the Wild: One Model for Multi-K Queries
Yifan Peng, Jiafei Fan, Xingda Wei, Sijie Shen, Rong Chen, Jianning Wang, Xiaojian Luo, Wenyuan Yu, Jingren Zhou, Haibo Chen

TL;DR
OMEGA is a novel vector search method that generalizes across different K values, achieving high accuracy and efficiency in multi-K queries with minimal preprocessing, outperforming existing methods.
Contribution
The paper introduces OMEGA, a K-generalizable learned top-K search model that uses trajectory-based features and statistical properties to efficiently handle multiple K values.
Findings
OMEGA reduces average latency by 6-33% compared to state-of-the-art methods.
It achieves similar recall with only 16-30% of the preprocessing time.
OMEGA maintains high accuracy across various K values in diverse datasets.
Abstract
Learned top-K search is a promising approach for serving vector queries with both high accuracy and performance. However, current models trained for a specific K value fail to generalize to real-world multi-K queries: they suffer from accuracy degradation (for larger Ks) and performance loss (for smaller Ks). Training the model to generalize on different Ks requires orders of magnitude more preprocessing time and is not suitable for serving vector queries in the wild. We present OMEGA, a K-generalizable learned top-K search method that simultaneously achieves high accuracy, high performance, and low preprocessing cost for multi-K vector queries. The key idea is that a base model properly trained on K=1 with our trajectory-based features can be used to accurately predict larger Ks with a dynamic refinement procedure and smaller Ks with minimal performance loss. To make our refinements…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Data Management and Algorithms · Information Retrieval and Search Behavior
