BBC: Improving Large-k Approximate Nearest Neighbor Search with a Bucket-based Result Collector
Ziqi Yin, Gao Cong, Kai Zeng, Jinwei Zhu, Bin Cui

TL;DR
This paper introduces BBC, a bucket-based result collector that significantly improves the efficiency of large-k approximate nearest neighbor searches using quantization methods.
Contribution
The paper proposes a novel bucket-based result collector and re-ranking algorithms that enhance large-k ANN search performance for quantization-based indexes.
Findings
BBC accelerates quantization-based ANN methods by up to 3.8x at high recall.
The bucket-based buffer reduces ranking costs and improves cache efficiency.
Re-ranking algorithms decrease candidate re-ranking overhead.
Abstract
Although Approximate Nearest Neighbor (ANN) search has been extensively studied, large-k ANN queries that aim to retrieve a large number of nearest neighbors remain underexplored, despite their numerous real-world applications. Existing ANN methods face significant performance degradation for such queries. In this work, we first investigate the reasons for the performance degradation of quantization-based ANN indexes: (1) the inefficiency of existing top-k collectors, which incurs significant overhead in candidate maintenance, and (2) the reduced pruning effectiveness of quantization methods, which leads to a costly re-ranking process. To address this, we propose a novel bucket-based result collector (BBC) to enhance the efficiency of existing quantization-based ANN indexes for large-k ANN queries. BBC introduces two key components: (1) a bucket-based result buffer that organizes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
