QuIVer: Rethinking ANN Graph Topology via Training-Free Binary Quantization
Wenxuan Xiao, Zhiyou Wang, Chengcheng Li

TL;DR
QuIVer introduces a training-free binary quantization-based graph index for approximate nearest neighbor search, revealing conditions where BQ-native topology is effective and enabling significant performance improvements.
Contribution
The paper presents QuIVer, a novel system that constructs ANN graph topology directly in a binary quantized space without training, and defines the conditions for its effective application.
Findings
BQ-native topology is highly effective on cosine-native embeddings (>=88% Recall@10).
Moderate effectiveness observed on multimodal CLIP data (71--78%).
BQ-native topology is unsuitable for Euclidean-native or structureless data (<15%).
Abstract
Approximate nearest neighbor (ANN) graph indices such as HNSW and Vamana construct their edge topology in full-precision or high-fidelity quantized metric spaces, relegating binary quantization (BQ) to a post-hoc distance estimator during search. This paper asks a different question: Can binary quantization define the graph topology itself -- and if so, under what conditions? We study this question through QuIVer (Quantized Index for Vector Retrieval), a training-free ANN graph index that performs Vamana edge selection, diversity pruning, and beam-search navigation entirely within a 2-bit Sign-Magnitude BQ metric space, accessing float32 vectors only for final reranking. Systematic evaluation on twelve million-scale datasets reveals a sharp applicability boundary: BQ-native topology is highly effective on cosine-native contrastive-learning embeddings (>=88% Recall@10 at ef=64 across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
