Routing-Guided Learned Product Quantization for Graph-Based Approximate Nearest Neighbor Search
Qiang Yue, Xiaoliang Xu, Yuxiang Wang, Yikun Tao, Xuliyuan Luo

TL;DR
This paper introduces Routing-guided learned Product Quantization (RPQ), a novel end-to-end method that enhances graph-based Approximate Nearest Neighbor Search by embedding routing features into compact codes, significantly improving efficiency and accuracy on large-scale datasets.
Contribution
The paper proposes a differentiable, routing-aware product quantization method that integrates neighborhood and routing features into learned codes for improved graph-based ANNS performance.
Findings
Achieves 1.7x to 4.2x improvement in QPS at 95% recall@10.
Effectively embeds graph routing features into compact codes.
Demonstrates superior performance on datasets from 1 million to 1 billion points.
Abstract
Given a vector dataset , a query vector , graph-based Approximate Nearest Neighbor Search (ANNS) aims to build a proximity graph (PG) as an index of and approximately return vectors with minimum distances to by searching over the PG index. It suffers from the large-scale because a PG with full vectors is too large to fit into the memory, e.g., a billion-scale in 128 dimensions would consume nearly 600 GB memory. To solve this, Product Quantization (PQ) integrated graph-based ANNS is proposed to reduce the memory usage, using smaller compact codes of quantized vectors in memory instead of the large original vectors. Existing PQ methods do not consider the important routing features of PG, resulting in low-quality quantized vectors that affect the ANNS's effectiveness. In this paper, we present an end-to-end…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Video Surveillance and Tracking Methods
