ProMIPS: Efficient High-Dimensional c-Approximate Maximum Inner Product Search with a Lightweight Index
Yang Song, Yu Gu, Rui Zhang, Ge Yu

TL;DR
ProMIPS introduces a lightweight, efficient approach for high-dimensional c-Approximate Maximum Inner Product search using low-dimensional projections and a novel probing method, outperforming existing methods in speed and accuracy.
Contribution
The paper presents a new c-AMIP search method with a lightweight index, combining random projections and Quick-Probe to improve efficiency and accuracy over state-of-the-art techniques.
Findings
Requires less index space and pre-processing time.
Achieves higher search quality in ratio and recall.
Reduces page accesses and running time.
Abstract
Due to the wide applications in recommendation systems, multi-class label prediction and deep learning, the Maximum Inner Product (MIP) search problem has received extensive attention in recent years. Faced with large-scale datasets containing high-dimensional feature vectors, the state-of-the-art LSH-based methods usually require a large number of hash tables or long hash codes to ensure the searching quality, which takes up lots of index space and causes excessive disk page accesses. In this paper, we relax the guarantee of accuracy for efficiency and propose an efficient method for c-Approximate Maximum Inner Product (c-AMIP) search with a lightweight iDistance index. We project high-dimensional points to low-dimensional ones via 2-stable random projections and derive probability-guaranteed searching conditions, by which the c-AMIP results can be guaranteed in accuracy with arbitrary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Text and Document Classification Technologies
