Understanding and Improving Proximity Graph based Maximum Inner Product Search
Jie Liu, Xiao Yan, Xinyan Dai, Zhirong Li, James Cheng, Ming-Chang, Yang

TL;DR
This paper analyzes the reasons behind the high performance of the ip-NSW method for approximate maximum inner product search, revealing a norm bias and proposing an improved algorithm, ip-NSW+ with better robustness and efficiency.
Contribution
It uncovers the norm bias in MIPS and explains ip-NSW's success, then introduces ip-NSW+ which enhances search accuracy and robustness through an additional angular proximity graph.
Findings
ip-NSW exploits the norm bias by focusing on large norm items.
ip-NSW+ outperforms ip-NSW in speed and robustness.
The angular graph improves candidate selection for MIPS.
Abstract
The inner-product navigable small world graph (ip-NSW) represents the state-of-the-art method for approximate maximum inner product search (MIPS) and it can achieve an order of magnitude speedup over the fastest baseline. However, to date it is still unclear where its exceptional performance comes from. In this paper, we show that there is a strong norm bias in the MIPS problem, which means that the large norm items are very likely to become the result of MIPS. Then we explain the good performance of ip-NSW as matching the norm bias of the MIPS problem - large norm items have big in-degrees in the ip-NSW proximity graph and a walk on the graph spends the majority of computation on these items, thus effectively avoids unnecessary computation on small norm items. Furthermore, we propose the ip-NSW+ algorithm, which improves ip-NSW by introducing an additional angular proximity graph.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Image and Video Retrieval Techniques · Web Data Mining and Analysis
