TL;DR
This paper introduces a novel fast approximate k-NN search method combining multiple random projection trees with a voting scheme, reducing memory and computation for high-dimensional data.
Contribution
The paper presents a new approach that leverages redundancy in random projections to improve speed and efficiency of high-dimensional nearest neighbor searches.
Findings
Faster than existing methods on various datasets
Reduces memory footprint with sparse projections
Enables efficient computation through matrix multiplications
Abstract
Efficient index structures for fast approximate nearest neighbor queries are required in many applications such as recommendation systems. In high-dimensional spaces, many conventional methods suffer from excessive usage of memory and slow response times. We propose a method where multiple random projection trees are combined by a novel voting scheme. The key idea is to exploit the redundancy in a large number of candidate sets obtained by independently generated random projections in order to reduce the number of expensive exact distance evaluations. The method is straightforward to implement using sparse projections which leads to a reduced memory footprint and fast index construction. Furthermore, it enables grouping of the required computations into big matrix multiplications, which leads to additional savings due to cache effects and low-level parallelization. We demonstrate by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
