Fast Online k-nn Graph Building
Thibault Debatty, Pietro Michiardi, Wim Mees

TL;DR
This paper introduces a fast online approximate k-nn graph construction algorithm that efficiently updates graphs with streaming data, utilizing distributed partitioning and improved search methods, and demonstrates comparable accuracy to offline methods.
Contribution
The paper presents a novel online k-nn graph building algorithm with a distributed partitioning strategy and enhanced search procedures, outperforming existing methods in speed while maintaining accuracy.
Findings
Produces graphs similar to offline algorithms
Requires fewer similarity computations
Effective in streaming data scenarios
Abstract
In this paper we propose an online approximate k-nn graph building algorithm, which is able to quickly update a k-nn graph using a flow of data points. One very important step of the algorithm consists in using the current distributed graph to search for the neighbors of a new node. Hence we also propose a distributed partitioning method based on balanced k-medoids clustering, that we use to optimize the distributed search process. Finally, we present the improved sequential search procedure that is used inside each partition. We also perform an experimental evaluation of the different algorithms, where we study the influence of the parameters and compare the result of our algorithms to existing state of the art. This experimental evaluation confirms that the fast online k-nn graph building algorithm produces a graph that is highly similar to the graph produced by an offline…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Graph Theory and Algorithms · Advanced Image and Video Retrieval Techniques
