Quantitative Comparison of Nearest Neighbor Search Algorithms
Hanitriniala Malalatiana Rakotondrasoa (1,2,4), Martin Bucher, (1,2,3,4), Ilya Sinayskiy (1,4) ((1) School of Chemistry, Physics,, University of KwaZulu-Natal, Durban, South Africa (2) School of Data Science, and Computational Thinking, University of Stellenbosch, Stellenbosch

TL;DR
This paper provides a comprehensive quantitative comparison of three popular nearest neighbor search algorithms—Orchard, ball tree, and VP-tree—analyzing their efficiency relative to dataset size and dimension to guide optimal algorithm selection.
Contribution
It introduces derived fitting functions for each algorithm's efficiency based on dataset size and dimension, offering new insights into their performance characteristics.
Findings
Ball tree and VP-tree outperform Orchard in high-dimensional spaces.
Efficiency varies significantly with dataset size and dimension.
The derived functions accurately predict algorithm performance.
Abstract
We compare the performance of three nearest neighbor search algorithms: the Orchard, ball tree, and VP-tree algorithms. These algorithms are commonly used for nearest-neighbor searches and are known for their efficiency in large datasets. We analyze the fraction of distances computed in relation to the size of the dataset and its dimension. For each algorithm we derive a fitting function for the efficiency as a function to set size and dimension. The article aims to provide a comprehensive analysis of the performance of these algorithms and help researchers and practitioners choose the best algorithm for their specific application.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Data Mining Algorithms and Applications · Machine Learning and Data Classification
