Fast kNN mode seeking clustering applied to active learning
Robert P.W. Duin, Sergey Verzakov

TL;DR
This paper introduces a faster kNN mode seeking clustering algorithm suitable for high-dimensional data, enabling efficient multi-scale clustering and improving active learning classification performance on large datasets.
Contribution
A novel, computationally efficient kNN mode seeking clustering algorithm with multi-scale hierarchy capability, applicable to high-dimensional data and active learning.
Findings
Algorithm achieves O(n^1.5) time complexity.
Effective clustering of millions of objects within minutes.
Clustering-based classification outperforms traditional classifiers.
Abstract
A significantly faster algorithm is presented for the original kNN mode seeking procedure. It has the advantages over the well-known mean shift algorithm that it is feasible in high-dimensional vector spaces and results in uniquely, well defined modes. Moreover, without any additional computational effort it may yield a multi-scale hierarchy of clusterings. The time complexity is just O(n^1.5). resulting computing times range from seconds for 10^4 objects to minutes for 10^5 objects and to less than an hour for 10^6 objects. The space complexity is just O(n). The procedure is well suited for finding large sets of small clusters and is thereby a candidate to analyze thousands of clusters in millions of objects. The kNN mode seeking procedure can be used for active learning by assigning the clusters to the class of the modal objects of the clusters. Its feasibility is shown by some…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMechanical and Optical Resonators · Monoclonal and Polyclonal Antibodies Research · Force Microscopy Techniques and Applications
