A Two-Stage Active Learning Algorithm for $k$-Nearest Neighbors
Nick Rittler, Kamalika Chaudhuri

TL;DR
This paper introduces the first active learning algorithm specifically designed for $k$-nearest neighbor classifiers, improving training efficiency while maintaining the classifier's core voting mechanism and achieving faster convergence under certain conditions.
Contribution
It presents a novel active learning algorithm for $k$-nearest neighbors that retains the voting property and provides theoretical guarantees of improved convergence rates.
Findings
The algorithm retains the $k$-nearest neighbor voting mechanism.
Active training leads to faster convergence to the Bayes optimal classifier.
Theoretical guarantees are provided under smoothness and noise conditions.
Abstract
-nearest neighbor classification is a popular non-parametric method because of desirable properties like automatic adaption to distributional scale changes. Unfortunately, it has thus far proved difficult to design active learning strategies for the training of local voting-based classifiers that naturally retain these desirable properties, and hence active learning strategies for -nearest neighbor classification have been conspicuously missing from the literature. In this work, we introduce a simple and intuitive active learning algorithm for the training of -nearest neighbor classifiers, the first in the literature which retains the concept of the -nearest neighbor vote at prediction time. We provide consistency guarantees for a modified -nearest neighbors classifier trained on samples acquired via our scheme, and show that when the conditional probability function…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Algorithms · Imbalanced Data Classification Techniques · Machine Learning and Data Classification
