A Two-Stage Active Learning Algorithm for $k$-Nearest Neighbors

Nick Rittler; Kamalika Chaudhuri

arXiv:2211.10773·cs.LG·August 22, 2023·1 cites

A Two-Stage Active Learning Algorithm for $k$-Nearest Neighbors

Nick Rittler, Kamalika Chaudhuri

PDF

Open Access 1 Video

TL;DR

This paper introduces the first active learning algorithm specifically designed for $k$-nearest neighbor classifiers, improving training efficiency while maintaining the classifier's core voting mechanism and achieving faster convergence under certain conditions.

Contribution

It presents a novel active learning algorithm for $k$-nearest neighbors that retains the voting property and provides theoretical guarantees of improved convergence rates.

Findings

01

The algorithm retains the $k$-nearest neighbor voting mechanism.

02

Active training leads to faster convergence to the Bayes optimal classifier.

03

Theoretical guarantees are provided under smoothness and noise conditions.

Abstract

$k$ -nearest neighbor classification is a popular non-parametric method because of desirable properties like automatic adaption to distributional scale changes. Unfortunately, it has thus far proved difficult to design active learning strategies for the training of local voting-based classifiers that naturally retain these desirable properties, and hence active learning strategies for $k$ -nearest neighbor classification have been conspicuously missing from the literature. In this work, we introduce a simple and intuitive active learning algorithm for the training of $k$ -nearest neighbor classifiers, the first in the literature which retains the concept of the $k$ -nearest neighbor vote at prediction time. We provide consistency guarantees for a modified $k$ -nearest neighbors classifier trained on samples acquired via our scheme, and show that when the conditional probability function…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

A Two-Stage Active Learning Algorithm for k-Nearest Neighbors· slideslive

Taxonomy

TopicsMachine Learning and Algorithms · Imbalanced Data Classification Techniques · Machine Learning and Data Classification