A Divide-and-Conquer Approach to Geometric Sampling for Active Learning

Xiaofeng Cao

arXiv:1805.12321·cs.LG·September 29, 2020

A Divide-and-Conquer Approach to Geometric Sampling for Active Learning

Xiaofeng Cao

PDF

Open Access

TL;DR

This paper introduces a geometric sampling method for active learning that bypasses traditional uncertainty evaluation, using clustering and a knight's tour strategy to improve classification with fewer labels.

Contribution

It proposes a novel divide-and-conquer geometric sampling approach for active learning, eliminating dependence on uncertainty evaluation and enhancing performance.

Findings

01

GAL significantly outperforms state-of-the-art baselines in experiments.

02

The method effectively detects cluster boundaries and improves classification accuracy.

03

The approach reduces labeling effort while maintaining high performance.

Abstract

Active learning (AL) repeatedly trains the classifier with the minimum labeling budget to improve the current classification model. The training process is usually supervised by an uncertainty evaluation strategy. However, the uncertainty evaluation always suffers from performance degeneration when the initial labeled set has insufficient labels. To completely eliminate the dependence on the uncertainty evaluation sampling in AL, this paper proposes a divide-and-conquer idea that directly transfers the AL sampling as the geometric sampling over the clusters. By dividing the points of the clusters into cluster boundary and core points, we theoretically discuss their margin distance and {hypothesis relationship}. With the advantages of cluster boundary points in the above two properties, we propose a Geometric Active Learning (GAL) algorithm by knight's tour. Experimental studies of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Analytical Chemistry and Chromatography