Active Learning Via Sequential Design and Uncertainty Sampling

Jing Wang; Eunsik Park; Yuan-chin Ivan Chang

arXiv:1406.4676·stat.ME·June 19, 2014

Active Learning Via Sequential Design and Uncertainty Sampling

Jing Wang, Eunsik Park, Yuan-chin Ivan Chang

PDF

Open Access

TL;DR

This paper introduces a sequential active learning method combining Bayesian design and uncertainty sampling to efficiently build classifiers using minimal labeled data, with demonstrated effectiveness on synthetic and real datasets.

Contribution

It proposes a novel algorithm that integrates Bayesian sequential design with uncertainty sampling for active learning, enhancing classifier efficiency.

Findings

01

The method effectively reduces labeling costs.

02

Numerical experiments show improved classifier performance.

03

The approach is applicable to both synthetic and real data.

Abstract

Classification is an important task in many fields including biomedical research and machine learning. Traditionally, a classification rule is constructed based a bunch of labeled data. Recently, due to technological innovation and automatic data collection schemes, we easily encounter with data sets containing large amounts of unlabeled samples. Because to label each of them is usually costly and inefficient, how to utilize these unlabeled data in a classifier construction process becomes an important problem. In machine learning literature, active learning or semi-supervised learning are popular concepts discussed under this situation, where classification algorithms recruit new unlabeled subjects sequentially based on the information learned from previous stages of its learning process, and these new subjects are then labeled and included as new training samples. From a statistical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Advanced Statistical Process Monitoring