Active Learning Via Sequential Design and Uncertainty Sampling
Jing Wang, Eunsik Park, Yuan-chin Ivan Chang

TL;DR
This paper introduces a sequential active learning method combining Bayesian design and uncertainty sampling to efficiently build classifiers using minimal labeled data, with demonstrated effectiveness on synthetic and real datasets.
Contribution
It proposes a novel algorithm that integrates Bayesian sequential design with uncertainty sampling for active learning, enhancing classifier efficiency.
Findings
The method effectively reduces labeling costs.
Numerical experiments show improved classifier performance.
The approach is applicable to both synthetic and real data.
Abstract
Classification is an important task in many fields including biomedical research and machine learning. Traditionally, a classification rule is constructed based a bunch of labeled data. Recently, due to technological innovation and automatic data collection schemes, we easily encounter with data sets containing large amounts of unlabeled samples. Because to label each of them is usually costly and inefficient, how to utilize these unlabeled data in a classifier construction process becomes an important problem. In machine learning literature, active learning or semi-supervised learning are popular concepts discussed under this situation, where classification algorithms recruit new unlabeled subjects sequentially based on the information learned from previous stages of its learning process, and these new subjects are then labeled and included as new training samples. From a statistical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Advanced Statistical Process Monitoring
