Noisy Batch Active Learning with Deterministic Annealing
Gaurav Gupta, Anit Kumar Sahu, Wan-Yi Lin

TL;DR
This paper proposes a robust batch active learning method that incorporates model uncertainty and denoising techniques to improve training with noisy labels across image classification benchmarks.
Contribution
It introduces a novel noisy batch active learning approach with deterministic annealing and a denoising layer for deep networks, enhancing robustness to label noise.
Findings
Improved accuracy over existing active learning strategies on benchmark datasets.
Effective incorporation of model uncertainty to handle small training data.
Significant robustness gains with the denoising layer in deep networks.
Abstract
We study the problem of training machine learning models incrementally with batches of samples annotated with noisy oracles. We select each batch of samples that are important and also diverse via clustering and importance sampling. More importantly, we incorporate model uncertainty into the sampling probability to compensate for poor estimation of the importance scores when the training data is too small to build a meaningful model. Experiments on benchmark image classification datasets (MNIST, SVHN, CIFAR10, and EMNIST) show improvement over existing active learning strategies. We introduce an extra denoising layer to deep networks to make active learning robust to label noises and show significant improvements.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning
