Noisy Batch Active Learning with Deterministic Annealing

Gaurav Gupta; Anit Kumar Sahu; Wan-Yi Lin

arXiv:1909.12473·cs.LG·October 30, 2020·6 cites

Noisy Batch Active Learning with Deterministic Annealing

Gaurav Gupta, Anit Kumar Sahu, Wan-Yi Lin

PDF

Open Access 1 Repo

TL;DR

This paper proposes a robust batch active learning method that incorporates model uncertainty and denoising techniques to improve training with noisy labels across image classification benchmarks.

Contribution

It introduces a novel noisy batch active learning approach with deterministic annealing and a denoising layer for deep networks, enhancing robustness to label noise.

Findings

01

Improved accuracy over existing active learning strategies on benchmark datasets.

02

Effective incorporation of model uncertainty to handle small training data.

03

Significant robustness gains with the denoising layer in deep networks.

Abstract

We study the problem of training machine learning models incrementally with batches of samples annotated with noisy oracles. We select each batch of samples that are important and also diverse via clustering and importance sampling. More importantly, we incorporate model uncertainty into the sampling probability to compensate for poor estimation of the importance scores when the training data is too small to build a meaningful model. Experiments on benchmark image classification datasets (MNIST, SVHN, CIFAR10, and EMNIST) show improvement over existing active learning strategies. We introduce an extra denoising layer to deep networks to make active learning robust to label noises and show significant improvements.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gaurav71531/DeAn
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning