TL;DR
This paper introduces a novel deep learning method that combines active and semi-supervised learning to achieve high accuracy with minimal labeled data, reducing labeling effort while maintaining performance.
Contribution
The paper presents a new approach integrating active learning with semi-supervised deep neural networks, improving label efficiency without specialized architectures.
Findings
Achieved 2.06% error on MNIST with 300 labels
Achieved 1.06% error on MNIST with 1000 labels
Method does not rely on special architectures or data augmentation
Abstract
In many applications the process of generating label information is expensive and time consuming. We present a new method that combines active and semi-supervised deep learning to achieve high generalization performance from a deep convolutional neural network with as few known labels as possible. In a setting where a small amount of labeled data as well as a large amount of unlabeled data is available, our method first learns the labeled data set. This initialization is followed by an expectation maximization algorithm, where further training reduces classification entropy on the unlabeled data by targeting a low entropy fit which is consistent with the labeled data. In addition the algorithm asks at a specified frequency an oracle for labels of data with entropy above a certain entropy quantile. Using this active learning component we obtain an agile labeling process that achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
