Reliable Label Bootstrapping for Semi-Supervised Learning
Paul Albert, Diego Ortego, Eric Arazo, Noel E. O'Connor, Kevin, McGuinness

TL;DR
ReLaB is an unsupervised pre-processing method that enhances semi-supervised learning by propagating labels through self-supervised features and selecting reliable samples, significantly reducing the need for labeled data.
Contribution
The paper introduces ReLaB, a novel label bootstrapping method that improves semi-supervised learning in low-label scenarios by combining self-supervised features, label propagation, and noise detection.
Findings
ReLaB achieves lower error rates on CIFAR-10, CIFAR-100, and mini-ImageNet.
Effective label propagation depends on network architecture and self-supervised algorithms.
ReLaB significantly reduces the number of labeled samples needed for competitive performance.
Abstract
Reducing the amount of labels required to train convolutional neural networks without performance degradation is key to effectively reduce human annotation efforts. We propose Reliable Label Bootstrapping (ReLaB), an unsupervised preprossessing algorithm which improves the performance of semi-supervised algorithms in extremely low supervision settings. Given a dataset with few labeled samples, we first learn meaningful self-supervised, latent features for the data. Second, a label propagation algorithm propagates the known labels on the unsupervised features, effectively labeling the full dataset in an automatic fashion. Third, we select a subset of correctly labeled (reliable) samples using a label noise detection algorithm. Finally, we train a semi-supervised algorithm on the extended subset. We show that the selection of the network architecture and the self-supervised algorithm are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
