Reducing Label Effort: Self-Supervised meets Active Learning
Javad Zolfaghari Bengar, Joost van de Weijer, Bartlomiej Twardowski,, Bogdan Raducanu

TL;DR
This paper investigates combining self-training and active learning for object recognition, finding self-training more efficient and that their combination benefits mainly at higher labeling budgets.
Contribution
It provides an empirical analysis of how self-training and active learning interact, revealing when their combination is most effective.
Findings
Self-training outperforms active learning in reducing labeling effort.
Active learning adds little benefit at low labeling budgets.
Combining both methods is most effective at high labeling budgets.
Abstract
Active learning is a paradigm aimed at reducing the annotation effort by training the model on actively selected informative and/or representative samples. Another paradigm to reduce the annotation effort is self-training that learns from a large amount of unlabeled data in an unsupervised way and fine-tunes on few labeled samples. Recent developments in self-training have achieved very impressive results rivaling supervised learning on some datasets. The current work focuses on whether the two paradigms can benefit from each other. We studied object recognition datasets including CIFAR10, CIFAR100 and Tiny ImageNet with several labeling budgets for the evaluations. Our experiments reveal that self-training is remarkably more efficient than active learning at reducing the labeling effort, that for a low labeling budget, active learning offers no benefit to self-training, and finally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
