On the Marginal Benefit of Active Learning: Does Self-Supervision Eat   Its Cake?

Yao-Chun Chan; Mingchen Li; Samet Oymak

arXiv:2011.08121·cs.LG·November 17, 2020·1 cites

On the Marginal Benefit of Active Learning: Does Self-Supervision Eat Its Cake?

Yao-Chun Chan, Mingchen Li, Samet Oymak

PDF

Open Access

TL;DR

This paper investigates the interplay between active learning and self-supervised semi-supervised learning, finding that self-supervised pretraining enhances semi-supervised methods but diminishes the additional benefits of active learning.

Contribution

It introduces a novel framework combining self-supervised pretraining, active learning, and self-training, and systematically evaluates their individual and combined effects.

Findings

01

Self-supervised pretraining improves semi-supervised learning in low-label regimes.

02

Active learning benefits are reduced when combined with self-supervised techniques.

03

State-of-the-art active learning algorithms offer no additional gains with advanced S4L methods.

Abstract

Active learning is the set of techniques for intelligently labeling large unlabeled datasets to reduce the labeling effort. In parallel, recent developments in self-supervised and semi-supervised learning (S4L) provide powerful techniques, based on data-augmentation, contrastive learning, and self-training, that enable superior utilization of unlabeled data which led to a significant reduction in required labeling in the standard machine learning benchmarks. A natural question is whether these paradigms can be unified to obtain superior results. To this aim, this paper provides a novel algorithmic framework integrating self-supervised pretraining, active learning, and consistency-regularized self-training. We conduct extensive experiments with our framework on CIFAR10 and CIFAR100 datasets. These experiments enable us to isolate and assess the benefits of individual components which are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning