Revisiting Pretraining for Semi-Supervised Learning in the Low-Label   Regime

Xun Xu; Jingyi Liao; Lile Cai; Manh Cuong Nguyen; Kangkang Lu; Wanyue; Zhang; Yasin Yazici; Chuan Sheng Foo

arXiv:2205.03001·cs.CV·May 9, 2022

Revisiting Pretraining for Semi-Supervised Learning in the Low-Label Regime

Xun Xu, Jingyi Liao, Lile Cai, Manh Cuong Nguyen, Kangkang Lu, Wanyue, Zhang, Yasin Yazici, Chuan Sheng Foo

PDF

Open Access

TL;DR

This paper demonstrates that target pretraining enhances semi-supervised learning performance in low-label scenarios by addressing covariate shift, with extensive experiments validating its effectiveness across classification and segmentation tasks.

Contribution

It introduces a contrastive target pretraining step before semi-supervised finetuning, improving SSL in low-label regimes by mitigating covariate shift and leveraging pretrained weights.

Findings

01

Target pretraining improves SSL performance in low-label regimes.

02

Pretrained weights significantly contribute to state-of-the-art results.

03

Extensive experiments confirm the effectiveness across tasks.

Abstract

Semi-supervised learning (SSL) addresses the lack of labeled data by exploiting large unlabeled data through pseudolabeling. However, in the extremely low-label regime, pseudo labels could be incorrect, a.k.a. the confirmation bias, and the pseudo labels will in turn harm the network training. Recent studies combined finetuning (FT) from pretrained weights with SSL to mitigate the challenges and claimed superior results in the low-label regime. In this work, we first show that the better pretrained weights brought in by FT account for the state-of-the-art performance, and importantly that they are universally helpful to off-the-shelf semi-supervised learners. We further argue that direct finetuning from pretrained weights is suboptimal due to covariate shift and propose a contrastive target pretraining step to adapt model weights towards target dataset. We carried out extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Text and Document Classification Technologies · Multimodal Machine Learning Applications