Debiased Self-Training for Semi-Supervised Learning

Baixu Chen; Junguang Jiang; Ximei Wang; Pengfei Wan; Jianmin Wang,; Mingsheng Long

arXiv:2202.07136·cs.LG·November 10, 2022·38 cites

Debiased Self-Training for Semi-Supervised Learning

Baixu Chen, Junguang Jiang, Ximei Wang, Pengfei Wan, Jianmin Wang,, Mingsheng Long

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces Debiased Self-Training (DST), a novel method that reduces bias and improves stability in semi-supervised learning by decoupling pseudo label generation and adversarially optimizing representations.

Contribution

The paper proposes DST, which decouples pseudo label generation from training and adversarially optimizes representations to mitigate bias and improve semi-supervised learning performance.

Findings

01

DST achieves 6.3% average improvement over state-of-the-art methods.

02

DST improves stability and class balance in training.

03

DST enhances performance when training from scratch or fine-tuning.

Abstract

Deep neural networks achieve remarkable performances on a wide range of tasks with the aid of large-scale labeled datasets. Yet these datasets are time-consuming and labor-exhaustive to obtain on realistic tasks. To mitigate the requirement for labeled data, self-training is widely used in semi-supervised learning by iteratively assigning pseudo labels to unlabeled samples. Despite its popularity, self-training is well-believed to be unreliable and often leads to training instability. Our experimental studies further reveal that the bias in semi-supervised learning arises from both the problem itself and the inappropriate training with potentially incorrect pseudo labels, which accumulates the error in the iterative self-training process. To reduce the above bias, we propose Debiased Self-Training (DST). First, the generation and utilization of pseudo labels are decoupled by two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Debiased Self-Training for Semi-Supervised Learning· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning

MethodsDynamic Sparse Training · FixMatch