Debiased Self-Training for Semi-Supervised Learning
Baixu Chen, Junguang Jiang, Ximei Wang, Pengfei Wan, Jianmin Wang,, Mingsheng Long

TL;DR
This paper introduces Debiased Self-Training (DST), a novel method that reduces bias and improves stability in semi-supervised learning by decoupling pseudo label generation and adversarially optimizing representations.
Contribution
The paper proposes DST, which decouples pseudo label generation from training and adversarially optimizes representations to mitigate bias and improve semi-supervised learning performance.
Findings
DST achieves 6.3% average improvement over state-of-the-art methods.
DST improves stability and class balance in training.
DST enhances performance when training from scratch or fine-tuning.
Abstract
Deep neural networks achieve remarkable performances on a wide range of tasks with the aid of large-scale labeled datasets. Yet these datasets are time-consuming and labor-exhaustive to obtain on realistic tasks. To mitigate the requirement for labeled data, self-training is widely used in semi-supervised learning by iteratively assigning pseudo labels to unlabeled samples. Despite its popularity, self-training is well-believed to be unreliable and often leads to training instability. Our experimental studies further reveal that the bias in semi-supervised learning arises from both the problem itself and the inappropriate training with potentially incorrect pseudo labels, which accumulates the error in the iterative self-training process. To reduce the above bias, we propose Debiased Self-Training (DST). First, the generation and utilization of pseudo labels are decoupled by two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning
MethodsDynamic Sparse Training · FixMatch
