Learning from Future: A Novel Self-Training Framework for Semantic Segmentation
Ye Du, Yujun Shen, Haochen Wang, Jingjing Fei, Wei Li, Liwei Wu, Rui, Zhao, Zehua Fu, Qingjie Liu

TL;DR
This paper introduces a novel self-training framework for semantic segmentation that learns from the future to improve pseudo-label quality and overall performance, addressing confirmation bias issues in semi-supervised learning.
Contribution
The authors propose a future-self-training strategy that updates the teacher with a virtual future student, enhancing pseudo-label accuracy in semi-supervised segmentation.
Findings
Improves pseudo-label quality and segmentation accuracy.
Effective in unsupervised domain adaptation and semi-supervised settings.
Outperforms existing self-training methods across various benchmarks.
Abstract
Self-training has shown great potential in semi-supervised learning. Its core idea is to use the model learned on labeled data to generate pseudo-labels for unlabeled samples, and in turn teach itself. To obtain valid supervision, active attempts typically employ a momentum teacher for pseudo-label prediction yet observe the confirmation bias issue, where the incorrect predictions may provide wrong supervision signals and get accumulated in the training process. The primary cause of such a drawback is that the prevailing self-training framework acts as guiding the current state with previous knowledge, because the teacher is updated with the past student only. To alleviate this problem, we propose a novel self-training strategy, which allows the model to learn from the future. Concretely, at each training step, we first virtually optimize the student (i.e., caching the gradients without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Multimodal Machine Learning Applications
