ST++: Make Self-training Work Better for Semi-supervised Semantic   Segmentation

Lihe Yang; Wei Zhuo; Lei Qi; Yinghuan Shi; Yang Gao

arXiv:2106.05095·cs.CV·March 4, 2022·21 cites

ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi, Yang Gao

PDF

Open Access 1 Repo

TL;DR

This paper enhances semi-supervised semantic segmentation by improving self-training with strong data augmentations and a novel reliability-based re-training strategy, significantly outperforming existing methods.

Contribution

Introduces a strong baseline for self-training with data augmentation and proposes ST++ with holistic prediction stability for better pseudo-label reliability.

Findings

01

ST outperforms existing methods without iterative re-training.

02

SDA effectively reduces overfitting and prediction similarity.

03

ST++ further improves performance through reliability-based selective re-training.

Abstract

Self-training via pseudo labeling is a conventional, simple, and popular pipeline to leverage unlabeled data. In this work, we first construct a strong baseline of self-training (namely ST) for semi-supervised semantic segmentation via injecting strong data augmentations (SDA) on unlabeled images to alleviate overfitting noisy labels as well as decouple similar predictions between the teacher and student. With this simple mechanism, our ST outperforms all existing methods without any bells and whistles, e.g., iterative re-training. Inspired by the impressive results, we thoroughly investigate the SDA and provide some empirical analysis. Nevertheless, incorrect pseudo labels are still prone to accumulate and degrade the performance. To this end, we further propose an advanced self-training framework (namely ST++), that performs selective re-training via prioritizing reliable unlabeled…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

LiheYoung/ST-PlusPlus
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Infrastructure Maintenance and Monitoring

MethodsAttentive Walk-Aggregating Graph Neural Network