Revisiting consistency for semi-supervised semantic segmentation
Ivan Grubi\v{s}i\'c, Marin Or\v{s}i\'c, Sini\v{s}a \v{S}egvi\'c

TL;DR
This paper explores semi-supervised semantic segmentation, emphasizing one-way consistency and specific perturbation strategies, demonstrating improved performance especially with photometric and geometric augmentations.
Contribution
It introduces a novel perturbation model combining geometric and photometric changes and highlights the benefits of perturbing only one model instance in semi-supervised learning.
Findings
One-way consistency outperforms traditional methods.
Perturbing only the student branch yields better results.
Photometric and geometric perturbations significantly improve segmentation accuracy.
Abstract
Semi-supervised learning an attractive technique in practical deployments of deep models since it relaxes the dependence on labeled data. It is especially important in the scope of dense prediction because pixel-level annotation requires significant effort. This paper considers semi-supervised algorithms that enforce consistent predictions over perturbed unlabeled inputs. We study the advantages of perturbing only one of the two model instances and preventing the backward pass through the unperturbed instance. We also propose a competitive perturbation model as a composition of geometric warp and photometric jittering. We experiment with efficient models due to their importance for real-time and low-power applications. Our experiments show clear advantages of (1) one-way consistency, (2) perturbing only the student branch, and (3) strong photometric and geometric perturbations. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Human Pose and Action Recognition
