AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation
Haonan Wang, Qixiang Zhang, Yi Li, Xiaomeng Li

TL;DR
AllSpark introduces a novel transformer-based module that reuses labeled features from unlabeled data to improve semi-supervised semantic segmentation, achieving superior results without complex training pipelines.
Contribution
The paper proposes AllSpark, a new architecture component that enhances semi-supervised segmentation by rebirthing labeled features from unlabeled data using cross-attention, avoiding complex training schemes.
Findings
Outperforms existing methods on Pascal, Cityscapes, and COCO benchmarks.
Effectively integrates into transformer-based segmentation models.
Achieves state-of-the-art results without complicated training procedures.
Abstract
Semi-supervised semantic segmentation (SSSS) has been proposed to alleviate the burden of time-consuming pixel-level manual labeling, which leverages limited labeled data along with larger amounts of unlabeled data. Current state-of-the-art methods train the labeled data with ground truths and unlabeled data with pseudo labels. However, the two training flows are separate, which allows labeled data to dominate the training process, resulting in low-quality pseudo labels and, consequently, sub-optimal results. To alleviate this issue, we present AllSpark, which reborns the labeled features from unlabeled ones with the channel-wise cross-attention mechanism. We further introduce a Semantic Memory along with a Channel Semantic Grouping strategy to ensure that unlabeled features adequately represent labeled features. The AllSpark shed new light on the architecture level designs of SSSS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Handwritten Text Recognition Techniques · Image Processing and 3D Reconstruction
