Multi-Granularity Denoising and Bidirectional Alignment for Weakly Supervised Semantic Segmentation
Tao Chen, Yazhou Yao, Jinhui Tang

TL;DR
This paper introduces MDBA, an end-to-end model that improves weakly supervised semantic segmentation by denoising pseudo labels and aligning data distributions, achieving state-of-the-art results on PASCAL VOC 2012.
Contribution
The paper proposes a novel multi-granularity denoising and bidirectional alignment framework for WSSS, addressing noisy labels and multi-class generalization challenges.
Findings
Achieves 69.5% mIoU on PASCAL VOC 2012 validation set.
Effective noise filtering and bidirectional alignment improve segmentation accuracy.
Source code and models are publicly available.
Abstract
Weakly supervised semantic segmentation (WSSS) models relying on class activation maps (CAMs) have achieved desirable performance comparing to the non-CAMs-based counterparts. However, to guarantee WSSS task feasible, we need to generate pseudo labels by expanding the seeds from CAMs which is complex and time-consuming, thus hindering the design of efficient end-to-end (single-stage) WSSS approaches. To tackle the above dilemma, we resort to the off-the-shelf and readily accessible saliency maps for directly obtaining pseudo labels given the image-level class labels. Nevertheless, the salient regions may contain noisy labels and cannot seamlessly fit the target objects, and saliency maps can only be approximated as pseudo labels for simple images containing single-class objects. As such, the achieved segmentation model with these simple images cannot generalize well to the complex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
MethodsTest
