Reliability Does Matter: An End-to-End Weakly Supervised Semantic Segmentation Approach
Bingfeng Zhang, Jimin Xiao, Yunchao Wei, Mingjie Sun, Kaizhu Huang

TL;DR
This paper introduces a simple, end-to-end weakly supervised semantic segmentation method that leverages image-level labels to produce reliable pixel-level annotations, achieving competitive results on Pascal VOC.
Contribution
The proposed fully end-to-end approach simplifies weakly supervised segmentation by avoiding complex pseudo mask generation steps, using class activation maps and a dense energy loss.
Findings
Achieves 62.9% mIoU on Pascal VOC test set with one-step method.
Extending to a two-step approach yields 66.5% mIoU, setting new state-of-the-art.
Simplifies weakly supervised segmentation pipeline while maintaining high performance.
Abstract
Weakly supervised semantic segmentation is a challenging task as it only takes image-level information as supervision for training but produces pixel-level predictions for testing. To address such a challenging task, most recent state-of-the-art approaches propose to adopt two-step solutions, \emph{i.e. } 1) learn to generate pseudo pixel-level masks, and 2) engage FCNs to train the semantic segmentation networks with the pseudo masks. However, the two-step solutions usually employ many bells and whistles in producing high-quality pseudo masks, making this kind of methods complicated and inelegant. In this work, we harness the image-level labels to produce reliable pixel-level annotations and design a fully end-to-end network to learn to predict segmentation maps. Concretely, we firstly leverage an image classification branch to generate class activation maps for the annotated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
