Discovering Class-Specific Pixels for Weakly-Supervised Semantic Segmentation
Arslan Chaudhry, Puneet K. Dokania, Philip H.S. Torr

TL;DR
This paper introduces a method combining saliency and attention maps to discover class-specific pixels, significantly improving weakly-supervised semantic segmentation performance on PASCAL VOC12.
Contribution
It presents a hierarchical approach to identify class-specific pixels using saliency and attention, serving as approximate ground truth for CNN training in weakly-supervised segmentation.
Findings
Achieved 60.8% mIoU on PASCAL VOC12 val set
Achieved 61.9% mIoU on PASCAL VOC12 test set
Surpassed previous state-of-the-art by over 5% mIoU
Abstract
We propose an approach to discover class-specific pixels for the weakly-supervised semantic segmentation task. We show that properly combining saliency and attention maps allows us to obtain reliable cues capable of significantly boosting the performance. First, we propose a simple yet powerful hierarchical approach to discover the class-agnostic salient regions, obtained using a salient object detector, which otherwise would be ignored. Second, we use fully convolutional attention maps to reliably localize the class-specific regions in a given image. We combine these two cues to discover class-specific pixels which are then used as an approximate ground truth for training a CNN. While solving the weakly supervised semantic segmentation task, we ensure that the image-level classification task is also solved in order to enforce the CNN to assign at least one pixel to each object present…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
