Inferring the Class Conditional Response Map for Weakly Supervised Semantic Segmentation
Weixuan Sun, Jing Zhang, Nick Barnes

TL;DR
This paper introduces a novel class-conditional inference strategy and an activation aware mask refinement loss to improve weakly supervised semantic segmentation, eliminating the need for re-training classifiers and achieving better pseudo labels.
Contribution
The authors propose a class-conditional inference approach and a new mask refinement loss that enhance pseudo label quality without re-training classifiers, simplifying the WSSS pipeline.
Findings
Achieves superior WSSS results without classifier re-training
Improves pseudo label quality through class-conditional inference
Refines foreground masks using saliency maps effectively
Abstract
Image-level weakly supervised semantic segmentation (WSSS) relies on class activation maps (CAMs) for pseudo labels generation. As CAMs only highlight the most discriminative regions of objects, the generated pseudo labels are usually unsatisfactory to serve directly as supervision. To solve this, most existing approaches follow a multi-training pipeline to refine CAMs for better pseudo-labels, which includes: 1) re-training the classification model to generate CAMs; 2) post-processing CAMs to obtain pseudo labels; and 3) training a semantic segmentation model with the obtained pseudo labels. However, this multi-training pipeline requires complicated adjustment and additional time. To address this, we propose a class-conditional inference strategy and an activation aware mask refinement loss function to generate better pseudo labels without re-training the classifier. The class…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Inferring the Class Conditional Response Map for Weakly Supervised Semantic Segmentation· youtube
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
MethodsAttentive Walk-Aggregating Graph Neural Network
