Beyond Discriminative Regions: Saliency Maps as Alternatives to CAMs for Weakly Supervised Semantic Segmentation
M. Maruf, Arka Daw, Amartya Dutta, Jie Bu, Anuj Karpatne

TL;DR
This paper compares saliency maps and CAMs for weakly supervised semantic segmentation, showing saliencies can address CAMs' limitations and proposing a stochastic aggregation method to enhance saliency performance.
Contribution
It offers a comprehensive comparison of saliency maps and CAMs, introduces new evaluation metrics, and proposes a stochastic cropping technique to improve saliency-based segmentation.
Findings
Saliency maps effectively address CAMs' non-discriminative region issue.
New metrics provide a thorough assessment of WS3 methods.
Stochastic cropping enhances saliency performance significantly.
Abstract
In recent years, several Weakly Supervised Semantic Segmentation (WS3) methods have been proposed that use class activation maps (CAMs) generated by a classifier to produce pseudo-ground truths for training segmentation models. While CAMs are good at highlighting discriminative regions (DR) of an image, they are known to disregard regions of the object that do not contribute to the classifier's prediction, termed non-discriminative regions (NDR). In contrast, attribution methods such as saliency maps provide an alternative approach for assigning a score to every pixel based on its contribution to the classification prediction. This paper provides a comprehensive comparison between saliencies and CAMs for WS3. Our study includes multiple perspectives on understanding their similarities and dissimilarities. Moreover, we provide new evaluation metrics that perform a comprehensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Visual Attention and Saliency Detection
MethodsClass-activation map
