Self-supervised Scale Equivariant Network for Weakly Supervised Semantic Segmentation
Yude Wang, Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen

TL;DR
This paper introduces SSENet, a novel scale equivariant regularization method that improves class activation maps in weakly supervised semantic segmentation by ensuring consistency across different image resolutions, leading to state-of-the-art results.
Contribution
The work proposes a new scale equivariant regularization for CAMs, enhancing weakly supervised segmentation by leveraging spatial transformation equivariance.
Findings
Achieves state-of-the-art performance on PASCAL VOC 2012.
Improves CAM accuracy and completeness.
Demonstrates robustness across different resolutions.
Abstract
Weakly supervised semantic segmentation has attracted much research interest in recent years considering its advantage of low labeling cost. Most of the advanced algorithms follow the design principle that expands and constrains the seed regions from class activation maps (CAM). As well-known, conventional CAM tends to be incomplete or over-activated due to weak supervision. Fortunately, we find that semantic segmentation has a characteristic of spatial transformation equivariance, which can form a few self-supervisions to help weakly supervised learning. This work mainly explores the advantages of scale equivariant constrains for CAM generation, formulated as a self-supervised scale equivariant network (SSENet). Specifically, a novel scale equivariant regularization is elaborately designed to ensure consistency of CAMs from the same input image with different resolutions. This novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsClass-activation map
