Boosting Weakly-Supervised Image Segmentation via Representation, Transform, and Compensator
Chunyan Wang, Dong Zhang, Rui Yan

TL;DR
This paper introduces a novel single-stage weakly-supervised image segmentation method using contrastive learning and cross-representation refinement, significantly improving pseudo-mask quality and outperforming state-of-the-art methods on PASCAL VOC 2012.
Contribution
The paper proposes a new single-stage WSIS approach with a siamese network, contrastive learning, and cross-transform regularization to enhance CAM quality and segmentation accuracy.
Findings
Achieves 67.2% mIoU on PASCAL VOC 2012 val set
Outperforms existing WSIS methods in segmentation accuracy
Extends effectively to weakly supervised object localization
Abstract
Weakly-supervised image segmentation (WSIS) is a critical task in computer vision that relies on image-level class labels. Multi-stage training procedures have been widely used in existing WSIS approaches to obtain high-quality pseudo-masks as ground-truth, resulting in significant progress. However, single-stage WSIS methods have recently gained attention due to their potential for simplifying training procedures, despite often suffering from low-quality pseudo-masks that limit their practical applications. To address this issue, we propose a novel single-stage WSIS method that utilizes a siamese network with contrastive learning to improve the quality of class activation maps (CAMs) and achieve a self-refinement process. Our approach employs a cross-representation refinement method that expands reliable object regions by utilizing different feature representations from the backbone.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
