Maximize the Exploration of Congeneric Semantics for Weakly Supervised Semantic Segmentation
Ke Zhang, Sihong Chen, Qi Ju, Yong Jiang, Yucong Li, Xin He

TL;DR
This paper introduces a novel weakly supervised semantic segmentation method using a patch-level graph neural network and transformer-based learning to better exploit congeneric semantic regions, achieving state-of-the-art results on PASCAL VOC 2012.
Contribution
It proposes a patch-level graph neural network with transformer-based complementary learning for weakly supervised segmentation, enhancing semantic information transfer among similar objects.
Findings
Achieves state-of-the-art performance on PASCAL VOC 2012.
Effectively models congeneric semantic regions with P-GNN.
Improves segmentation accuracy with soft-complementary loss.
Abstract
With the increase in the number of image data and the lack of corresponding labels, weakly supervised learning has drawn a lot of attention recently in computer vision tasks, especially in the fine-grained semantic segmentation problem. To alleviate human efforts from expensive pixel-by-pixel annotations, our method focuses on weakly supervised semantic segmentation (WSSS) with image-level tags, which are much easier to obtain. As a huge gap exists between pixel-level segmentation and image-level labels, how to reflect the image-level semantic information on each pixel is an important question. To explore the congeneric semantic regions from the same class to the maximum, we construct the patch-level graph neural network (P-GNN) based on the self-detected patches from different images that contain the same class labels. Patches can frame the objects as much as possible and include as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Visual Attention and Saliency Detection
MethodsGraph Neural Network
