Exploiting Shape Cues for Weakly Supervised Semantic Segmentation

Sungpil Kho; Pilhyeon Lee; Wonyoung Lee; Minsong Ki; Hyeran Byun

arXiv:2208.04286·cs.CV·August 9, 2022

Exploiting Shape Cues for Weakly Supervised Semantic Segmentation

Sungpil Kho, Pilhyeon Lee, Wonyoung Lee, Minsong Ki, Hyeran Byun

PDF

TL;DR

This paper introduces a shape-aware approach to weakly supervised semantic segmentation that leverages shape cues and an online refinement process, significantly improving accuracy and boundary alignment over existing methods.

Contribution

It proposes a novel shape exploitation method combined with an online refinement technique, enabling end-to-end training and superior segmentation performance.

Findings

01

Outperforms state-of-the-art single-stage methods on PASCAL VOC 2012.

02

Achieves new best results among multi-stage approaches in a simple pipeline.

03

Produces more precise and boundary-aligned segmentation masks.

Abstract

Weakly supervised semantic segmentation (WSSS) aims to produce pixel-wise class predictions with only image-level labels for training. To this end, previous methods adopt the common pipeline: they generate pseudo masks from class activation maps (CAMs) and use such masks to supervise segmentation networks. However, it is challenging to derive comprehensive pseudo masks that cover the whole extent of objects due to the local property of CAMs, i.e., they tend to focus solely on small discriminative object parts. In this paper, we associate the locality of CAMs with the texture-biased property of convolutional neural networks (CNNs). Accordingly, we propose to exploit shape information to supplement the texture-biased CNN features, thereby encouraging mask predictions to be not only comprehensive but also well-aligned with object boundaries. We further refine the predictions in an online…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.