Learning structure-aware semantic segmentation with image-level   supervision

Jiawei Liu; Jing Zhang; Yicong Hong; Nick Barnes

arXiv:2104.07216·cs.CV·January 6, 2022

Learning structure-aware semantic segmentation with image-level supervision

Jiawei Liu, Jing Zhang, Yicong Hong, Nick Barnes

PDF

1 Repo

TL;DR

This paper enhances weakly-supervised semantic segmentation by incorporating structure-aware techniques, including boundary detection and smoothness constraints, to improve the quality of class activation maps and segmentation accuracy.

Contribution

It introduces an auxiliary boundary detection module and smoothness loss to preserve structure information in weakly-supervised segmentation, addressing limitations of traditional CAM-based methods.

Findings

01

Improved segmentation accuracy on PASCAL-VOC dataset

02

Enhanced boundary sharpness and consistency in predictions

03

Demonstrated effectiveness of structure-aware supervision

Abstract

Compared with expensive pixel-wise annotations, image-level labels make it possible to learn semantic segmentation in a weakly-supervised manner. Within this pipeline, the class activation map (CAM) is obtained and further processed to serve as a pseudo label to train the semantic segmentation model in a fully-supervised manner. In this paper, we argue that the lost structure information in CAM limits its application in downstream semantic segmentation, leading to deteriorated predictions. Furthermore, the inconsistent class activation scores inside the same object contradicts the common sense that each region of the same object should belong to the same semantic category. To produce sharp prediction with structure information, we introduce an auxiliary semantic boundary detection module, which penalizes the deteriorated predictions. Furthermore, we adopt smoothness loss to encourage…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Carlisle-Liu/SBNet
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsClass-activation map