Expansion and Shrinkage of Localization for Weakly-Supervised Semantic Segmentation
Jinlong Li, Zequn Jie, Xu Wang, Xiaolin Wei, Lin Ma

TL;DR
This paper introduces a novel Expansion and Shrinkage scheme using deformable convolution offset learning to enhance the quality of class activation maps, thereby improving weakly-supervised semantic segmentation accuracy.
Contribution
It proposes a two-stage offset learning approach with expansion and shrinkage samplers to sequentially improve recall and precision of object localization in CAMs.
Findings
Outperforms state-of-the-art methods on PASCAL VOC 2012
Achieves superior results on MS COCO 2014
Demonstrates effective improvement in localization maps quality
Abstract
Generating precise class-aware pseudo ground-truths, a.k.a, class activation maps (CAMs), is essential for weakly-supervised semantic segmentation. The original CAM method usually produces incomplete and inaccurate localization maps. To tackle with this issue, this paper proposes an Expansion and Shrinkage scheme based on the offset learning in the deformable convolution, to sequentially improve the recall and precision of the located object in the two respective stages. In the Expansion stage, an offset learning branch in a deformable convolution layer, referred as "expansion sampler" seeks for sampling increasingly less discriminative object regions, driven by an inverse supervision signal that maximizes image-level classification loss. The located more complete object in the Expansion stage is then gradually narrowed down to the final object region during the Shrinkage stage. In the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI
MethodsClass-activation map · Convolution · Deformable Convolution
