Puzzle-CAM: Improved localization via matching partial and full features

Sanghyun Jo; In-Jae Yu

arXiv:2101.11253·cs.CV·September 24, 2021

Puzzle-CAM: Improved localization via matching partial and full features

Sanghyun Jo, In-Jae Yu

PDF

4 Repos

TL;DR

Puzzle-CAM enhances weakly-supervised semantic segmentation by matching features from image patches and the whole image, leading to more comprehensive object localization without extra parameters.

Contribution

It introduces a novel puzzle module with regularization to improve object region activation in WSSS using only image-level supervision.

Findings

01

Outperforms previous state-of-the-art on PASCAL VOC 2012

02

Activates more complete object regions

03

Does not require additional parameters

Abstract

Weakly-supervised semantic segmentation (WSSS) is introduced to narrow the gap for semantic segmentation performance from pixel-level supervision to image-level supervision. Most advanced approaches are based on class activation maps (CAMs) to generate pseudo-labels to train the segmentation network. The main limitation of WSSS is that the process of generating pseudo-labels from CAMs that use an image classifier is mainly focused on the most discriminative parts of the objects. To address this issue, we propose Puzzle-CAM, a process that minimizes differences between the features from separate patches and the whole image. Our method consists of a puzzle module and two regularization terms to discover the most integrated region in an object. Puzzle-CAM can activate the overall region of an object using image-level supervision without requiring extra parameters. % In experiments,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.