Improve CAM with Auto-adapted Segmentation and Co-supervised Augmentation
Ziyi Kou, Guofeng Cui, Shaojie Wang, Wentian Zhao, Chenliang Xu

TL;DR
This paper introduces a novel approach for Weakly Supervised Object Localization that enhances CAM by incorporating confidence segmentation and co-supervised augmentation, leading to state-of-the-art results.
Contribution
It proposes ConfSeg and CoAug modules that improve foreground-background separation and localization accuracy without extra hyper-parameters.
Findings
Achieves 37.69% Top-1 localization error on CUB-200
Achieves 48.81% Top-1 localization error on ILSVRC
Outperforms previous WSOL methods significantly
Abstract
Weakly Supervised Object Localization (WSOL) methods generate both classification and localization results by learning from only image category labels. Previous methods usually utilize class activation map (CAM) to obtain target object regions. However, most of them only focus on improving foreground object parts in CAM, but ignore the important effect of its background contents. In this paper, we propose a confidence segmentation (ConfSeg) module that builds confidence score for each pixel in CAM without introducing additional hyper-parameters. The generated sample-specific confidence mask is able to indicate the extent of determination for each pixel in CAM, and further supervises additional CAM extended from internal feature maps. Besides, we introduce Co-supervised Augmentation (CoAug) module to capture feature-level representation for foreground and background parts in CAM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsClass-activation map
