Background Activation Suppression for Weakly Supervised Object Localization
Pingyu Wu, Wei Zhai, Yang Cao

TL;DR
This paper introduces Background Activation Suppression (BAS), a novel method for weakly supervised object localization that suppresses background activation to improve object region learning, outperforming existing methods on benchmark datasets.
Contribution
The paper proposes a new BAS method with an Activation Map Constraint module to enhance object localization by suppressing background activation in weakly supervised learning.
Findings
BAS significantly improves localization accuracy over baseline methods.
BAS achieves state-of-the-art results on CUB-200-2011 and ILSVRC datasets.
Extensive experiments validate the effectiveness of background activation suppression.
Abstract
Weakly supervised object localization (WSOL) aims to localize objects using only image-level labels. Recently a new paradigm has emerged by generating a foreground prediction map (FPM) to achieve localization task. Existing FPM-based methods use cross-entropy (CE) to evaluate the foreground prediction map and to guide the learning of generator. We argue for using activation value to achieve more efficient learning. It is based on the experimental observation that, for a trained network, CE converges to zero when the foreground mask covers only part of the object region. While activation value increases until the mask expands to the object boundary, which indicates that more object areas can be learned by using activation value. In this paper, we propose a Background Activation Suppression (BAS) method. Specifically, an Activation Map Constraint module (AMC) is designed to facilitate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
