Rethinking Localization Map: Towards Accurate Object Perception with Self-Enhancement Maps
Xiaolin Zhang, Yunchao Wei, Yi Yang, Fei Wu

TL;DR
This paper introduces a new evaluation metric for weakly supervised object localization maps, proposes a self-enhancement method for more accurate localization using only category labels, and achieves state-of-the-art results on benchmark datasets.
Contribution
The work presents a direct pixel-wise evaluation metric for localization maps, annotates object masks for the dataset, and introduces a self-enhancement approach to improve localization accuracy with minimal supervision.
Findings
Achieved 54.88% localization accuracy on ILSVRC.
Proposed IoU-Threshold curves for better evaluation.
Developed a self-enhancement method that outperforms previous approaches.
Abstract
Recently, remarkable progress has been made in weakly supervised object localization (WSOL) to promote object localization maps. The common practice of evaluating these maps applies an indirect and coarse way, i.e., obtaining tight bounding boxes which can cover high-activation regions and calculating intersection-over-union (IoU) scores between the predicted and ground-truth boxes. This measurement can evaluate the ability of localization maps to some extent, but we argue that the maps should be measured directly and delicately, i.e., comparing the maps with the ground-truth object masks pixel-wisely. To fulfill the direct evaluation, we annotate pixel-level object masks on the ILSVRC validation set. We propose to use IoU-Threshold curves for evaluating the real quality of localization maps. Beyond the amended evaluation metric and annotated object masks, this work also introduces a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
