TL;DR
This paper introduces BBAM, a novel method that leverages object detector behavior to generate bounding-box attribution maps, significantly improving weakly supervised segmentation accuracy on PASCAL VOC and MS COCO datasets.
Contribution
The paper proposes a new approach using detector behavior for weakly supervised segmentation, outperforming existing methods and providing detailed analysis of the BBAM technique.
Findings
Outperforms recent methods on PASCAL VOC and MS COCO benchmarks
Utilizes higher-level detector information instead of low-level image features
Provides deeper insights into the behavior of the bounding-box attribution maps
Abstract
Weakly supervised segmentation methods using bounding box annotations focus on obtaining a pixel-level mask from each box containing an object. Existing methods typically depend on a class-agnostic mask generator, which operates on the low-level information intrinsic to an image. In this work, we utilize higher-level information from the behavior of a trained object detector, by seeking the smallest areas of the image from which the object detector produces almost the same result as it does from the whole image. These areas constitute a bounding-box attribution map (BBAM), which identifies the target object in its bounding box and thus serves as pseudo ground-truth for weakly supervised semantic and instance segmentation. This approach significantly outperforms recent comparable techniques on both the PASCAL VOC and MS COCO benchmarks in weakly supervised semantic and instance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
