Where are the Masks: Instance Segmentation with Image-level Supervision
Issam H. Laradji, David Vazquez, Mark Schmidt

TL;DR
This paper introduces a new two-stage framework for instance segmentation that uses only image-level labels, significantly reducing annotation effort while achieving state-of-the-art results on PASCAL VOC 2012.
Contribution
The paper presents a simple, adaptable pipeline that trains with image-level labels and produces pseudo masks for effective instance segmentation.
Findings
Achieves new state-of-the-art results on PASCAL VOC 2012.
Demonstrates major performance improvements over existing weakly supervised methods.
Framework is easy to implement and adaptable to different segmentation techniques.
Abstract
A major obstacle in instance segmentation is that existing methods often need many per-pixel labels in order to be effective. These labels require large human effort and for certain applications, such labels are not readily available. To address this limitation, we propose a novel framework that can effectively train with image-level labels, which are significantly cheaper to acquire. For instance, one can do an internet search for the term "car" and obtain many images where a car is present with minimal effort. Our framework consists of two stages: (1) train a classifier to generate pseudo masks for the objects of interest; (2) train a fully supervised Mask R-CNN on these pseudo masks. Our two main contribution are proposing a pipeline that is simple to implement and is amenable to different segmentation methods; and achieves new state-of-the-art results for this problem setup. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Vision and Imaging · Multimodal Machine Learning Applications
MethodsRegion Proposal Network · Softmax · Convolution · RoIAlign · Mask R-CNN
