Salient Instance Segmentation with Region and Box-level Annotations
Jialun Pei, He Tang, Tianyang Cheng, Chuanbo Chen

TL;DR
This paper introduces CGCNet, a novel inexact supervision framework for salient instance segmentation that leverages existing datasets with bounding boxes and salient regions, achieving competitive results without extensive labeling.
Contribution
The paper proposes a new inexact supervision approach using bounding boxes and salient regions, along with a global feature refining layer and label updating scheme, to train salient instance segmentation models.
Findings
Achieves a mask AP of 58.3% on Dataset1K
Outperforms state-of-the-art fully supervised methods
Demonstrates effectiveness of inexact supervision in segmentation
Abstract
Salient instance segmentation is a new challenging task that received widespread attention in the saliency detection area. The new generation of saliency detection provides a strong theoretical and technical basis for video surveillance. Due to the limited scale of the existing dataset and the high mask annotations cost, plenty of supervision source is urgently needed to train a well-performing salient instance model. In this paper, we aim to train a novel salient instance segmentation framework by an inexact supervision without resorting to laborious labeling. To this end, we present a cyclic global context salient instance segmentation network (CGCNet), which is supervised by the combination of salient regions and bounding boxes from the ready-made salient object detection datasets. To locate salient instance more accurately, a global feature refining layer is proposed that dilates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
