Weakly-Supervised Salient Object Detection Using Point Supervision
Shuyong Gao, Wei Zhang, Yan Wang, Qianyu Guo, Chenglong Zhang, Yangji, He, Wenqiang Zhang

TL;DR
This paper introduces a novel weakly-supervised salient object detection approach using point annotations, employing an adaptive flood filling and a transformer model, with a new dataset and iterative refinement, outperforming fully supervised methods.
Contribution
The paper proposes the first point-supervised saliency detection method with a new dataset and a two-stage training process including Non-Salient Suppression.
Findings
Outperforms state-of-the-art weakly supervised methods
Surpasses some fully supervised models in accuracy
Introduces a new point-supervised dataset (P-DUTS)
Abstract
Current state-of-the-art saliency detection models rely heavily on large datasets of accurate pixel-wise annotations, but manually labeling pixels is time-consuming and labor-intensive. There are some weakly supervised methods developed for alleviating the problem, such as image label, bounding box label, and scribble label, while point label still has not been explored in this field. In this paper, we propose a novel weakly-supervised salient object detection method using point supervision. To infer the saliency map, we first design an adaptive masked flood filling algorithm to generate pseudo labels. Then we develop a transformer-based point-supervised saliency detection model to produce the first round of saliency maps. However, due to the sparseness of the label, the weakly supervised model tends to degenerate into a general foreground detection model. To address this issue, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
