Weakly Supervised Video Salient Object Detection
Wangbo Zhao, Jing Zhang, Long Li, Nick Barnes, Nian Liu, and Junwei Han

TL;DR
This paper introduces the first weakly supervised video salient object detection model that uses relabeled fixation-guided scribble annotations, employing novel modules and loss functions to achieve competitive performance.
Contribution
It proposes a new weakly supervised approach with an appearance-motion fusion module, bidirectional ConvLSTM, and a foreground-background similarity loss, reducing annotation effort.
Findings
Effective on six benchmark datasets
Achieves competitive performance with less annotation effort
Introduces a pseudo-label boosting strategy
Abstract
Significant performance improvement has been achieved for fully-supervised video salient object detection with the pixel-wise labeled training datasets, which are time-consuming and expensive to obtain. To relieve the burden of data annotation, we present the first weakly supervised video salient object detection model based on relabeled "fixation guided scribble annotations". Specifically, an "Appearance-motion fusion module" and bidirectional ConvLSTM based framework are proposed to achieve effective multi-modal learning and long-term temporal context modeling based on our new weak annotations. Further, we design a novel foreground-background similarity loss to further explore the labeling similarity across frames. A weak annotation boosting strategy is also introduced to boost our model performance with a new pseudo-label generation technique. Extensive experimental results on six…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
MethodsTanh Activation · Sigmoid Activation · Convolution · ConvLSTM
