Towards End-to-End Unsupervised Saliency Detection with Self-Supervised Top-Down Context
Yicheng Song, Shuyong Gao, Haozhe Xing, Yiting Cheng, Yan Wang,, Wenqiang Zhang

TL;DR
This paper introduces a self-supervised, end-to-end framework for unsupervised salient object detection that leverages top-down context and contrastive learning to improve semantic understanding and suppress non-salient objects.
Contribution
It proposes a novel self-supervised approach with a detail-boosting refiner and non-salient suppression, advancing unsupervised saliency detection without relying on noisy labels.
Findings
Achieves state-of-the-art performance among end-to-end unsupervised methods.
Effectively suppresses non-salient objects in diverse scenarios.
Demonstrates robustness and accuracy on benchmark datasets.
Abstract
Unsupervised salient object detection aims to detect salient objects without using supervision signals eliminating the tedious task of manually labeling salient objects. To improve training efficiency, end-to-end methods for USOD have been proposed as a promising alternative. However, current solutions rely heavily on noisy handcraft labels and fail to mine rich semantic information from deep features. In this paper, we propose a self-supervised end-to-end salient object detection framework via top-down context. Specifically, motivated by contrastive learning, we exploit the self-localization from the deepest feature to construct the location maps which are then leveraged to learn the most instructive segmentation guidance. Further considering the lack of detailed information in deepest features, we exploit the detail-boosting refiner module to enrich the location labels with details.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Tactile and Sensory Interactions
