Saliency Detection via Combining Region-Level and Pixel-Level Predictions with CNNs
Youbao Tang, Xiangqian Wu

TL;DR
This paper introduces a novel saliency detection approach that combines region-level and pixel-level predictions using CNNs, achieving superior performance on benchmark datasets.
Contribution
It proposes a unified CNN framework that integrates region-based and pixel-based saliency estimation for improved accuracy.
Findings
Outperforms state-of-the-art methods on four benchmark datasets.
Effectively combines region-level and pixel-level saliency predictions.
Joint learning of CNN components enhances detection accuracy.
Abstract
This paper proposes a novel saliency detection method by combining region-level saliency estimation and pixel-level saliency prediction with CNNs (denoted as CRPSD). For pixel-level saliency prediction, a fully convolutional neural network (called pixel-level CNN) is constructed by modifying the VGGNet architecture to perform multi-scale feature learning, based on which an image-to-image prediction is conducted to accomplish the pixel-level saliency detection. For region-level saliency estimation, an adaptive superpixel based region generation technique is first designed to partition an image into regions, based on which the region-level saliency is estimated by using a CNN model (called region-level CNN). The pixel-level and region-level saliencies are fused to form the final salient map by using another CNN (called fusion CNN). And the pixel-level CNN and fusion CNN are jointly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications
