Saliency Detection via Combining Region-Level and Pixel-Level   Predictions with CNNs

Youbao Tang; Xiangqian Wu

arXiv:1608.05186·cs.CV·August 19, 2016

Saliency Detection via Combining Region-Level and Pixel-Level Predictions with CNNs

Youbao Tang, Xiangqian Wu

PDF

Open Access

TL;DR

This paper introduces a novel saliency detection approach that combines region-level and pixel-level predictions using CNNs, achieving superior performance on benchmark datasets.

Contribution

It proposes a unified CNN framework that integrates region-based and pixel-based saliency estimation for improved accuracy.

Findings

01

Outperforms state-of-the-art methods on four benchmark datasets.

02

Effectively combines region-level and pixel-level saliency predictions.

03

Joint learning of CNN components enhances detection accuracy.

Abstract

This paper proposes a novel saliency detection method by combining region-level saliency estimation and pixel-level saliency prediction with CNNs (denoted as CRPSD). For pixel-level saliency prediction, a fully convolutional neural network (called pixel-level CNN) is constructed by modifying the VGGNet architecture to perform multi-scale feature learning, based on which an image-to-image prediction is conducted to accomplish the pixel-level saliency detection. For region-level saliency estimation, an adaptive superpixel based region generation technique is first designed to partition an image into regions, based on which the region-level saliency is estimated by using a CNN model (called region-level CNN). The pixel-level and region-level saliencies are fused to form the final salient map by using another CNN (called fusion CNN). And the pixel-level CNN and fusion CNN are jointly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications