Visual Saliency Detection Based on Multiscale Deep CNN Features

Guanbin Li; Yizhou Yu

arXiv:1609.02077·cs.CV·November 3, 2016

Visual Saliency Detection Based on Multiscale Deep CNN Features

Guanbin Li, Yizhou Yu

PDF

2 Repos

TL;DR

This paper presents a novel deep CNN-based model for visual saliency detection that combines multiscale features and handcrafted low-level features, achieving state-of-the-art results on multiple benchmarks.

Contribution

Introduces a multiscale deep CNN architecture with a discriminative high-level feature for saliency detection and creates a large annotated dataset for evaluation.

Findings

01

Achieves state-of-the-art performance on public benchmarks.

02

Improves F-measure by over 6% on DUT-OMRON.

03

Reduces mean absolute error significantly on new dataset.

Abstract

Visual saliency is a fundamental problem in both cognitive and computational sciences, including computer vision. In this paper, we discover that a high-quality visual saliency model can be learned from multiscale features extracted using deep convolutional neural networks (CNNs), which have had many successes in visual recognition tasks. For learning such saliency models, we introduce a neural network architecture, which has fully connected layers on top of CNNs responsible for feature extraction at three different scales. The penultimate layer of our neural network has been confirmed to be a discriminative high-level feature vector for saliency detection, which we call deep contrast feature. To generate a more robust feature, we integrate handcrafted low-level features with our deep contrast feature. To promote further research and evaluation of visual saliency models, we also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.