TL;DR
This paper presents a novel deep CNN-based model for visual saliency detection that combines multiscale features and handcrafted low-level features, achieving state-of-the-art results on multiple benchmarks.
Contribution
Introduces a multiscale deep CNN architecture with a discriminative high-level feature for saliency detection and creates a large annotated dataset for evaluation.
Findings
Achieves state-of-the-art performance on public benchmarks.
Improves F-measure by over 6% on DUT-OMRON.
Reduces mean absolute error significantly on new dataset.
Abstract
Visual saliency is a fundamental problem in both cognitive and computational sciences, including computer vision. In this paper, we discover that a high-quality visual saliency model can be learned from multiscale features extracted using deep convolutional neural networks (CNNs), which have had many successes in visual recognition tasks. For learning such saliency models, we introduce a neural network architecture, which has fully connected layers on top of CNNs responsible for feature extraction at three different scales. The penultimate layer of our neural network has been confirmed to be a discriminative high-level feature vector for saliency detection, which we call deep contrast feature. To generate a more robust feature, we integrate handcrafted low-level features with our deep contrast feature. To promote further research and evaluation of visual saliency models, we also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
