Shallow and Deep Convolutional Networks for Saliency Prediction

Junting Pan; Kevin McGuinness; Elisa Sayrol; Noel O'Connor; Xavier; Giro-i-Nieto

arXiv:1603.00845·cs.CV·March 3, 2016·132 cites

Shallow and Deep Convolutional Networks for Saliency Prediction

Junting Pan, Kevin McGuinness, Elisa Sayrol, Noel O'Connor, Xavier, Giro-i-Nieto

PDF

Open Access 1 Repo

TL;DR

This paper introduces the first end-to-end convolutional neural networks for saliency prediction, demonstrating that data-driven deep learning approaches can outperform traditional hand-crafted methods in speed and accuracy.

Contribution

It presents two novel CNN architectures for saliency prediction, one shallow trained from scratch and one deeper with transfer learning, pioneering end-to-end deep learning in this domain.

Findings

01

Deep CNNs achieve high accuracy in saliency prediction

02

End-to-end training outperforms traditional methods

03

Deep models are computationally efficient

Abstract

The prediction of salient areas in images has been traditionally addressed with hand-crafted features based on neuroscience principles. This paper, however, addresses the problem with a completely data-driven approach by training a convolutional neural network (convnet). The learning process is formulated as a minimization of a loss function that measures the Euclidean distance of the predicted saliency map with the provided ground truth. The recent publication of large datasets of saliency prediction has provided enough data to train end-to-end architectures that are both fast and accurate. Two designs are proposed: a shallow convnet trained from scratch, and a another deeper solution whose first three layers are adapted from another network trained for classification. To the authors knowledge, these are the first end-to-end CNNs trained and tested for the purpose of saliency…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

imatge-upc/saliency-2016-cvpr
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques