Predicting Human Eye Fixations via an LSTM-based Saliency Attentive   Model

Marcella Cornia; Lorenzo Baraldi; Giuseppe Serra; Rita Cucchiara

arXiv:1611.09571·cs.CV·July 10, 2018

Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model

Marcella Cornia, Lorenzo Baraldi, Giuseppe Serra, Rita Cucchiara

PDF

2 Repos

TL;DR

This paper introduces a novel LSTM-based saliency model that uses neural attention mechanisms and learned priors to improve the accuracy of predicting human eye fixations, outperforming existing methods.

Contribution

The paper presents a new neural network architecture combining Convolutional LSTM and attention mechanisms for saliency prediction, including learned prior maps to address center bias.

Findings

01

Outperforms state-of-the-art saliency prediction models

02

Effectively incorporates neural attention for focus on salient regions

03

Learned prior maps improve fixation prediction accuracy

Abstract

Data-driven saliency has recently gained a lot of attention thanks to the use of Convolutional Neural Networks for predicting gaze fixations. In this paper we go beyond standard approaches to saliency prediction, in which gaze maps are computed with a feed-forward network, and present a novel model which can predict accurate saliency maps by incorporating neural attentive mechanisms. The core of our solution is a Convolutional LSTM that focuses on the most salient regions of the input image to iteratively refine the predicted saliency map. Additionally, to tackle the center bias typical of human eye fixations, our model can learn a set of prior maps generated with Gaussian functions. We show, through an extensive evaluation, that the proposed architecture outperforms the current state of the art on public saliency prediction datasets. We further study the contribution of each key…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory