# Visual Saliency Prediction Using a Mixture of Deep Neural Networks

**Authors:** Samuel Dodge, Lina Karam

arXiv: 1702.00372 · 2018-07-04

## TL;DR

This paper introduces a novel deep learning model for visual saliency prediction that combines local and global scene information through a mixture of experts, resulting in improved accuracy over traditional models.

## Contribution

The paper proposes a mixture of deep neural networks with a gating mechanism that incorporates global scene semantics for enhanced saliency prediction.

## Key findings

- Improved saliency prediction performance over non-mixture models.
- Effective integration of global scene information via a gating network.
- End-to-end training of expert and gating networks.

## Abstract

Visual saliency models have recently begun to incorporate deep learning to achieve predictive capacity much greater than previous unsupervised methods. However, most existing models predict saliency using local mechanisms limited to the receptive field of the network. We propose a model that incorporates global scene semantic information in addition to local information gathered by a convolutional neural network. Our model is formulated as a mixture of experts. Each expert network is trained to predict saliency for a set of closely related images. The final saliency map is computed as a weighted mixture of the expert networks' output, with weights determined by a separate gating network. This gating network is guided by global scene information to predict weights. The expert networks and the gating network are trained simultaneously in an end-to-end manner. We show that our mixture formulation leads to improvement in performance over an otherwise identical non-mixture model that does not incorporate global scene information.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1702.00372/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/1702.00372/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/1702.00372/full.md

---
Source: https://tomesphere.com/paper/1702.00372