Deep Gaze I: Boosting Saliency Prediction with Feature Maps Trained on ImageNet
Matthias K\"ummerer, Lucas Theis, Matthias Bethge

TL;DR
This paper introduces a novel saliency prediction model that leverages pretrained deep neural networks for object recognition, significantly outperforming existing models and providing new insights into fixation behavior.
Contribution
It reuses pretrained deep neural networks for object recognition to improve saliency prediction, achieving state-of-the-art results and offering insights into fixation mechanisms.
Findings
Outperforms all state-of-the-art models on MIT Saliency Benchmark
Reuses pretrained networks to enhance saliency prediction
Provides new insights into fixation psychophysics
Abstract
Recent results suggest that state-of-the-art saliency models perform far from optimal in predicting fixations. This lack in performance has been attributed to an inability to model the influence of high-level image features such as objects. Recent seminal advances in applying deep neural networks to tasks like object recognition suggests that they are able to capture this kind of structure. However, the enormous amount of training data necessary to train these networks makes them difficult to apply directly to saliency prediction. We present a novel way of reusing existing neural networks that have been pretrained on the task of object recognition in models of fixation prediction. Using the well-known network of Krizhevsky et al. (2012), we come up with a new saliency model that significantly outperforms all state-of-the-art models on the MIT Saliency Benchmark. We show that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Face Recognition and Perception · Aesthetic Perception and Analysis
