Supersaliency: A Novel Pipeline for Predicting Smooth Pursuit-Based Attention Improves Generalizability of Video Saliency
Mikhail Startsev, Michael Dorr

TL;DR
This paper introduces a new approach to video saliency prediction by explicitly modeling smooth pursuit eye movements, called supersaliency, which enhances the generalizability and accuracy of attention prediction models across diverse datasets.
Contribution
The study presents a novel pipeline that distinguishes supersaliency from fixations, improving model performance and generalization in dynamic scene attention prediction.
Findings
Models outperform state-of-the-art in supersaliency and traditional saliency tasks.
Supersaliency model shows superior generalization across datasets.
Explicit modeling of smooth pursuit improves attention prediction accuracy.
Abstract
Predicting attention is a popular topic at the intersection of human and computer vision. However, even though most of the available video saliency data sets and models claim to target human observers' fixations, they fail to differentiate them from smooth pursuit (SP), a major eye movement type that is unique to perception of dynamic scenes. In this work, we highlight the importance of SP and its prediction (which we call supersaliency, due to greater selectivity compared to fixations), and aim to make its distinction from fixations explicit for computational models. To this end, we (i) use algorithmic and manual annotations of SP and fixations for two well-established video saliency data sets, (ii) train Slicing Convolutional Neural Networks for saliency prediction on either fixation- or SP-salient locations, and (iii) evaluate our and 26 publicly available dynamic saliency models on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Visual perception and processing mechanisms · Face Recognition and Perception
