Disentangling top-down vs. bottom-up and low-level vs. high-level influences on eye movements over time
Heiko H. Sch\"utt, Lars O. M. Rothkegel, Hans A. Trukenbrod, Ralf, Engbert, Felix A. Wichmann

TL;DR
This study analyzes how top-down, bottom-up, low-level, and high-level factors influence eye movements over time, revealing a phased exploration process and the dominance of high-level and top-down influences in scene viewing.
Contribution
The paper introduces a novel temporal analysis of eye movement influences using combined saliency models and large datasets, clarifying the roles of different factors over time.
Findings
Initial saccades are driven mainly by low-level bottom-up factors.
High-level features significantly influence fixation locations after 200 ms.
Top-down processes dominate fixation control during search tasks.
Abstract
Bottom-up and top-down, as well as low-level and high-level factors influence where we fixate when viewing natural scenes. However, the importance of each of these factors and how they interact remains a matter of debate. Here, we disentangle these factors by analysing their influence over time. For this purpose we develop a saliency model which is based on the internal representation of a recent early spatial vision model to measure the low-level bottom-up factor. To measure the influence of high-level bottom-up features, we use a recent DNN-based saliency model. To account for top-down influences, we evaluate the models on two large datasets with different tasks: first, a memorisation task and, second, a search task. Our results lend support to a separation of visual scene exploration into three phases: The first saccade, an initial guided exploration characterised by a gradual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Visual perception and processing mechanisms · Advanced Image and Video Retrieval Techniques
