Guiding Visual Attention in Deep Convolutional Neural Networks Based on   Human Eye Movements

Leonard E. van Dyck; Sebastian J. Denzler; Walter R. Gruber

arXiv:2206.10587·cs.CV·September 7, 2022

Guiding Visual Attention in Deep Convolutional Neural Networks Based on Human Eye Movements

Leonard E. van Dyck, Sebastian J. Denzler, Walter R. Gruber

PDF

TL;DR

This study explores a data-driven method to guide deep neural networks' visual attention using human eye movement data, aiming to enhance biological plausibility and understanding of face detection.

Contribution

It introduces a novel approach to modify training data based on human eye tracking to influence neural network attention patterns during object recognition.

Findings

01

Non-human-like models focus on dissimilar image parts compared to humans.

02

Effects are category-specific, influenced by animacy and face presence.

03

Guided focus manipulation does not significantly increase human-likeness.

Abstract

Deep Convolutional Neural Networks (DCNNs) were originally inspired by principles of biological vision, have evolved into best current computational models of object recognition, and consequently indicate strong architectural and functional parallelism with the ventral visual pathway throughout comparisons with neuroimaging and neural time series data. As recent advances in deep learning seem to decrease this similarity, computational neuroscience is challenged to reverse-engineer the biological plausibility to obtain useful models. While previous studies have shown that biologically inspired architectures are able to amplify the human-likeness of the models, in this study, we investigate a purely data-driven approach. We use human eye tracking data to directly modify training examples and thereby guide the models' visual attention during object recognition in natural images either…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.