Guiding Visual Attention in Deep Convolutional Neural Networks Based on Human Eye Movements
Leonard E. van Dyck, Sebastian J. Denzler, Walter R. Gruber

TL;DR
This study explores a data-driven method to guide deep neural networks' visual attention using human eye movement data, aiming to enhance biological plausibility and understanding of face detection.
Contribution
It introduces a novel approach to modify training data based on human eye tracking to influence neural network attention patterns during object recognition.
Findings
Non-human-like models focus on dissimilar image parts compared to humans.
Effects are category-specific, influenced by animacy and face presence.
Guided focus manipulation does not significantly increase human-likeness.
Abstract
Deep Convolutional Neural Networks (DCNNs) were originally inspired by principles of biological vision, have evolved into best current computational models of object recognition, and consequently indicate strong architectural and functional parallelism with the ventral visual pathway throughout comparisons with neuroimaging and neural time series data. As recent advances in deep learning seem to decrease this similarity, computational neuroscience is challenged to reverse-engineer the biological plausibility to obtain useful models. While previous studies have shown that biologically inspired architectures are able to amplify the human-likeness of the models, in this study, we investigate a purely data-driven approach. We use human eye tracking data to directly modify training examples and thereby guide the models' visual attention during object recognition in natural images either…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
