Gravitational Models Explain Shifts on Human Visual Attention
Dario Zanca, Marco Gori, Stefano Melacci, Alessandra Rufa

TL;DR
This paper introduces a gravitational model for human visual attention shifts, which predicts attentional movements more accurately than traditional winner-take-all models by considering features as attractors.
Contribution
The paper proposes a novel gravitational model that describes attentional shifts without relying solely on a centralized saliency map, improving prediction accuracy.
Findings
The gravitational model outperforms winner-take-all in predicting attention shifts.
Quantitative results on large datasets validate the model's effectiveness.
The model suggests a decentralized approach to visual saliency.
Abstract
Visual attention refers to the human brain's ability to select relevant sensory information for preferential processing, improving performance in visual and cognitive tasks. It proceeds in two phases. One in which visual feature maps are acquired and processed in parallel. Another where the information from these maps is merged in order to select a single location to be attended for further and more complex computations and reasoning. Its computational description is challenging, especially if the temporal dynamics of the process are taken into account. Numerous methods to estimate saliency have been proposed in the last three decades. They achieve almost perfect performance in estimating saliency at the pixel level, but the way they generate shifts in visual attention fully depends on winner-take-all (WTA) circuitry. WTA is implemented} by the biological hardware in order to select a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
