TL;DR
This paper introduces GASP, a neural model that integrates social cues like gaze and affect to improve saliency prediction, demonstrating that social information enhances the accuracy of attention modeling.
Contribution
The paper presents a novel two-stage neural model that incorporates social cues into saliency prediction, with new fusion techniques and sub-networks for dynamic attention guidance.
Findings
Gaze and affective cues improve saliency prediction accuracy by at least 5%.
Fusion methods outperform non-fusion approaches for static integration.
Affective representations significantly enhance the model's performance.
Abstract
Saliency prediction refers to the computational task of modeling overt attention. Social cues greatly influence our attention, consequently altering our eye movements and behavior. To emphasize the efficacy of such features, we present a neural model for integrating social cues and weighting their influences. Our model consists of two stages. During the first stage, we detect two social cues by following gaze, estimating gaze direction, and recognizing affect. These features are then transformed into spatiotemporal maps through image processing operations. The transformed representations are propagated to the second stage (GASP) where we explore various techniques of late fusion for integrating social cues and introduce two sub-networks for directing attention to relevant stimuli. Our experiments indicate that fusion approaches achieve better results for static integration methods,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTanh Activation · Long Short-Term Memory · Dense Connections · *Communicated@Fast*How Do I Communicate to Expedia? · Sigmoid Activation · Average Pooling · Squeeze-and-Excitation Block · fast speak--How do I Speak to someone at Expedia? · Gated Linear Unit · Convolution
