The Costs and Benefits of Goal-Directed Attention in Deep Convolutional Neural Networks
Xiaoliang Luo, Brett D. Roads, Bradley C. Love

TL;DR
This paper introduces a goal-directed attention mechanism for deep convolutional neural networks that improves task performance by modulating mid-level features, balancing benefits and costs in visual recognition tasks.
Contribution
We developed a novel, biologically inspired attention layer for DCNNs that enhances task-specific focus and outperforms traditional transfer learning methods.
Findings
Increased attention improves hit rates and sensitivity.
Moderate attention increases false alarms and bias.
Mid-level attention outperforms retraining the final layer.
Abstract
People deploy top-down, goal-directed attention to accomplish tasks, such as finding lost keys. By tuning the visual system to relevant information sources, object recognition can become more efficient (a benefit) and more biased toward the target (a potential cost). Motivated by selective attention in categorisation models, we developed a goal-directed attention mechanism that can process naturalistic (photographic) stimuli. Our attention mechanism can be incorporated into any existing deep convolutional neural network (DCNNs). The processing stages in DCNNs have been related to ventral visual stream. In that light, our attentional mechanism incorporates top-down influences from prefrontal cortex (PFC) to support goal-directed behaviour. Akin to how attention weights in categorisation models warp representational spaces, we introduce a layer of attention weights to the mid-level of a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Face Recognition and Perception · Neural and Behavioral Psychology Studies
MethodsDiffusion-Convolutional Neural Networks
