Self-Attention-Based Contextual Modulation Improves Neural System Identification
Isaac Lin, Tianye Wang, Shang Gao, Shiming Tang, Tai Sing Lee

TL;DR
This paper demonstrates that self-attention mechanisms enhance neural response predictions in visual cortex models, outperforming traditional CNNs by better capturing contextual modulation and top feature preferences.
Contribution
It introduces self-attention as a novel mechanism to improve modeling of contextual effects in neural systems, showing its effectiveness over standard CNN approaches.
Findings
Self-attention improves tuning curve correlation.
Self-attention enhances peak tuning accuracy.
Local receptive fields are crucial for overall tuning, surround information is key for peak tuning.
Abstract
Convolutional neural networks (CNNs) have been shown to be state-of-the-art models for visual cortical neurons. Cortical neurons in the primary visual cortex are sensitive to contextual information mediated by extensive horizontal and feedback connections. Standard CNNs integrate global contextual information to model contextual modulation via two mechanisms: successive convolutions and a fully connected readout layer. In this paper, we find that self-attention (SA), an implementation of non-local network mechanisms, can improve neural response predictions over parameter-matched CNNs in two key metrics: tuning curve correlation and peak tuning. We introduce peak tuning as a metric to evaluate a model's ability to capture a neuron's top feature preference. We factorize networks to assess each context mechanism, revealing that information in the local receptive field is most important for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
