Testing Components of the Attention Schema Theory in Artificial Neural Networks
Kathryn T. Farrell, Kirsten Ziman, Michael S. A. Graziano

TL;DR
This study investigates how incorporating an attention schema into neural networks enhances their ability to interpret, predict, and cooperate based on attention states, providing insights into both artificial and biological attention mechanisms.
Contribution
It demonstrates that adding an attention schema improves agents' ability to understand and predict each other's attention states without increasing overall network complexity.
Findings
Agents with an attention schema better categorize others' attention states.
Attention schemas lead to more interpretable attention patterns.
Performance in joint prediction tasks improves with an attention schema.
Abstract
Growing evidence suggests that the brain uses an attention schema, or a simplified model of attention, to help control what it attends to. One proposed benefit of this model is to allow agents to model the attention states of other agents, and thus predict and interact with other agents. The effects of an attention schema may be examined in artificial agents. Although attention mechanisms in artificial agents are different from in biological brains, there may be some principles in common. In both cases, select features or representations are emphasized for better performance. Here, using neural networks with transformer attention mechanisms, we asked whether the addition of an attention schema affected the ability of agents to make judgements about and cooperate with each other. First, we found that an agent with an attention schema is better at categorizing the attention states of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCognitive Science and Mapping
MethodsSoftmax · Attention Is All You Need
