Attention Schema in Neural Agents
Dianbo Liu, Samuele Bolotta, He Zhu, Yoshua Bengio, Guillaume Dumas

TL;DR
This paper explores the concept of an Attention Schema in neural agents, proposing that modeling attention as a higher-order internal process can improve social coordination and performance in multi-agent systems.
Contribution
It introduces the idea of an Attention Schema in neural agents and demonstrates that implementing it as a recurrent control enhances social intelligence and coordination.
Findings
Agents with an Attention Schema perform better in multi-agent tasks.
Implementing AS as a recurrent control improves agent performance.
Equipping agents with attention models enhances social intelligence.
Abstract
Attention has become a common ingredient in deep learning architectures. It adds a dynamical selection of information on top of the static selection of information supported by weights. In the same way, we can imagine a higher-order informational filter built on top of attention: an Attention Schema (AS), namely, a descriptive and predictive model of attention. In cognitive neuroscience, Attention Schema Theory (AST) supports this idea of distinguishing attention from AS. A strong prediction of this theory is that an agent can use its own AS to also infer the states of other agents' attention and consequently enhance coordination with other agents. As such, multi-agent reinforcement learning would be an ideal setting to experimentally test the validity of AST. We explore different ways in which attention and AS interact with each other. Our preliminary results indicate that agents that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural dynamics and brain function · Advanced Memory and Neural Computing · EEG and Brain-Computer Interfaces
MethodsTest
