From Cognition to Computation: A Comparative Review of Human Attention and Transformer Architectures
Minglu Zhao, Dehong Xu, Tao Gao

TL;DR
This paper compares human attention and Transformer architectures, highlighting their differences and similarities to inform future AI development through interdisciplinary insights.
Contribution
It provides a comprehensive comparative review of human and artificial attention mechanisms from a cognitive-functional perspective.
Findings
Human and Transformer attention differ in capacity constraints
Attention pathways and intentional mechanisms vary between humans and models
The review identifies open research questions for AI advancement
Abstract
Attention is a cornerstone of human cognition that facilitates the efficient extraction of information in everyday life. Recent developments in artificial intelligence like the Transformer architecture also incorporate the idea of attention in model designs. However, despite the shared fundamental principle of selectively attending to information, human attention and the Transformer model display notable differences, particularly in their capacity constraints, attention pathways, and intentional mechanisms. Our review aims to provide a comparative analysis of these mechanisms from a cognitive-functional perspective, thereby shedding light on several open research questions. The exploration encourages interdisciplinary efforts to derive insights from human attention mechanisms in the pursuit of developing more generalized artificial intelligence.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Context-Aware Activity Recognition Systems · Gaze Tracking and Assistive Technology
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Softmax · Byte Pair Encoding · Layer Normalization · Label Smoothing · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Adam
