From Cognition to Computation: A Comparative Review of Human Attention   and Transformer Architectures

Minglu Zhao; Dehong Xu; Tao Gao

arXiv:2407.01548·q-bio.OT·July 3, 2024

From Cognition to Computation: A Comparative Review of Human Attention and Transformer Architectures

Minglu Zhao, Dehong Xu, Tao Gao

PDF

Open Access

TL;DR

This paper compares human attention and Transformer architectures, highlighting their differences and similarities to inform future AI development through interdisciplinary insights.

Contribution

It provides a comprehensive comparative review of human and artificial attention mechanisms from a cognitive-functional perspective.

Findings

01

Human and Transformer attention differ in capacity constraints

02

Attention pathways and intentional mechanisms vary between humans and models

03

The review identifies open research questions for AI advancement

Abstract

Attention is a cornerstone of human cognition that facilitates the efficient extraction of information in everyday life. Recent developments in artificial intelligence like the Transformer architecture also incorporate the idea of attention in model designs. However, despite the shared fundamental principle of selectively attending to information, human attention and the Transformer model display notable differences, particularly in their capacity constraints, attention pathways, and intentional mechanisms. Our review aims to provide a comparative analysis of these mechanisms from a cognitive-functional perspective, thereby shedding light on several open research questions. The exploration encourages interdisciplinary efforts to derive insights from human attention mechanisms in the pursuit of developing more generalized artificial intelligence.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEEG and Brain-Computer Interfaces · Context-Aware Activity Recognition Systems · Gaze Tracking and Assistive Technology

MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Softmax · Byte Pair Encoding · Layer Normalization · Label Smoothing · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Adam