Attention as Binding: A Vector-Symbolic Perspective on Transformer Reasoning
Sahil Rajesh Dhayalkar

TL;DR
This paper presents a vector-symbolic perspective on transformer models, interpreting their internal mechanisms as implementing approximate Vector Symbolic Architectures (VSA), which enhances understanding and guides architectural improvements for reasoning tasks.
Contribution
It introduces a unified algebraic framework for transformer attention, relates it to symbolic reasoning, and proposes VSA-inspired architectural biases and metrics for improved interpretability and robustness.
Findings
Transformer internals relate to chain-of-thought reasoning.
VSA-inspired biases improve symbolic manipulation.
Metrics for measuring VSA-likeness are proposed.
Abstract
Transformer-based language models display impressive reasoning-like behavior, yet remain brittle on tasks that require stable symbolic manipulation. This paper develops a unified perspective on these phenomena by interpreting self-attention and residual streams as implementing an approximate Vector Symbolic Architecture (VSA). In this view, queries and keys define role spaces, values encode fillers, attention weights perform soft unbinding, and residual connections realize superposition of many bound structures. We use this algebraic lens to relate transformer internals to chain-of-thought traces, program-based reasoning, and memory-augmented tool use, and to explain characteristic failure modes such as variable confusion and inconsistency across logically related prompts. Building on this perspective, we propose VSA-inspired architectural biases, including explicit binding/unbinding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Ferroelectric and Negative Capacitance Devices
