Attention as Binding: A Vector-Symbolic Perspective on Transformer Reasoning

Sahil Rajesh Dhayalkar

arXiv:2512.14709·cs.AI·December 18, 2025

Attention as Binding: A Vector-Symbolic Perspective on Transformer Reasoning

Sahil Rajesh Dhayalkar

PDF

Open Access 1 Video

TL;DR

This paper presents a vector-symbolic perspective on transformer models, interpreting their internal mechanisms as implementing approximate Vector Symbolic Architectures (VSA), which enhances understanding and guides architectural improvements for reasoning tasks.

Contribution

It introduces a unified algebraic framework for transformer attention, relates it to symbolic reasoning, and proposes VSA-inspired architectural biases and metrics for improved interpretability and robustness.

Findings

01

Transformer internals relate to chain-of-thought reasoning.

02

VSA-inspired biases improve symbolic manipulation.

03

Metrics for measuring VSA-likeness are proposed.

Abstract

Transformer-based language models display impressive reasoning-like behavior, yet remain brittle on tasks that require stable symbolic manipulation. This paper develops a unified perspective on these phenomena by interpreting self-attention and residual streams as implementing an approximate Vector Symbolic Architecture (VSA). In this view, queries and keys define role spaces, values encode fillers, attention weights perform soft unbinding, and residual connections realize superposition of many bound structures. We use this algebraic lens to relate transformer internals to chain-of-thought traces, program-based reasoning, and memory-augmented tool use, and to explain characteristic failure modes such as variable confusion and inconsistency across logically related prompts. Building on this perspective, we propose VSA-inspired architectural biases, including explicit binding/unbinding…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Attention as Binding: A Vector-Symbolic Perspective on Transformer Reasoning· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Ferroelectric and Negative Capacitance Devices