Toward Manifest Relationality in Transformers via Symmetry Reduction

J. Fran\c{c}ois; L. Ravera

arXiv:2602.18948·cs.LG·February 24, 2026

Toward Manifest Relationality in Transformers via Symmetry Reduction

J. Fran\c{c}ois, L. Ravera

PDF

Open Access

TL;DR

This paper introduces a symmetry reduction framework for Transformer models that reformulates their components in terms of invariant relational quantities, reducing redundancy and providing a geometric perspective.

Contribution

It presents a novel symmetry reduction approach that reformulates Transformer representations and attention mechanisms using invariant relational structures, enhancing efficiency and interpretability.

Findings

01

Reduces parameter redundancy in Transformers.

02

Provides a geometric framework for analyzing optimization.

03

Operates directly on relational structures.

Abstract

Transformer models contain substantial internal redundancy arising from coordinate-dependent representations and continuous symmetries, in model space and in head space, respectively. While recent approaches address this by explicitly breaking symmetry, we propose a complementary framework based on symmetry reduction. We reformulate representations, attention mechanisms, and optimization dynamics in terms of invariant relational quantities, eliminating redundant degrees of freedom by construction. This perspective yields architectures that operate directly on relational structures, providing a principled geometric framework for reducing parameter redundancy and analyzing optimization.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Reservoir Computing · Generative Adversarial Networks and Image Synthesis · Advanced Memory and Neural Computing