Transformer models are gauge invariant: A mathematical connection   between AI and particle physics

Leo van Nierop

arXiv:2412.14543·cs.LG·December 20, 2024

Transformer models are gauge invariant: A mathematical connection between AI and particle physics

Leo van Nierop

PDF

Open Access

TL;DR

This paper reveals that transformer models inherently possess gauge invariance properties similar to those in particle physics, highlighting a fundamental symmetry in AI architectures.

Contribution

It establishes a mathematical connection between transformer models and gauge invariance in particle physics, showing that transformers exhibit similar symmetry properties.

Findings

01

Transformers display gauge invariance-like properties.

02

Default transformer representations partially retain gauge invariance.

03

The work bridges concepts between AI and particle physics.

Abstract

In particle physics, the fundamental forces are subject to symmetries called gauge invariance. It is a redundancy in the mathematical description of any physical system. In this article I will demonstrate that the transformer architecture exhibits the same properties, and show that the default representation of transformers has partially, but not fully removed the gauge invariance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Computational Physics and Python Applications · Advanced Data Processing Techniques