Provably Overwhelming Transformer Models with Designed Inputs

Lev Stambler; Seyed Sajjad Nezhadi; Matthew Coudron

arXiv:2502.06038·cs.LG·May 27, 2025

Provably Overwhelming Transformer Models with Designed Inputs

Lev Stambler, Seyed Sajjad Nezhadi, Matthew Coudron

PDF

Open Access

TL;DR

This paper presents an algorithm that can prove when a trained transformer model's output becomes insensitive to certain inputs, demonstrating a form of model overwhelm and providing theoretical guarantees about transformer behavior.

Contribution

The authors introduce a novel algorithm to formally verify when a transformer model is overwhelmed by specific inputs, advancing understanding of model insensitivity and over-squashing.

Findings

01

Algorithm can generate proofs of model overwhelm in polynomial time

02

Proves a strong form of over-squashing in transformers

03

Empirical testing on a single-layer transformer confirms effectiveness

Abstract

We develop an algorithm which, given a trained transformer model $M$ as input, as well as a string of tokens $s$ of length $n_{f i x}$ and an integer $n_{f r ee}$ , can generate a mathematical proof that $M$ is ``overwhelmed'' by $s$ , in time and space $O (n_{f i x}^{2} + n_{f r ee}^{3})$ . We say that $M$ is ``overwhelmed'' by $s$ when the output of the model evaluated on this string plus any additional string $t$ , $M (s + t)$ , is completely insensitive to the value of the string $t$ whenever length( $t$ ) $\leq n_{f r ee}$ . Along the way, we prove a particularly strong worst-case form of ``over-squashing'', which we use to bound the model's behavior. Our technique uses computer-aided proofs to establish this type of operationally relevant guarantee about transformer models. We empirically test our algorithm on a single layer transformer complete with an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Power Quality and Harmonics · Power System Optimization and Stability

MethodsSoftmax · Attention Is All You Need