Loading paper
On the Universality of Transformer Architectures; How Much Attention Is Enough? | Tomesphere