Loading paper
Reducing the Transformer Architecture to a Minimum | Tomesphere