Loading paper
Intra-Layer Recurrence in Transformers for Language Modeling | Tomesphere