Loading paper
The Recurrent Transformer: Greater Effective Depth and Efficient Decoding | Tomesphere