Loading paper
Accelerating Transformer Decoding via a Hybrid of Self-attention and Recurrent Neural Network | Tomesphere