Linear Self-Attention Approximation via Trainable Feedforward Kernel