Loading paper
cosFormer: Rethinking Softmax in Attention | Tomesphere