Loading paper
Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks | Tomesphere