Loading paper
SSAN: Separable Self-Attention Network for Video Representation Learning | Tomesphere