End-to-End Semantic Video Transformer for Zero-Shot Action Recognition