VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding