Multi-modal Transformer for Video Retrieval